Research Article
Revised

Implementation of Chernobyl disaster optimizer based feature selection approach to predict software defects

[version 2; peer review: 2 approved, 1 not approved]
PUBLISHED 17 Dec 2024

This article is included in the Kalinga Institute of Industrial Technology (KIIT) collection.

Abstract

Background

Software Defect Prediction (SDP) enables developers to identify undetected faults in the early stages of the software development process. However, SDP faces the challenge of high dimensionality. Feature selection (FS) selects the finest features while carefully discarding others. Several meta-heuristic algorithms, like Genetic Algorithm, Particle Swarm Optimization, Differential Evolution, and Ant Colony Optimization, have been used to develop defect prediction models. However, these models have drawbacks such as high cost, entrapment in local optima, a low convergence rate, and extensive parameter tuning. This study proposes a novel FS technique (FSCOA) based on the Chernobyl Disaster Optimizer (CDO). The proposed approach aims to select the best features for a prediction model while minimizing errors.

Methods

The proposed FSCOA was investigated on twelve public NASA software datasets from the PROMISE archive using Decision Tree, K-Nearest Neighbor, Naive Bayes, and Quadratic Discriminant Analysis classifiers. Furthermore, the accuracy of the proposed FSCOA method was compared with that of existing FS techniques, namely FSDE, FSPSO, FSACO, and FSGA. The statistical significance of the proposed approach was verified using the Friedman and Holm tests.

Results

The experiment indicated that the proposed FSCOA approach improved accuracy in the majority of instances and achieved an average rank of 1.75 among the studied FS approaches under the Friedman test. Furthermore, the Holm test showed that the p-value was lower than or equal to α/(A−i), except for the FSCOA–FSGA and FSCOA–FSACO pairs.

Conclusion

The results illustrated the superiority of the proposed FSCOA approach over existing FS techniques, with higher accuracy in almost all cases. Its advantages include the ability to handle convoluted, high-dimensional datasets without getting trapped in local optima, and a faster convergence rate. These advantages enable the proposed FSCOA method to overcome the challenges of the other studied FS techniques.

Keywords

Software Defect Prediction; Feature Selection; Wrapper approach; Chernobyl Disaster Optimizer; Optimization

Revised Amendments from Version 1

The reviewers' suggestions have been incorporated into the revised manuscript. There have been several new additions in addition to textual changes. Figure 1 has been updated. Once more, to preserve high quality, all of the figures are provided in PNG format. There are two new tables in the updated version. A summary of the cited literature on software defect prediction is presented in Table 1. Likewise, Table 2 provides useful information on the datasets that were employed in this investigation. Tables have since been renumbered. The updated DOI has been indicated in the paper's data availability section, and the new data has been added to the Figshare data repository.

See the authors' detailed response to the review by Francis Palma
See the authors' detailed response to the review by Shabib Aftab
See the authors' detailed response to the review by Ahmed Abdu

Introduction

In today’s scenario, humankind needs good-quality and reliable software to help perform daily tasks without spending excessive time and effort. Owing to this immense demand for exceptional and dependable software, conducting a rigorous investigation of software under development is crucial. However, the complexity of software increases with every passing day, making the overall software development work very challenging.1,2 A fault in software can significantly damage its quality and reliability, leading to more frequent maintenance activities. This can result in higher operational costs for the software, ultimately leading to user dissatisfaction. A software fault can be characterized as the disparity between the actual and expected behaviors of the software. Software testing empowers developers to identify and repair faults. However, conventional testing approaches are costly and time-consuming. Hence, it is imperative to detect faults in a software module during the early stages of development.3

Software Defect Prediction (SDP) enables developers to expose deficiencies in software components in the early stages of development by employing data analysis and machine learning (ML) approaches. An effective SDP mechanism can lead to the systematic and profitable advancement of high-quality and reliable software products without defects.4 Researchers have suggested several ML-based SDP approaches5–8 for effectively predicting defects. These methods analyse past data from different stages of development, such as testing data and debugging records, to derive any pattern or trend that can detect potential defects. The most widely employed ML methods in SDP are DT,9 SVM,10 neural networks,11 logistic regression,12 and NB.13 However, these approaches face several challenges, high dimensionality being one of them.

Feature Selection (FS)14 is a potent mechanism that can be employed to overcome the issue of high dimensionality. FS allows developers to select only relevant features and carefully discard insignificant ones. In SDP, FS is a vital step that enables developers to choose the best set of features, which can significantly enhance the predictive accuracy of a defect prediction model. Applying FS approaches is essential when dealing with datasets of high dimensionality. Several FS approaches have been implemented in SDP. FS techniques are broadly classified into three categories: filter techniques,15 wrapper techniques,16 and embedded techniques.17,18 Filter-based FS techniques are independent of any training strategy and apply statistical properties to identify the best traits. In contrast, wrapper-based FS procedures select the best characteristics based on the classification accuracy of the prediction model. Embedded FS techniques combine feature selection with model training. Existing literature shows that researchers have mainly applied evolution-based algorithms19 and swarm-based algorithms20 for FS purposes.
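To make the wrapper idea concrete, the minimal sketch below scores a candidate feature subset by the cross-validated accuracy of a classifier trained only on those features. The data, classifier choice, and function name are illustrative assumptions, not the exact procedure used later in this paper.

```python
# Minimal sketch of wrapper-based feature-subset scoring (illustrative only).
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

def score_subset(X, y, mask):
    """Score a binary feature mask by mean cross-validated accuracy."""
    if not mask.any():                     # an empty subset cannot be scored
        return 0.0
    clf = DecisionTreeClassifier(random_state=0)
    return cross_val_score(clf, X[:, mask], y, cv=5, scoring="accuracy").mean()

# Usage: compare a random subset against the full feature set.
rng = np.random.default_rng(0)
X, y = rng.random((200, 20)), rng.integers(0, 2, 200)
subset = rng.random(20) > 0.5
print(score_subset(X, y, subset), score_subset(X, y, np.ones(20, dtype=bool)))
```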

This exploration aims to boost the classification accuracy of a defect prediction model while minimizing errors. For this purpose, this study considers some of the widely used meta-heuristic algorithms, namely the Genetic Algorithm (GA),21 Particle Swarm Optimization (PSO),22,23 Differential Evolution (DE),24 and Ant Colony Optimization (ACO).25 Although GA has been a proven FS approach,26 it is costly because it computes the optimal features using genetic operators such as selection, crossover, and mutation over a set of generations. The PSO-based FS approach27 aims to find the optimal traits by emulating the movement of particles probing a search space with several dimensions. The algorithm adjusts the location and velocity of each particle by considering individual and group knowledge. However, the PSO-based FS approach sometimes results in a local optima trap and a lower convergence rate. DE-based FS techniques28 compute optimal characteristics by employing operators such as mutation, crossover, and selection on a population of potential solutions over several iterations. However, these approaches require considerable parameter tuning, making them a tedious choice among researchers. ACO-based FS methods29 determine the best characteristics in a search space by mimicking the foraging behavior of ants. However, these methods suffer from a slow convergence speed and low accuracy, especially on large datasets.

The limitations of the aforementioned FS approaches motivated us to propose a novel FS approach (FSCOA) inspired by the Chernobyl Disaster Optimizer (CDO).30 CDO mimics the process of nuclear radiation, in which alpha, beta, and gamma fragments propagate and attack humans after an explosion. These radiations fly at a very high speed from a high-pressure point (the point of explosion) to a low-pressure point (the position of the individual). The proposed algorithm comprises an initial population of candidate solutions. Furthermore, it computes the gradient descent factor (GDF) for the alpha, beta, and gamma fragments as they attack humans. Finally, the optimal solution is achieved by calculating the average of the GDF values over several iterations. The primary objective of the proposed FSCOA approach is to select the most informative features to produce a precise prediction model. The proposed algorithm has advantages such as its ability to deal with convoluted, high-dimensional datasets without getting trapped in local optima, which can be an issue in alternative FS procedures. The primary contributions of this study are as follows.

  • (i) To implement a novel FS technique, namely FSCOA, by applying the CDO,30 a metaheuristic algorithm.

  • (ii) To assess the performance of the proposed FSCOA-based fault prediction model on four different classification algorithms (NB, QDA, DT, and KNN) using 12 benchmark NASA software defect datasets.

  • (iii) To compare the performance of the proposed FSCOA approach with several baseline FS approaches, such as FSGA, FSPSO, FSDE, and FSACO.

  • (iv) To validate the statistical significance of the proposed FSCOA approach using the Friedman and Holm tests.

The experimental outcome shows that the proposed FSCOA was better than the other FS approaches examined in most situations, making it the best-performing FS technique for selecting the best array of features.

The remainder of this paper is organized as follows. The Related Works section discusses the existing literature on FS approaches. The next section, Feature Selection based on Chernobyl Disaster Optimizer Algorithm, elaborates on the proposed FSCOA approach and the detailed methodology used in this study. The Result Analysis section presents the empirical findings and interpretations. The Statistical Analysis section outlines the statistical analysis. The Threats to Validity section discusses the risks to the validity of the proposed work. Finally, the Conclusion section presents the conclusions and the scope for future work.

Related works

Defect prediction in software modules plays a critical role in creating high-quality and reliable software. SDP permits developers to detect and debug defects in software modules during the early stages of the software advancement process. Unfortunately, conventional SDP processes face several threats, including the curse of dimensionality. The curse of dimensionality refers to the presence of many attributes in a dataset. Many of these attributes do not convey any compelling knowledge and are hence treated as noise. Feature selection (FS) is a potent tool for tackling the challenge of the curse of dimensionality. FS allows developers to establish the best possible set of traits that can enhance the model’s predictive accuracy by discarding irrelevant traits. However, it is imperative to observe that conventional FS procedures are expensive and time-consuming.31 Recently, the application of ML to SDP has gained considerable traction, and several ML-based SDP approaches have been proposed. This section describes some of these studies as follows.

Das et al.32 proposed a novel FS technique called FSGJO based on the Golden Jackal Optimization (GJO) algorithm. The proposed FSGJO technique was employed on four classifiers, namely, KNN, DT, NB, and QDA, using 12 SDP datasets from the PROMISE repository. The authors compared the efficacy of the recommended FSGJO technique with alternative FS techniques, namely, FSDE, FSPSO, FSACO, and FSGA. Based on their experimental findings, the authors observed that the proposed FSGJO technique enhanced the prognostic performance of the model. It was also noted that the prospective FSGJO method was exceptional compared to other studied FS techniques in selecting the optimal set of characteristics. However, the authors mentioned that the proposed FSGJO technique needs its parameters to be tuned.

Khalid et al.33 inspected numerous existing ML methods and optimized ML procedures on three publicly accessible NASA datasets. The authors applied PSO and ensemble approaches and scrutinized the results. The experimental findings revealed that the SVM and optimized SVM outperformed the other models in terms of accuracy. However, this study was conducted using a limited number of datasets. Again, the experimental findings cannot be generalized because additional optimization algorithms were not explored.

Kumar and Das34 applied GA-based FS to supervised learning classifiers such as KNN, DT, and NB. Twelve NASA datasets from the PROMISE archive were used, and the performance of the proposed model was assessed using accuracy and failure rate as performance metrics. Based on their experimental results, the authors asserted that the suggested FSGA technique improved the behaviour of the defect forecast model compared with the scenario in which no FS was performed. However, in this study, the FS approach used only the GA; the effects of alternative optimization methodologies were not investigated.

Thirumoorthy et al.35 suggested a hybrid SDP method based on the TOPSIS and hybrid Rao algorithms (THRO) to uncover the finest traits. The authors used three benchmark NASA SDP datasets to implement their proposed THRO-based FS algorithm on SVM and NB classifiers. The impact of the proposed algorithm was assessed against six metaheuristic FS techniques. The authors noted that the proposed THRO-based FS algorithm enhanced the model’s classification performance and outperformed the other studied FS approaches. However, they also pointed out that this enhanced performance came at the price of increased computational cost.

Batool et al.36 offered a comprehensive and well-organized analysis of the extant literature that employed DM, ML, and DL, among other techniques, for fault prediction. The endeavour was motivated by the need to find answers to research problems stated in the evaluation that might not have been addressed in the works evaluated or that called for a different viewpoint. The authors claimed that SDP frequently employs DM and ML techniques, such as DT, NB, SVM, NN, ET, and EA. Although they are used less frequently, researchers have also used DL approaches such as CNN, MLP, LSTM, and DNN to predict software errors. The authors emphasized the need for larger datasets and the importance of concentrating on using the same methods with combinations of different datasets.

An SDP architecture based on nested stacking and heterogeneous FS was proposed by Chen et al.37 The two main objectives of this study were to increase SDP accuracy and optimize software testing resource allocation. The method is divided into three steps: feature selection, model creation with a nested-stacking classifier, and evaluation of the predictive behaviour of the model. For the experiments, two datasets were used: Kamei and PROMISE. The investigation included both within-project and large-scale cross-project defect prediction (CPDP). The model’s behaviour was illustrated using the AUC and F1-score evaluation metrics. The initial results showed that for the two sets of software failure datasets, the proposed framework performed better in terms of classification than the baseline models. However, the authors pointed out that nested stacking is inefficient and that the optimal combination of the baseline models had to be determined via complex experiments.

Arora and Kaur38 suggested a heterogeneous fault prediction (HFP) model to develop an effective forecasting model utilizing supervised training approaches. The authors completed the FS in two phases: they began by selecting features based on their importance, and then removed the shared features from the datasets. An integrated approach was used to select the best characteristics, with Random Forest Importance (RFI) used for the FS. Following the suggestion made by Gao et al.,39 the authors selected the top 15% of attributes throughout the FS phase. The proposed framework was applied to two open-source projects, MySQL and Linux, with the supervised ML classifiers SVM, NB, RF, AdaBoost, DT, and LR. The behaviour of the planned model was graded using the Area under the ROC curve (AUC). The authors concluded that the LR-based fault prediction model built with the recommended approach was the most accurate. The AUC data demonstrated that the suggested technique accomplished better results than the existing Cross Project Fault Prediction (CPFP). However, in this study, other commonly used performance criteria, such as accuracy, precision, and recall, were not employed to grade the impact of the proposed approach. Once again, only supervised learning algorithms were used in the study, and no optimization algorithms were applied.

Anand et al.40 conducted a correlative performance assessment of various FS techniques in SDP. Chi-Square (CS), Correlation Coefficient (CC), Fisher’s Score, Information Gain (IG), Mean Absolute Difference (MAD), and Variance Threshold (VT) are among the filter-based FS approaches used in this investigation. The wrapper-based FS strategies include the Backward Feature Elimination (BFE), Exhaustive Feature Elimination (EFE), Forward Feature Elimination (FFE), and Recursive Feature Elimination (RFE) methodologies. RFI and LASSO Regularization are among the embedded FS techniques utilized in this study. The recommended model uses six publicly accessible benchmark NASA datasets with the NB, SVM, DT, and KNN classifiers. The authors used the performance evaluation criteria of F1-score, recall, accuracy, and precision. The authors’ experimental results showed that Fisher’s score behaved more precisely than the other FS techniques. Nonetheless, compared to the no-FS situation, all FS strategies enhanced the model’s behaviour. A drawback of this study is that it neglected to examine the impact of optimization strategies on the FS.

The dynamic re-ranking approach-based WFS technique was introduced by Balogun et al.41 in response to the exorbitant processing expenses of wrapper-based FS (WFS) methods. The recommended technique was constructed using 25 public domain datasets extracted from the NASA, AEEEM, PROMISE, and ReLink archives, using classifiers such as DT and NB. The findings of the experiment illustrated that the recommended method reduced computing time and enhanced model performance when executing FS. A disadvantage is that the suggested method used both the FFS and WFS techniques: FFS has variable performance across datasets and classifiers, whereas WFS suffers from stagnation in local optima and high computing costs. Once more, only two supervised classifiers were examined in this work; SVM and K-NN, two other well-known classifiers, were not examined.

Balogun et al.42 proposed an inventive hybrid multifilter wrapper FS arrangement based on rank aggregation to select critical features and address the aforementioned shortcomings. The recommended course of action was implemented in two steps. In the first lap, a multifilter FS mechanism based on rank aggregation was used, which combined the separate rank lists from multifilter methods to build an original, dependable, and non-disjoint rank list. This resolves the filter rank choice issue. In the second lap, an upgraded wrapper FS approach, predicated on dynamic re-ranking, was used to preprocess the aggregated ranked attributes. The competence of the recommended method was illustrated using NB and DT classifiers on benchmark software fault datasets. The tests used accuracy, area under the curve (AUC), and F-measure values as evaluation criteria. The authors used their findings to address the issues of filter rank choice and local optima stagnation in HFS, demonstrating the suggested method’s ingenuity in selecting the best characteristics while maintaining or boosting the performance of the forecasting models. They concluded that applying the recommended technique significantly improves the behaviour of the model. However, the model was limited to only two classifiers to achieve satisfactory results. Consequently, the potential for extrapolating the results to alternative classifiers has not been explored.

Alsghaier and Akour43 presented an SDP model by fusing the GA, SVM, and PSO. Three stages were implemented: GA-SVM for GA integration, PSO-SVM for PSO integration, and GAPSO_SVM for the reciprocal iteration-based integration of GA-SVM and PSO-SVM. During the experimentation phase, 24 benchmark SDP datasets (12 NASA MDP and 12 open-source Java applications) were subjected to the proposed model using the SVM classifier. Experiments were conducted using the WEKA tool and MATLAB 2015 to validate the theoretical model. The impact of the developed approach was assessed using evaluation metrics such as accuracy, recall, precision, F-measure, specificity, error rate, and standard deviation. The experimental results showed that combining the GA with SVM and PSO had a beneficial effect on the model and enhanced its performance when applied to both small- and large-scale datasets. However, the precision metric was insufficient for appraising the suggested procedures.

Alsghaier and Akour44 built on their earlier work43 by combining GA, SVM, and the Whale Optimization Algorithm (WOA) to forecast defects. The remainder of the experimental configuration remained the same as in the previous study.43 Through experimental data, the researchers discovered that the behaviour of the defect forecast model was improved for both large-scale and small-scale datasets when the GA was integrated with SVM and WOA. For the datasets under study, WA-SVM performed more accurately than GAWA-SVM, and GAWA-SVM produced the worst outcomes. Again, the proposed method outperformed SVM for the NASA MDP and open-source Java projects regarding SD scores. This illustrates how combining SVM with optimization techniques enhances prediction performance. For the NASA datasets, GA-SVM and GAWA-SVM produced the best outcomes in terms of specificity. This proved that the GA-SVM and GAWA-SVM procedures are appropriate for software defect prediction when enforced on an enormous dataset.

Balogun et al.45 used NASA datasets from the PROMISE archives to thoroughly evaluate the FSS algorithms on NB, DT, LR, and KNN. Their findings imply that the studied FS techniques enhanced the system’s performance. Information Gain, one of the FFR techniques, demonstrated the best results. Consistency Feature Subset Selection (CFSS), based on the Best First Search in FSS methods, had the most significant impact on the forecasting models. However, there were variations in performance across the classifiers and datasets. The authors also found that models constructed using FFR-based techniques are more stable than those constructed using FSS-based approaches. This study focused only on FFS procedures, and the effects of the WFS techniques were not investigated in detail.

Table 1 presents the summary of the above-discussed literature, along with some recent advancements in software defect prediction.

Table 1. Summary of referred literature in software defect prediction.

Author / Year | Objective | Description / Methods used | Findings
Das et al. 202332 | To propose a novel FS technique called FSGJO based on the Golden Jackal Optimization (GJO) algorithm to identify the best traits from a defect prediction dataset. | KNN, DT, NB, and QDA classifiers were employed using twelve SDP datasets extracted from the PROMISE project. The behaviour of the proposed FSGJO method was compared with other FS techniques, including FSDE, FSPSO, FSACO, and FSGA. | The proposed FSGJO method achieved enhanced accuracy compared with other studied FS techniques. On the drawback side, the suggested method requires parameter tuning.
Khalid et al. 202333 | To investigate various ML techniques and optimized ML processes on three publicly available NASA datasets. | The authors used ensemble and PSO techniques in their work and carefully examined the outcomes. | The SVM and optimized SVM performed better than the other models. This investigation used a small number of datasets, and other widely used optimization techniques were not explored.
Kumar and Das 202234 | To apply a Genetic Algorithm based FS technique to select fine traits from a defective dataset. | This study used classifiers like DT, NB, and KNN on twelve NASA datasets from the PROMISE archive. Accuracy and failure rate were used as performance measures. | The proposed FSGA technique enhanced the defect forecast model's behaviour. However, the impact of alternative optimization techniques still needs to be examined.
Thirumoorthy et al. 202235 | To propose a hybrid SDP approach, based on the TOPSIS and hybrid Rao algorithms (THRO), to find the best set of attributes. | Using three benchmark NASA SDP datasets, the authors implemented their suggested THRO-based FS algorithm on SVM and NB classifiers. The impact of the suggested algorithm was evaluated against six metaheuristic FS approaches. | The suggested THRO-based FS algorithm improved the model's classification performance and outperformed other tested FS approaches. However, a shortcoming was the high computing cost.
Batool et al. 202236 | To provide a thorough and structured analysis of the body of existing literature. | Numerous pertinent publications that employed DM, ML, and DL, among other techniques, for fault prediction were studied. The endeavour was motivated by the need to find answers to research problems stated in the evaluation that might not have been addressed in the works evaluated or that called for a different viewpoint. | SDP regularly uses DM and ML techniques such as DT, NB, SVM, NN, ET, and EA. Researchers have also employed DL techniques, including CNN, MLP, LSTM, and DNN, to forecast software problems despite their less frequent application.
Chen et al. 202237 | To propose an SDP architecture based on nested stacking and heterogeneous FS. This study's two main objectives were to increase SDP accuracy and optimize software testing resource allocation. | Two datasets, PROMISE and Kamei, were employed for the experiments. The study encompassed both large-scale cross-project defect prediction and within-project defect prediction. The AUC and F1-score assessment measures were used to show the model's behaviour. | The suggested approach outperformed the baseline models in terms of classification for the two sets of software failure datasets. Nevertheless, nested stacking could be more efficient, and complex trials were needed to find the baseline model's ideal combination.
Arora and Kaur 202238 | To propose a heterogeneous fault prediction (HFP) model using FS on both origin and destination datasets to create a successful forecasting model using supervised training techniques. | For the FS, RFI was utilised. The suggested framework was used with the supervised machine learning classifiers SVM, NB, RF, AdaBoost, DT, and LR in two open-source projects, MySQL and Linux. The Area under the ROC curve (AUC) was used to grade the proposed model's behaviour. | The LR-based model was found to be the most accurate. According to the AUC data, the proposed method outperformed the current Cross Project Fault Prediction (CPFP). However, other popular measures like accuracy, precision, and recall were not used. Once more, no optimization methods were employed in the study.
Anand et al. 202240 | To evaluate the correlated performance of several FS methods used in SDP. | This study employed several filter-based FS techniques, including Chi-Square (CS), Correlation Coefficient (CC), Fisher's Score, Information Gain (IG), Mean Absolute Difference (MAD), and Variance Threshold (VT); wrapper-based techniques like Recursive Feature Elimination (RFE), Forward Feature Elimination (FFE), Backward Feature Elimination (BFE), and Exhaustive Feature Elimination (EFE); and embedded FS approaches including RFI and LASSO Regularisation. The suggested model used six publicly available benchmark NASA datasets for the NB, SVM, DT, and KNN classifiers. | Fisher's score behaved more precisely than other FS approaches. Nonetheless, every FS strategy improved the model's behaviour compared to the no-FS scenario. This study's failure to investigate how optimization tactics affect the FS is one of its shortcomings.
Balogun et al. 202141 | To introduce the dynamic re-ranking approach-based WFS technique to address the excessive processing costs of wrapper-based FS (WFS) approaches. | The suggested method was built using classifiers like DT and NB. It was based on 25 public domain datasets from the NASA, AEEEM, PROMISE, and ReLink archives. | The significant findings were improved performance and decreased computation time. The drawbacks of the FFS and WFS approaches threatened the proposed work. Only two supervised classifiers were considered; SVM and K-NN, two other popular classifiers, were not.
Balogun et al. 202142 | To suggest a creative hybrid multifilter wrapper FS configuration that uses rank aggregation to choose important aspects. | Benchmark software fault datasets were used to demonstrate the effectiveness of the suggested approach, which employed NB and DT classifiers. The tests employed F-measure values, accuracy, and area under the curve (AUC) as evaluation criteria. | The suggested method greatly enhanced the model's behaviour. However, the model could only use two classifiers to get good results. As a result, the possibility of extrapolating the findings to other classifiers is yet to be investigated.
Alsghaier et al. 202043 | To present an SDP model combining the GA, SVM, and PSO. | The suggested model was tested using the SVM classifier on 24 benchmark SDP datasets (12 NASA MDP and 12 open-source Java apps). Evaluation criteria like accuracy, recall, precision, F-measure, specificity, error rate, and standard deviation were used to gauge the effectiveness of the created method. | When the GA was combined with SVM and PSO, the model's improved performance benefited both small- and large-scale datasets. Nevertheless, the precision metric was inadequate and required improvement to evaluate the recommended techniques.
Alsghaier et al. 202144 | To foresee flaws by combining the Whale Optimisation Algorithm (WOA), SVM, and GA, building on their previous work.43 | During the experiment, 24 benchmark SDP datasets (12 NASA MDP and 12 open-source Java apps) were used to test the proposed model using the SVM classifier. The efficacy of the developed method was assessed using evaluation criteria such as accuracy, recall, precision, F-measure, specificity, error rate, and standard deviation. | The GAWA-SVM yielded the lowest results for the datasets under investigation, while WA-SVM outperformed GAWA-SVM in accuracy. Regarding SD scores, the suggested approach fared better than SVM for all datasets. The GA-SVM and GAWA-SVM approaches yielded the best specificity. This demonstrated the suitability of the GA-SVM and GAWA-SVM processes for software defect prediction when applied to a large dataset.
Balogun et al. 201945 | To comprehensively analyse the FSS algorithms on NB, DT, LR, and KNN. | They used NASA datasets from the PROMISE archives on the NB, DT, LR, and KNN classifiers. | Information Gain emerged as the best FFR technique. Consistency Feature Subset Selection (CFSS), based on Best First Search in FSS techniques, significantly impacts the forecasting models.
Abdu et al. 202454 | To provide a defect prediction model using a deep hierarchical convolution neural network (DH-CNN) based on several source code representations. | Semantic-graph features collected from the control flow graph and data dependence graph using Node2vec were fed into a semantic-level DH-CNN, while syntax features derived from abstract syntax trees using Word2vec were given to a syntax-level DH-CNN. Furthermore, the suggested model incorporated a gated merging method that combined DH-CNN outputs to estimate the ratio of both feature types. | In both cross-project and within-project scenarios, DH-CNN performed better than current techniques.
Abdu et al. 202455 | To suggest a unique defect prediction model that leverages a hybrid deep learning approach to combine traditional and semantic information. | A CNN-MLP hybrid classifier was used, where semantic characteristics were retrieved from projects' abstract syntax trees (ASTs) using Word2vec and processed by the CNN. A multilayer perceptron (MLP) processed the conventional features taken from the dataset repository. After integration, the CNN and MLP outputs were sent to a fully connected layer for defect prediction. Extensive testing was done on several open-source applications to confirm CNN-MLP's efficacy. | CNN-MLP significantly improved defect prediction performance. Additionally, CNN-MLP performed better than current techniques in effort-aware and non-effort-aware scenarios.
Abdu et al. 202356 | To propose a graph-based feature learning model for cross-project defect prediction (GB-CPDP). | Used Long Short-Term Memory (LSTM) networks to learn predictive models. Node2Vec was used to convert CFGs and DDGs into numerical vectors. Nine open-source Java programs from the PROMISE dataset were used. F1-measure and Area under the Curve (AUC) were the performance measures. | The experimental evaluation showed that GB-CPDP performed better than state-of-the-art CPDP techniques. The outcomes demonstrate how well GB-CPDP works to enhance cross-project defect prediction performance.
Abdu et al. 202257 | To methodically illustrate current software defect prediction methods based on the salient characteristics of the source code. | Ninety of the 283 articles on software defect prediction that were the subject of an extensive literature assessment were critically reviewed by analysing the semantic feature approaches to present critical problems and challenges. | Such an extensive survey may help research communities determine the present issues and potential avenues for future investigation.

All the previously stated FS approaches, whether supervised or unsupervised, have disadvantages that significantly impact the model’s performance, including (i) high cost, (ii) entrapment in local optima, (iii) a low convergence rate, and (iv) the fine-tuning of excessively many parameters. The primary drawback of the previously stated FS techniques is the need to modify the regulating parameters accurately while choosing ideal characteristics. These shortcomings motivated us to propose a novel FS technique (FSCOA) that draws inspiration from the Chernobyl Disaster Optimizer (CDO).30 The 1986 nuclear reactor core outburst in Chernobyl served as an impetus for the development of the CDO meta-heuristic algorithm. The process of nuclear radiation, in which alpha, beta, and gamma fragments propagate and damage humans following an explosion, is replicated by CDO. From the high-pressure point (the explosion site) to the low-pressure point (the position of the individual), the above-mentioned radiations travel extremely rapidly. The algorithm comprises an initial population of potential solutions. Moreover, it calculates the alpha, beta, and gamma fragment gradient descent factors (GDF) during the attacks on humans. Determining the average of these GDF values over several iterations yields the best result.

Feature selection based on Chernobyl Disaster Optimizer Algorithm

Feature selection determines the crucial attributes that have the greatest impact on the target variable, which helps increase machine learning model accuracy, reduce computing costs, and reduce the risk of overfitting. The mechanism of selecting the best features begins with the creation of a set of subgroups of attributes. The adequacy of these subgroups is then assessed and compared to determine which subgroup is the best, or until the stopping criteria are met. In the final lap, the subgroup with the best features is used to build the defect forecasting model and compute the predictive accuracy. The 1986 Chernobyl nuclear reactor catastrophe46 is recognized as one of the worst nuclear disasters in modern history, in terms of both cost and casualties. Inspired by the Chernobyl nuclear reactor core eruption, the Chernobyl Disaster Optimizer (CDO)30 is a meta-heuristic optimization technique. To choose the most appropriate subset of characteristics for classification, a novel FS approach using the Chernobyl Disaster Optimizer (FSCOA) is therefore proposed to address the aforementioned problem. The blueprint for the proposed FSCOA method is shown in Figure 1.58


Figure 1. Blueprint of the suggested FSCOA methodology.

This study suggests a novel FSCOA technique for selecting the first-rate subgroup of attributes for categorization. The primary intent of the recommended technique is to identify the best attribute combination that will lower the model’s fitness. Broadly, the proposed methodology has been implemented in the following three steps:

Step-1: First, the selection of the relevant SDP datasets is crucial. Twelve publicly benchmarked NASA software defect datasets taken from the PROMISE archive48 were used to assess the persuasiveness of the suggested FSCOA strategy. The datasets were MW1, PC1, PC2, PC3, PC4, KC1, KC3, CM1, JM1, MC1, MC2, and PC5. Following the selection of the datasets, an in-depth examination was carried out to determine any missing, inconsistent, or categorical data. It became apparent that there were no missing data in the datasets. Nevertheless, a few datasets contained categorical data, which were converted into numeric values. Furthermore, the original feature values were normalized to the range 0 to 1. Subsequently, an 80:20 split between the training and testing datasets was created for each normalized dataset.
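A minimal sketch of this preprocessing step is shown below, assuming the dataset is available as a CSV file with a label column named "defects"; the file name and column name are illustrative assumptions.

```python
# Sketch of Step-1: encode categorical data, min-max normalize to [0, 1],
# and create an 80:20 train/test split (file and column names illustrative).
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler

df = pd.read_csv("pc1.csv")                        # hypothetical dataset file
X = pd.get_dummies(df.drop(columns=["defects"]))   # encode categorical columns
y = df["defects"].astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

scaler = MinMaxScaler()                  # maps each feature into [0, 1]
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)        # reuse training statistics on test set
```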

Step-2: The two preeminent criteria for developing and investigating the model are the population size and the maximum number of iterations. Higher values will improve the model’s performance but will also lengthen the computation time. In this study, the population size and the maximum number of iterations were set to 30 and 200, respectively.

Step-3: By applying the recommended FSCOA methodology, four supervised learning classifiers (DT, KNN, NB, and QDA) were used to construct the model using the optimal features that were chosen. The best predictive classifier was then determined by comparing the accuracy of the proposed FSCOA approach with that of the other FS models under study.

A complete flow diagram of the proposed FSCOA technique is shown in Figure 2.58


Figure 2. Flow-diagram of recommended FSCOA approach.

Initializing the criteria, such as the population size (M), problem dimension (F), lower bound (LowBound), and upper bound (UppBound), is the first step of the procedure. Subsequently, a random binary population of M fragments with dimension F is generated, where Z = [Z_1, Z_2, Z_3, …, Z_M] is the population, Z_i = [Z_{i,1}, Z_{i,2}, Z_{i,3}, …, Z_{i,F}] is the location of the i-th fragment in the F-dimensional feature space, i = 1, 2, 3, …, M is the fragment index, and Z_{i,f} is the position of the i-th fragment for the f-th trait of the population. Four classification techniques (DT, KNN, NB, and QDA) were considered for fitness (error) computation on the randomly selected characteristics. The FS algorithm aims to select the subset of ideal features that minimizes the fitness of the learning algorithm. The error (Err_i^t) is estimated as the disparity between the actual outcome (AO_i^t) and the estimated outcome (EO_i^t), as described by Eq. (1).

(1)
Err_i^t = AO_i^t − EO_i^t

The fitness (FitValue^t) of the learning algorithm is calculated by dividing the sum of the errors by the total count of instances in the testing data. This is characterized by Eq. (2).

(2)
FitValue^t = (Σ_{i=1}^{p} Err_i^t) / p

Here, i = 1, 2, …, p, where p represents the count of instances in the test data, and t represents the current iteration.
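In code, Eqs. (1) and (2) amount to the misclassification rate of a classifier trained on the selected features. Below is a minimal sketch, assuming numpy feature arrays, binary 0/1 labels, and an interchangeable classifier.

```python
# Sketch of the fitness of Eqs. (1)-(2): the sum of per-instance errors
# divided by the number of test instances (the misclassification rate).
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

def fitness(mask, X_train, y_train, X_test, y_test):
    if not mask.any():
        return 1.0                              # worst fitness: empty subset
    clf = KNeighborsClassifier().fit(X_train[:, mask], y_train)
    errors = np.abs(y_test - clf.predict(X_test[:, mask]))     # Eq. (1)
    return errors.sum() / len(y_test)                          # Eq. (2)
```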

The transfer function depicted in Eq. (3) is employed to transform the continuous fragment positions into their binary equivalents.

(3)
TF = 1 / (1 + exp(−10 × (Z_{i,f} − 0.5)))
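A one-function sketch of this binarization, assuming the standard sigmoid reading of Eq. (3) and a 0.5 threshold for deciding whether a feature is selected:

```python
# Sketch of the transfer function of Eq. (3): squash continuous fragment
# positions into (0, 1), then threshold to obtain a binary feature mask.
import numpy as np

def to_binary(Z, threshold=0.5):
    tf = 1.0 / (1.0 + np.exp(-10.0 * (Z - 0.5)))    # Eq. (3)
    return tf > threshold                           # True = feature selected
```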

The proposed FSCOA approach employs the CDO algorithm to determine the optimal features for a given dataset. In CDO, different types of emissions are released from nuclei as a result of radioactivity caused by nuclear instability. The most prevalent types of these emissions are alpha, beta, and gamma fragments. These fragments, which are very dangerous to people, fly from a high-pressure point (the point of explosion) to a low-pressure point (the position of the individual). CDO simulates the effects of radioactive decay on a human who is attacked following a nuclear explosion. The primary processes modeled are the nuclear explosion and the attack on humans by the gamma, beta, and alpha fragments. Humans are assumed to be on foot when attacked, and human walking speed can be estimated to be between 0 and 3 miles per hour.47 Based on this, Eq. (4) models a linear reduction of this speed over the iterations.

(4)
WalkSpeed_human = 3 − t × (3 / max_iter)

Alpha fragment

The gradient descent factor (GDFα) of the alpha fragment while threatening humans can be computed using Eq. (5).

(5)
GDF_α = 0.25 × (POS_α(t) − PROP_α × D_α)

Here, POS_α(t) is the prevailing position of the alpha fragments; PROP_α represents the dispersion of the alpha fragments and can be calculated using Eq. (6); D_α is the discrepancy between the position of the human and the position of the alpha fragments, which can be determined using Eq. (8).

(6)
PROP_α = (π × rad × rad) / (0.25 × Speed_α) − (WalkSpeed_human × rand())

Here, rad is a random value between 0 and 1, and Speed_α is the speed of the alpha fragments, which lies in the range of 1–16,000 km/s. It is normalized using Eq. (7).

(7)
Speed_α = log(rand(1:16000))
(8)
D_α = |Area_α × POS_α(t) − AvgT(t)|

Here, Area_α is the propagation area of the alpha fragments, calculated as π × rad × rad, where rad is a random value between 0 and 1; AvgT is the average of the gradient descent factors, determined using Eq. (17).

Beta fragment

Eq. (9) can be used to determine the gradient descent factor (GDFβ) of a beta fragment assaulting a human.

(9)
GDF_β = 0.5 × (POS_β(t) − PROP_β × D_β)

Here, POS_β(t) is the current position of the beta fragments; PROP_β represents the propagation of the beta fragments and can be calculated using Eq. (10); D_β is the discrepancy between the position of the human and the position of the beta fragments, which can be determined using Eq. (12).

(10)
PROP_β = (π × rad × rad) / (0.5 × Speed_β) − (WalkSpeed_human × rand())

Here, rad is a random value between 0 and 1, and Speed_β is the speed of the beta fragments, which lies in the range of 1–270,000 km/s. It is normalized using Eq. (11).

(11)
Speed_β = log(rand(1:270000))
(12)
D_β = |Area_β × POS_β(t) − AvgT(t)|

Here, Area_β is the propagation area of the beta fragments, calculated as π × rad × rad, where rad is a random value between 0 and 1; AvgT is the average of the gradient descent factors, computed using Eq. (17).

Gamma fragment

The gradient descent factor (GDFγ) of the gamma fragment while making an assault on humans can be computed using Eq. (13).

(13)
GDF_γ = POS_γ(t) − PROP_γ × D_γ

Here, POS_γ(t) is the prevailing position of the gamma fragments; PROP_γ represents the dispersion of the gamma fragments and can be calculated using Eq. (14); D_γ is the discrepancy between the position of the human and the position of the gamma fragments, which can be determined using Eq. (16).

(14)
PROP_γ = (π × rad × rad) / Speed_γ − (WalkSpeed_human × rand())

Here, rad is a random value between 0 and 1, and Speed_γ is the speed of the gamma fragments, which lies in the range of 1–300,000 km/s. It is normalized using Eq. (15).

(15)
Speed_γ = log(rand(1:300000))
(16)
D_γ = |Area_γ × POS_γ(t) − AvgT(t)|

Here, Area_γ is the propagation area of the gamma fragments, calculated as π × rad × rad, where rad is a random value between 0 and 1; AvgT is the average of the gradient descent factors, determined using Eq. (17).

(17)
AvgT = (GDF_α + GDF_β + GDF_γ) / 3
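Putting Eqs. (4) to (17) together, one position update can be sketched as below. This is an interpretive, simplified reading of the update rules (scalar coefficients 0.25, 0.5, and 1 for the alpha, beta, and gamma fragments; avg_t carried over from the previous iteration); the original CDO paper30 remains the definitive reference.

```python
# Simplified sketch of one CDO position update (Eqs. (4)-(17)); an
# interpretive reading, not a definitive implementation of the algorithm.
import numpy as np

rng = np.random.default_rng()

def gdf(pos, coeff, max_speed, walk_speed, avg_t):
    """Gradient descent factor for one fragment type."""
    rad = rng.random()                           # propagation radius in [0, 1]
    speed = np.log(rng.uniform(1, max_speed))    # Eqs. (7)/(11)/(15)
    prop = (np.pi * rad * rad) / (coeff * speed) - walk_speed * rng.random()
    area = np.pi * rad * rad                     # propagation area
    d = np.abs(area * pos - avg_t)               # Eqs. (8)/(12)/(16)
    return coeff * (pos - prop * d)              # Eqs. (5)/(9)/(13)

def update_position(pos_a, pos_b, pos_g, t, max_iter, avg_t):
    walk_speed = 3.0 - t * (3.0 / max_iter)      # Eq. (4)
    gdf_a = gdf(pos_a, 0.25, 16_000, walk_speed, avg_t)
    gdf_b = gdf(pos_b, 0.50, 270_000, walk_speed, avg_t)
    gdf_g = gdf(pos_g, 1.00, 300_000, walk_speed, avg_t)
    return (gdf_a + gdf_b + gdf_g) / 3.0         # Eq. (17), the new Z_i
```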

Finally, Algorithm 1 provides a summary of the entire proposed FSCOA process.

Proposed FSCOA approach

Algorithm 1.

  • 1. Initialize Population Size (M) , Dimension (F) , Lower Bound (LowBound) , Upper Bound (UppBound) , Maximum Iteration (max_iter)

  • 2. Generate the binary feature subset Zi randomly

  • 3. Initialize the alpha ( POSα ), beta ( POSβ ), and gamma ( POSγ ) positions

  • 4. while (t<max_iter) do{

  • 5.   for i=1 : M do

  • 6.    for j=1to F do

  • 7.     The values of the initial positions of the fragments are converted into their corresponding binary values using Eq. (3).

  • 8.     Compute the fitness value (FitValue) for the alpha, beta, and gamma fragments using Eq. (2)

  • 9.     if (FitValue<αscore)

  • 10. αScore=FitValue

  • 11.        Update POSα

  • 12.     endif

  • 13.     if (FitValue>αscore)and(FitValue<βscore)

  • 14. βScore=FitValue

  • 15.        Update POSβ

  • 16.     endif

  • 17.     if (FitValue>αscore)and(FitValue>βscore)and(FitValue<γscore)

  • 18. γScore=FitValue

  • 19.        Update POSγ

  • 20.     endif

  • 21.    end for

  • 22.   end for

  • 23. Compute human walking speed (WalkSpeedhuman) using Eq. (4)

  • 24. Compute the speed of alpha (Speedα) , beta (Speedβ) , and gamma (Speedγ) fragments using Eq. (7), Eq. (11), and Eq. (15), respectively.

  • 25. for i=1 : M do

  • 26.    for j=1 : F do

  • 27.     Determine GDFα using Eq. (5)

  • 28.     Determine GDFβ using Eq. (9)

  • 29.     Determine GDFγ using Eq. (13)

  • 30.     Update Zi using the average of the fragment positions, per Eq. (17)

  • 31.    end for

  • 32. end for

  • 33. t=t+1

  • 34. } //end of while loop

  • 35. Return finest solution, Zi

  • 36. end procedure

Result analysis

This section deliberates on the empirical findings of this research. The persuasiveness of the proposed FSCOA approach was evaluated using 12 publicly benchmarked NASA software defect datasets extracted from the PROMISE archive.48 KC1, KC3, CM1, JM1, MC1, MC2, MW1, PC1, PC2, PC3, PC4, and PC5 were the datasets. First, an in-depth examination of the datasets was performed to identify missing, inconsistent, and categorical data. It became apparent that there were no missing data in the datasets. Nevertheless, a few datasets contained categorical data, which were converted into numeric values. Again, we noticed that the datasets comprised continuous data with varying ranges. To overcome this problem, the datasets were transformed using the min–max normalization method,49 which mapped the original feature values into the range of zero to one. Subsequently, an 80:20 split between the training and testing datasets was created for each normalized dataset. Extensive information regarding the datasets used in this exploration is shown in Table 2.

Table 2. Specifics of the NASA datasets employed.

Datasets | No. of instances | No. of features | Non-susceptible classes (NSC) | Susceptible classes (SC) | Susceptible (%)
PC1 | 705 | 38 | 644 | 61 | 8.7
PC2 | 745 | 37 | 729 | 16 | 2.1
PC3 | 1077 | 38 | 943 | 134 | 12.4
PC4 | 1287 | 38 | 1110 | 177 | 13.8
PC5 | 1711 | 39 | 1240 | 471 | 27.5
CM1 | 327 | 38 | 285 | 42 | 12.8
JM1 | 7782 | 22 | 6110 | 1672 | 21.5
KC1 | 1183 | 22 | 869 | 314 | 26.5
KC3 | 194 | 40 | 158 | 36 | 18.5
MC1 | 1988 | 39 | 1942 | 46 | 2.3
MC2 | 125 | 40 | 81 | 44 | 35.2
MW1 | 253 | 38 | 226 | 27 | 10.6

The experiments were administered on a computer with an Intel Core i5-6200 CPU with a clock rate of 2.40 GHz and 8 GB of RAM. The aforementioned techniques were employed in a Python 3 environment using a Jupyter notebook. First, the input dataset was loaded using Pandas. The datasets were transformed using the min–max normalization method.49 Using train_test_split from sklearn.model_selection, each dataset was partitioned into training and testing sets at a ratio of 80:20. The population size and the maximum number of iterations were the two primary criteria for developing and validating the model. The model will provide superior outcomes with higher values, but they also increase computing time. In this investigation, the population size and the maximum number of iterations were set to 30 and 200, respectively. Four supervised learning classifiers, DT, KNN, NB, and QDA, were used to assess the behaviour of the proposed FSCOA approach. Further, the conduct of the proposed technique was compared with some of the widely used FS techniques, namely FSDE, FSPSO, FSGA, and FSACO. The fitness error plots for the suggested FSCOA approach and the other studied FS strategies with the examined classifiers (DT, KNN, NB, and QDA) were obtained using matplotlib.pyplot. This study used accuracy, a frequently applied performance indicator, for assessment purposes. Accuracy is the simple proportion of the total instances that were correctly classified. Eq. (18) calculates it from the confusion matrix, as follows:

(18)
Accuracy = (TP + TN) / (TP + TN + FP + FN)

Here, TP , TN , FP , and FN represent true positives, true negatives, false positives, and false negatives, respectively.
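A minimal sketch of this computation, using scikit-learn's confusion matrix for a binary prediction task:

```python
# Sketch of Eq. (18): accuracy computed from the confusion matrix.
from sklearn.metrics import confusion_matrix

def accuracy(y_true, y_pred):
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    return (tp + tn) / (tp + tn + fp + fn)          # Eq. (18)
```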

The performance of the recommended FSCOA algorithm was evaluated against several other FS procedures, such as FSDE, FSPSO, FSGA, and FSACO, in terms of classification accuracy and the number of selected attributes on the 12 datasets studied in this research. Because of the stochastic character of these techniques, we carried out ten runs of each trial, each with a new initial random population, to ensure that the performance of each procedure remained consistent. The median accuracy of the proposed FSCOA, along with the other studied FS approaches, is listed in Table 3.59

Table 3. Accuracy percentage and number of features selected by four classifiers for twelve datasets.

Sl. No. | Datasets | FS Algorithms/Classifiers | KNN | DT | NB | QDA | Attributes selected
1 | KC1 | Without FS | 69.62 | 72.15 | 74.26 | 74.26 | 22
 | | FSDE | 76.46 | 77.09 | 77.22 | 78.1 | 8.2
 | | FSPSO | 74.64 | 73.12 | 76.12 | 76.27 | 8.3
 | | FSGA | 76.47 | 76.03 | 77.22 | 77.93 | 9.4
 | | FSACO | 77.69 | 76.85 | 77.47 | 77.85 | 4.5
 | | FSCOA | 77.13 | 75.32 | 77.34 | 78.27 | 8.2
2 | KC3 | Without FS | 74.36 | 76.92 | 66.67 | 76.92 | 40
 | | FSDE | 80 | 90 | 76.92 | 86.92 | 17.7
 | | FSPSO | 76.15 | 81.03 | 71.54 | 79.74 | 16.9
 | | FSGA | 79.23 | 87.69 | 76.15 | 87.69 | 18.8
 | | FSACO | 86.92 | 85.13 | 79.49 | 86.41 | 8.9
 | | FSCOA | 82.05 | 86.41 | 80.25 | 86.41 | 12.77
3 | JM1 | Without FS | 73.35 | 69.94 | 78.99 | 75.85 | 22
 | | FSDE | 77.18 | 78.73 | 79.85 | 79.72 | 4.9
 | | FSPSO | 75.07 | 72.94 | 79.16 | 79.05 | 8.9
 | | FSGA | 76.36 | 73.29 | 79.62 | 79.83 | 10.3
 | | FSACO | 79.26 | 79.49 | 79.89 | 79.79 | 3
 | | FSCOA | 79.13 | 79.66 | 79.91 | 79.82 | 3.75
4 | CM1 | Without FS | 75.76 | 80.3 | 77.27 | 83.33 | 38
 | | FSDE | 86.82 | 89.7 | 83.48 | 88.48 | 18.2
 | | FSPSO | 83.33 | 83.18 | 81.97 | 85 | 14.5
 | | FSGA | 85.3 | 88.94 | 83.18 | 88.48 | 17.8
 | | FSACO | 87.42 | 87.27 | 84.55 | 88.18 | 12.1
 | | FSCOA | 88.33 | 83.64 | 84.39 | 88.79 | 15.4
5 | MC1 | Without FS | 96.48 | 97.74 | 95.73 | 97.49 | 39
 | | FSDE | 97.74 | 98.57 | 97.71 | 97.74 | 19.2
 | | FSPSO | 97.56 | 98.49 | 96.31 | 97.49 | 12.4
 | | FSGA | 97.59 | 98.67 | 97.71 | 97.74 | 19.2
 | | FSACO | 98.02 | 98.34 | 97.74 | 97.76 | 13.4
 | | FSCOA | 97.94 | 98.59 | 97.71 | 97.96 | 15.7
6 | MC2 | Without FS | 76 | 68 | 92 | 84 | 40
 | | FSDE | 87.6 | 90.8 | 95.6 | 96 | 18.4
 | | FSPSO | 80 | 75.2 | 92.8 | 88.4 | 17.2
 | | FSGA | 85.2 | 89.6 | 93.2 | 95.2 | 18.4
 | | FSACO | 89.2 | 86 | 96 | 95.2 | 7.2
 | | FSCOA | 94.4 | 82.4 | 96 | 96 | 12.9
7 | PC1 | Without FS | 89.36 | 88.65 | 87.23 | 86.52 | 38
 | | FSDE | 93.76 | 95.04 | 91.21 | 93.26 | 19.1
 | | FSPSO | 90.85 | 92.06 | 89.65 | 89.72 | 16.2
 | | FSGA | 93.48 | 95.04 | 90.64 | 93.48 | 18.9
 | | FSACO | 93.97 | 93.97 | 92.63 | 92.91 | 10.8
 | | FSCOA | 94.4 | 92.77 | 92.77 | 93.83 | 17.9
8 | PC2 | Without FS | 96.64 | 95.3 | 93.96 | 97.32 | 37
 | | FSDE | 97.79 | 98.66 | 96.98 | 97.58 | 14.7
 | | FSPSO | 97.45 | 96.51 | 95.84 | 97.38 | 15.2
 | | FSGA | 97.65 | 98.32 | 96.31 | 97.48 | 17.1
 | | FSACO | 97.89 | 97.25 | 97.22 | 98.12 | 10.7
 | | FSCOA | 97.99 | 97.85 | 97.32 | 98.19 | 12.52
9 | PC3 | Without FS | 82.41 | 78.7 | 68.98 | 62.03 | 38
 | | FSDE | 86.34 | 86.39 | 86.85 | 86.44 | 16.6
 | | FSPSO | 84.77 | 82.92 | 80.83 | 83.38 | 13.5
 | | FSGA | 85.93 | 86.57 | 86.81 | 86.76 | 17.2
 | | FSACO | 86.71 | 84.44 | 87.08 | 86.82 | 11.6
 | | FSCOA | 87.04 | 85.83 | 87.18 | 86.9 | 13.8
10 | MW1 | Without FS | 78.43 | 74.51 | 76.47 | 80.39 | 38
 | | FSDE | 87.25 | 87.84 | 83.53 | 88.63 | 13.5
 | | FSPSO | 84.71 | 82.75 | 78.82 | 84.31 | 12.9
 | | FSGA | 85.69 | 87.06 | 82.16 | 86.67 | 17.2
 | | FSACO | 86.67 | 85.29 | 87.25 | 90.39 | 8.7
 | | FSCOA | 87.45 | 85.68 | 89.02 | 89.8 | 10.7
11 | PC4 | Without FS | 84.49 | 91.09 | 86.82 | 47.67 | 38
 | | FSDE | 90.11 | 93.45 | 91.59 | 92.71 | 17.6
 | | FSPSO | 86.63 | 92.64 | 89.11 | 86.98 | 14.4
 | | FSGA | 87.6 | 93.53 | 91.74 | 92.49 | 18.6
 | | FSACO | 91.16 | 92.4 | 91.4 | 91.82 | 14.2
 | | FSCOA | 91.74 | 92.95 | 92.33 | 92.75 | 15.5
12 | PC5 | Without FS | 67.06 | 72.59 | 70.55 | 69.39 | 39
 | | FSDE | 76.33 | 77.73 | 71.57 | 72.57 | 18.4
 | | FSPSO | 71.98 | 73.53 | 70.82 | 70.59 | 15.7
 | | FSGA | 75.63 | 77.81 | 71.46 | 72.92 | 19.3
 | | FSACO | 76.85 | 75.16 | 72.19 | 71.11 | 14.8
 | | FSCOA | 78.63 | 77.23 | 72.45 | 72.71 | 16.4

Table 3 shows the median accuracy of the classifiers applied to the various datasets, both with and without feature selection. The table also displays the average number of attributes selected by each FS approach. The classifiers were evaluated across the datasets using the previously discussed FS techniques. The experimental findings showed that the suggested FSCOA technique exceeded the other studied FS procedures in the majority of instances. The baseline methods studied in this work, such as FSPSO, FSDE, FSGA, and FSACO, suffer from several drawbacks, such as entrapment in local optima, a slow convergence rate, low accuracy, and parameter tuning. The proposed FSCOA technique addresses many of these limitations, as it can explore any region of the search space with good efficiency and speed and quickly escape from local minima. The suggested FSCOA performed best when combined with KNN for most datasets, with the exception of KC1, KC3, JM1, and MC1. With the exception of KC1, MC1, and CM1, the bulk of the datasets showed that the recommended FSCOA worked best when paired with NB. The majority of the datasets demonstrated that the suggested FSCOA performed best when combined with QDA, with the exception of KC3, MW1, and PC5. With the exception of JM1, the majority of the datasets showed that the previously studied FS approaches outperformed the suggested FSCOA strategy when used in conjunction with DT. Similarly, applying the proposed FSCOA technique to the JM1 dataset with all the analyzed classifiers except KNN yielded the highest accuracy. Furthermore, the bulk of the datasets yielded the best accuracy for all examined classifiers, with the exception of DT. It is important to note that for the KC1 and KC3 datasets, the suggested FSCOA technique could only provide the best prediction using the QDA and NB classifiers, respectively.

The fitness error plots for the suggested FSCOA approach and the other FS strategies with the examined classifiers DT, KNN, NB, and QDA are displayed in Figures 3 through 6, respectively.58 Each graph includes the error plots of all 12 datasets. The error plots show that, in most cases, the error of the suggested FSCOA is smaller than those of the other FS approaches employed in this investigation. In some cases, the error plot of the suggested FSCOA methodology matches those of the existing FS methods, and under certain circumstances it exceeds those of the other evaluated FS techniques.


Figure 3. DT fitness error plot.

As shown in Figure 3,58 in most datasets the fitness error plot of the proposed FSCOA approach using the DT classifier is lower than those of the other FS models, with the exception of CM1, MC2, and KC3. For the KC3 dataset, the error plot overlaps with that of FSDE after 190 iterations. The error plot for the CM1 dataset lies above those of FSDE and FSACO. Furthermore, the plot for the MC2 dataset lies above that of FSDE.

The fitness error plot of the proposed FSCOA approach with the KNN classifier is lower than those of the other FS models for most datasets, as shown in Figure 4,58 with the exception of KC1, KC3, MC1, and PC2. For the PC2 dataset, the error plot matches those of FSDE and FSACO after 115 iterations, while for the KC1, KC3, and MC1 datasets it lies above that of the FSACO model.


Figure 4. KNN fitness error plot.

As shown in Figure 5,58 the fitness error plot of the suggested FSCOA technique with the NB classifier was lower in the majority of datasets than that of the prior FS models, with the exception of CM1, KC1, MC1, PC2, MC2, and PC3. After 75 iterations, the error plot for the MC2 dataset matches that of FSACO. For the PC3 dataset, the error plot of the suggested FSCOA method matches that of FSACO after 175 iterations. However, for datasets CM1, KC1, and MC1, the error plot was above that of the FSACO model. Moreover, for PC2 and KC1, the plot was above the FSDE.


Figure 5. NB fitness error plot.

As illustrated in Figure 6,58 for most datasets (except CM1, JM1, KC3, MW1, PC2, and PC3), the fitness error plot of the proposed FSCOA approach with the QDA classifier is smaller than those of the previous FS models. Following 180 iterations, the CM1 dataset's error plot aligns with that of FSACO. The error plot of the proposed FSCOA algorithm matches that of FSGA after 175 iterations for the JM1 dataset. The error plot lies above that of the FSACO model for the MW1, PC2, and PC3 datasets. For the KC3 dataset, the plot also lies above those of FSGA and FSDE, and for the PC3 dataset, the error plot of the proposed FSCOA approach lies above that of FSGA.


Figure 6. QDA fitness error plot.

The FS algorithms employed in this study use several hyperparameters. Every run used a population size of 30 for 200 iterations. The crossover rate (CR) was maintained at 0.8 in FSGA and 0.9 in FSDE. For FSGA, the mutation rate (MR) was 0.01, and in FSDE the scaling factor (SF) was set to 0.8. In FSPSO, the maximum inertia weight (IWmax) and the minimum inertia weight (IWmin) were fixed at 0.9 and 0.4, respectively, and both acceleration factors were set to 2. The fixed values of alpha (α), rho (ρ), and beta (β) in FSACO were 1, 0.2, and 0.1, respectively. For FSCOA, the governing criterion speeds for alpha (Speedα), beta (Speedβ), and gamma (Speedγ) were set using the rand function, and the radiation propagation radius (rad) was likewise drawn between 0 and 1 using the rand function.
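For ease of reproduction, the settings above can be collected into a single configuration, as in the sketch below; the dictionary keys are illustrative names, not identifiers from the authors' code.

```python
# The hyperparameter settings reported in the text, gathered in one place.
import random

COMMON = {"population_size": 30, "iterations": 200}

HYPERPARAMS = {
    "FSGA":  {"crossover_rate": 0.8, "mutation_rate": 0.01},
    "FSDE":  {"crossover_rate": 0.9, "scaling_factor": 0.8},
    "FSPSO": {"inertia_weight_max": 0.9, "inertia_weight_min": 0.4,
              "acceleration_factors": (2, 2)},
    "FSACO": {"alpha": 1, "rho": 0.2, "beta": 0.1},
    # FSCOA draws its governing criterion speeds and the radiation
    # propagation radius uniformly from [0, 1), per the text.
    "FSCOA": {"speed_alpha": random.random(), "speed_beta": random.random(),
              "speed_gamma": random.random(), "rad": random.random()},
}
```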

Statistical analysis

This section provides extensive statistical scrutiny of the empirical findings of this work. Statistical analysis50 is a popular method for quantifying, examining, evaluating, and drawing conclusions from data. Statistical tests fall into two broad categories: parametric and non-parametric. Parametric statistical testing assumes that the data under study follow a specific probability distribution, most frequently a normal distribution. Several assumptions, such as independence of observations, homogeneity of variance, and normality, must hold for parametric tests to apply. Ensuring that these assumptions are met is essential, because failing to do so may produce erroneous results and invalid conclusions; it is therefore crucial to confirm them in advance and, if necessary, to switch to non-parametric tests. Non-parametric statistical testing does not depend on a specific distributional hypothesis for the data under study. Because non-parametric tests rely on the ordering or ranking of the data, they are more broadly applicable and more resilient to assumption violations than parametric tests, although they can be less powerful when the parametric assumptions are in fact satisfied. It is essential to choose a statistical test suited to the research question and the properties of the data being examined. In this study, the Friedman Test,51 a non-parametric rank-based test, was adopted. Under the Friedman test, every model in the trial is ranked according to its classification performance: the best-performing model receives the lowest (best) rank and the worst-performing model receives the highest rank.
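As a concrete illustration of this ranking scheme, the sketch below ranks the KC1/KNN accuracies reported later in Table 4; ties are assigned a "dense" rank, which is an assumption consistent with the duplicated ranks visible in that table (e.g., two models sharing rank 3, with the next model receiving rank 4).

```python
# Rank a column of accuracies so that higher accuracy gets a lower (better) rank.
from scipy.stats import rankdata

models = ["Without FS", "FSDE", "FSPSO", "FSGA", "FSACO", "FSCOA"]
accuracy = [69.62, 76.46, 74.64, 76.47, 77.69, 77.13]   # KC1 / KNN column of Table 4

# Rank the negated accuracies so the largest accuracy receives rank 1;
# method="dense" reproduces the tie-handling seen in Table 4.
ranks = rankdata([-a for a in accuracy], method="dense")
print(dict(zip(models, ranks)))   # FSACO -> 1, FSCOA -> 2, ..., Without FS -> 6
```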

To begin with, Eq. (19) was employed to determine the average rank ( $AverageRank_{Models}$ ) of each graded configuration (FSDE, FSPSO, FSGA, FSACO, FSCOA, and Without FS) across the four classification models (KNN, DT, NB, and QDA). Table 4,59 illustrates these findings.

(19)
$AverageRank_{Models} = \frac{\sum Rank_{Models}}{\text{Total number of Models}\ (A)}$

Table 4. For twelve NASA datasets, the average rank of all FS algorithms (Friedman Rank).

| Sl. No. | Dataset | FS Algorithm | KNN | DT | NB | QDA | AverageRank_Models |
|---|---|---|---|---|---|---|---|
| 1 | KC1 | Without FS | 69.62 (6) | 72.15 (6) | 74.26 (5) | 74.26 (6) | 5.75 |
| | | FSDE | 76.46 (4) | 77.09 (1) | 77.22 (3) | 78.10 (2) | 2.5 |
| | | FSPSO | 74.64 (5) | 73.12 (5) | 76.12 (4) | 76.27 (5) | 4.75 |
| | | FSGA | 76.47 (3) | 76.03 (3) | 77.22 (3) | 77.93 (3) | 3 |
| | | FSACO | 77.69 (1) | 76.85 (2) | 77.47 (1) | 77.85 (4) | 2 |
| | | FSCOA | 77.13 (2) | 75.32 (4) | 77.34 (2) | 78.27 (1) | 2.25 |
| 2 | KC3 | Without FS | 74.36 (6) | 76.92 (6) | 66.67 (6) | 76.92 (5) | 5.75 |
| | | FSDE | 80 (3) | 90 (1) | 76.92 (3) | 86.92 (2) | 2.25 |
| | | FSPSO | 76.15 (5) | 81.03 (5) | 71.54 (5) | 79.74 (4) | 4.75 |
| | | FSGA | 79.23 (4) | 87.69 (2) | 76.15 (4) | 87.69 (1) | 2.75 |
| | | FSACO | 86.92 (1) | 85.13 (4) | 79.49 (2) | 86.41 (3) | 2.5 |
| | | FSCOA | 82.05 (2) | 86.41 (3) | 80.25 (1) | 86.41 (3) | 2.25 |
| 3 | JM1 | Without FS | 73.35 (6) | 69.94 (6) | 78.99 (6) | 75.85 (6) | 6 |
| | | FSDE | 77.18 (3) | 78.73 (3) | 79.85 (3) | 79.72 (4) | 3.25 |
| | | FSPSO | 75.07 (5) | 72.94 (5) | 79.16 (5) | 79.05 (5) | 5 |
| | | FSGA | 76.36 (4) | 73.29 (4) | 79.62 (4) | 79.83 (1) | 3.25 |
| | | FSACO | 79.26 (1) | 79.49 (2) | 79.89 (2) | 79.79 (3) | 2 |
| | | FSCOA | 79.13 (2) | 79.66 (1) | 79.91 (1) | 79.82 (2) | 1.5 |
| 4 | CM1 | Without FS | 75.76 (6) | 80.30 (6) | 77.27 (6) | 83.33 (5) | 5.75 |
| | | FSDE | 86.82 (3) | 89.70 (1) | 83.48 (3) | 88.48 (2) | 2.25 |
| | | FSPSO | 83.33 (5) | 83.18 (5) | 81.97 (5) | 85 (4) | 4.75 |
| | | FSGA | 85.30 (4) | 88.94 (2) | 83.18 (4) | 88.48 (2) | 3 |
| | | FSACO | 87.42 (2) | 87.27 (3) | 84.55 (1) | 88.18 (3) | 2.25 |
| | | FSCOA | 88.33 (1) | 83.64 (4) | 84.39 (2) | 88.79 (1) | 2 |
| 5 | MC1 | Without FS | 96.48 (6) | 97.74 (6) | 95.73 (4) | 97.49 (5) | 5.25 |
| | | FSDE | 97.74 (3) | 98.57 (3) | 97.71 (2) | 97.74 (3) | 2.75 |
| | | FSPSO | 97.56 (5) | 98.49 (4) | 96.31 (3) | 97.49 (4) | 4 |
| | | FSGA | 97.59 (4) | 98.67 (1) | 97.71 (2) | 97.74 (3) | 2.5 |
| | | FSACO | 98.02 (1) | 98.34 (5) | 97.74 (1) | 97.76 (2) | 2.25 |
| | | FSCOA | 97.94 (2) | 98.59 (2) | 97.71 (2) | 97.96 (1) | 1.75 |
| 6 | MC2 | Without FS | 76 (6) | 68 (6) | 92 (5) | 84 (4) | 5.25 |
| | | FSDE | 87.6 (3) | 90.8 (1) | 95.6 (2) | 96 (1) | 1.75 |
| | | FSPSO | 80 (5) | 75.2 (5) | 92.8 (4) | 88.4 (3) | 4.25 |
| | | FSGA | 85.2 (4) | 89.6 (2) | 93.2 (3) | 95.2 (2) | 2.75 |
| | | FSACO | 89.2 (2) | 86 (3) | 96 (1) | 95.2 (2) | 2 |
| | | FSCOA | 94.4 (1) | 82.4 (4) | 96 (1) | 96 (1) | 1.75 |
| 7 | PC1 | Without FS | 89.36 (6) | 88.65 (5) | 87.23 (6) | 86.52 (6) | 5.75 |
| | | FSDE | 93.76 (3) | 95.04 (1) | 91.21 (3) | 93.26 (3) | 2.5 |
| | | FSPSO | 90.85 (5) | 92.06 (4) | 89.65 (5) | 89.72 (5) | 4.75 |
| | | FSGA | 93.48 (4) | 95.04 (1) | 90.64 (4) | 93.48 (2) | 2.75 |
| | | FSACO | 93.97 (2) | 93.97 (2) | 92.63 (2) | 92.91 (4) | 2.5 |
| | | FSCOA | 94.40 (1) | 92.77 (3) | 92.77 (1) | 93.83 (1) | 1.5 |
| 8 | PC2 | Without FS | 96.64 (6) | 95.30 (6) | 93.96 (6) | 97.32 (6) | 6 |
| | | FSDE | 97.79 (3) | 98.66 (1) | 96.98 (3) | 97.58 (3) | 2.5 |
| | | FSPSO | 97.45 (5) | 96.51 (5) | 95.84 (5) | 97.38 (5) | 5 |
| | | FSGA | 97.65 (4) | 98.32 (2) | 96.31 (4) | 97.48 (4) | 3.5 |
| | | FSACO | 97.89 (2) | 97.25 (4) | 97.22 (2) | 98.12 (2) | 2.5 |
| | | FSCOA | 97.99 (1) | 97.85 (3) | 97.32 (1) | 98.19 (1) | 1.5 |
| 9 | PC3 | Without FS | 82.41 (6) | 78.70 (6) | 68.98 (6) | 62.03 (6) | 6 |
| | | FSDE | 86.34 (3) | 86.39 (2) | 86.85 (3) | 86.44 (4) | 3 |
| | | FSPSO | 84.77 (5) | 82.92 (5) | 80.83 (5) | 83.38 (5) | 5 |
| | | FSGA | 85.93 (4) | 86.57 (1) | 86.81 (4) | 86.76 (3) | 3 |
| | | FSACO | 86.71 (2) | 84.44 (4) | 87.08 (2) | 86.82 (2) | 2.5 |
| | | FSCOA | 87.04 (1) | 85.83 (3) | 87.18 (1) | 86.90 (1) | 1.5 |
| 10 | MW1 | Without FS | 78.43 (6) | 74.51 (6) | 76.47 (6) | 80.39 (6) | 6 |
| | | FSDE | 87.25 (2) | 87.84 (1) | 83.53 (3) | 88.63 (3) | 2.25 |
| | | FSPSO | 84.71 (5) | 82.75 (5) | 78.82 (5) | 84.31 (5) | 5 |
| | | FSGA | 85.69 (4) | 87.06 (2) | 82.16 (4) | 86.67 (4) | 3.5 |
| | | FSACO | 86.67 (3) | 85.29 (4) | 87.25 (2) | 90.39 (1) | 2.5 |
| | | FSCOA | 87.45 (1) | 85.68 (3) | 89.02 (1) | 89.80 (2) | 1.75 |
| 11 | PC4 | Without FS | 84.49 (6) | 91.09 (6) | 86.82 (6) | 47.67 (6) | 6 |
| | | FSDE | 90.11 (3) | 93.45 (2) | 91.59 (3) | 92.71 (2) | 2.5 |
| | | FSPSO | 86.63 (5) | 92.64 (4) | 89.11 (5) | 86.98 (5) | 4.75 |
| | | FSGA | 87.60 (4) | 93.53 (1) | 91.74 (2) | 92.49 (3) | 2.5 |
| | | FSACO | 91.16 (2) | 92.40 (5) | 91.40 (4) | 91.82 (4) | 3.75 |
| | | FSCOA | 91.74 (1) | 92.95 (3) | 92.33 (1) | 92.75 (1) | 1.5 |
| 12 | PC5 | Without FS | 67.06 (6) | 72.59 (6) | 70.55 (6) | 69.39 (6) | 6 |
| | | FSDE | 76.33 (3) | 77.73 (2) | 71.57 (3) | 72.57 (3) | 2.75 |
| | | FSPSO | 71.98 (5) | 73.53 (5) | 70.82 (5) | 70.59 (5) | 5 |
| | | FSGA | 75.63 (4) | 77.81 (1) | 71.46 (4) | 72.92 (1) | 2.5 |
| | | FSACO | 76.85 (2) | 75.16 (4) | 72.19 (2) | 71.11 (4) | 3 |
| | | FSCOA | 78.63 (1) | 77.23 (3) | 72.45 (1) | 72.71 (2) | 1.75 |

Table 5,59 summarizes the average rank of each configuration (FSDE, FSPSO, FSGA, FSACO, FSCOA, and Without FS) across all the datasets, computed using Eq. (20).

(20)
$AverageRank_{Datasets} = \frac{\sum AverageRank_{Models}}{\text{Total number of Datasets}\ (B)}$

Table 5. AvgRank of all FS configurations.

| Sl. No. | Datasets | Without FS | FSDE | FSPSO | FSGA | FSACO | FSCOA |
|---|---|---|---|---|---|---|---|
| 1 | KC1 | 5.75 | 2.5 | 4.75 | 3 | 2 | 2.25 |
| 2 | KC3 | 5.75 | 2.25 | 4.75 | 2.75 | 2.5 | 2.25 |
| 3 | JM1 | 6 | 3.25 | 5 | 3.25 | 2 | 1.5 |
| 4 | CM1 | 5.75 | 2.25 | 4.75 | 3 | 2.25 | 2 |
| 5 | MC1 | 5.25 | 2.75 | 4 | 2.5 | 2.25 | 1.75 |
| 6 | MC2 | 5.25 | 1.75 | 4.25 | 2.75 | 2 | 1.75 |
| 7 | PC1 | 5.75 | 2.5 | 4.75 | 2.75 | 2.5 | 1.5 |
| 8 | PC2 | 6 | 2.5 | 5 | 3.5 | 2.5 | 1.5 |
| 9 | PC3 | 6 | 3 | 5 | 3 | 2.5 | 1.5 |
| 10 | MW1 | 6 | 2.25 | 5 | 3.5 | 2.5 | 1.75 |
| 11 | PC4 | 6 | 2.5 | 4.75 | 2.5 | 3.75 | 1.5 |
| 12 | PC5 | 6 | 2.75 | 5 | 2.5 | 3 | 1.75 |
| | Average | 5.8 | 2.52 | 4.75 | 2.92 | 2.48 | 1.75 |
| | Rank | AvgRank6 | AvgRank3 | AvgRank5 | AvgRank4 | AvgRank2 | AvgRank1 |

The average ranks of all the compared configurations in this observation were: {AvgRank1 = 1.75 (FSCOA), AvgRank2 = 2.48 (FSACO), AvgRank3 = 2.52 (FSDE), AvgRank4 = 2.92 (FSGA), AvgRank5 = 4.75 (FSPSO), AvgRank6 = 5.8 (Without FS)}. These average ranks were used to compute the Friedman statistic, $X_F^2$, via Eq. (21), yielding a value of 23.29.

(21)
$X_F^2 = \frac{12 \times B}{A \times (A+1)} \left[\sum_{i=1}^{6} \big(AvgRank(i)\big)^2 - \frac{A \times (A+1)^2}{4}\right]$

Twelve datasets (B = 12) and six models (A = 6) were considered in this experiment. The Friedman statistic ( $F_F$ ) was then computed from (B − 1) and $X_F^2$ using Eq. (22).

(22)
$F_F = \frac{(B-1) \times X_F^2}{B \times (A-1) - X_F^2}$
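As a quick arithmetic check, the sketch below recomputes $X_F^2$ from Eq. (21) and $F_F$ from Eq. (22) using the average ranks listed above; it is a verification aid, not the authors' code.

```python
# Recompute the Friedman statistics from the average ranks in Table 5.
avg_ranks = [1.75, 2.48, 2.52, 2.92, 4.75, 5.8]   # AvgRank1..AvgRank6
A, B = 6, 12                                       # configurations, datasets

chi2_F = (12 * B) / (A * (A + 1)) * (sum(r * r for r in avg_ranks) - A * (A + 1) ** 2 / 4)
F_F = ((B - 1) * chi2_F) / (B * (A - 1) - chi2_F)

print(round(chi2_F, 2), round(F_F, 3))             # ~23.29 and ~6.978, matching the text
```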

The value of $F_F$ was estimated to be 6.978. The critical value was determined as 2.383 using (6 − 1 = 5) × (12 − 1 = 11) and (6 − 1 = 5) degrees of freedom at a significance level of α = 0.05. Since the Friedman statistic ( $F_F$ = 6.978) exceeds the critical value of 2.383, the null hypothesis is rejected and the alternative hypothesis is accepted, implying that at least two configurations differ from one another. Once the null hypothesis is rejected and the alternative hypothesis is accepted, the Holm method is usually employed as the post hoc test. Using the Holm technique, the p-value and z-value were applied to assess how well each model performed relative to the others.52 Eq. (23) was used to obtain the z-value, and the p-value was derived from the z-value using the standard normal distribution table.

(23)
$z = \frac{AvgRank(i) - AvgRank(j)}{\sqrt{\frac{A \times (A+1)}{6 \times B}}}$

In this experiment, B, A, and z denote the number of datasets, the number of configurations employed in this investigation, and the z-value, respectively. The terms AvgRank(i) and AvgRank(j) represent the average ranks of the ith and jth models, respectively. The p-value, z-value, and α/(A−i) of the compared configurations are summarized in Table 6. For this particular instance, the significance level α was set to 0.05.

Table 6,59 shows that in most cases the p-value is lower than or equal to α/(A−i), with the exception of the FSCOA–FSGA and FSCOA–FSACO comparisons. This indicates that the FSCOA model is statistically significant and performs better than the other configurations, excluding the FSGA and FSACO models, whose performance did not differ from FSCOA's in a statistically significant way.

Table 6. Holm procedure.

| Sl. No. | Models used in FS | z-value | p-value | α/(A−i) |
|---|---|---|---|---|
| 1 | FSCOA : Without FS | 5.307 | 0.00001 | 0.01 |
| 2 | FSCOA : FSGA | 1.533 | 0.06 | 0.0125 |
| 3 | FSCOA : FSDE | 1.009 | 0.156 | 0.0166 |
| 4 | FSCOA : FSPSO | 3.931 | 0.000042 | 0.025 |
| 5 | FSCOA : FSACO | 0.956 | 0.16 | 0.05 |
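The Holm comparisons in Table 6 can be reproduced from Eq. (23), as in the sketch below. Converting each z-score into a one-sided p-value via the standard normal CDF is an assumption about how the table was derived, so small discrepancies against Table 6 may reflect rounding of the average ranks or table lookup in the original.

```python
# Recompute the Holm z-values and p-values of Table 6 from Eq. (23).
from math import sqrt, erf

A, B = 6, 12
avg = {"FSCOA": 1.75, "FSACO": 2.48, "FSDE": 2.52,
       "FSGA": 2.92, "FSPSO": 4.75, "Without FS": 5.8}

def z_value(r_i, r_j):
    # Eq. (23): standardized difference of average ranks.
    return (r_i - r_j) / sqrt(A * (A + 1) / (6 * B))

def p_value(z):
    # One-sided upper-tail probability of the standard normal distribution.
    return 1 - 0.5 * (1 + erf(z / sqrt(2)))

for other in ["Without FS", "FSGA", "FSDE", "FSPSO", "FSACO"]:
    z = z_value(avg[other], avg["FSCOA"])
    print(f"FSCOA vs {other}: z = {z:.3f}, p = {p_value(z):.6f}")
```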

Threats to validity

Any empirical study must analyze the threats to the reliability of its investigatory observations and address them appropriately. This section reports the threats to the validity of the procedure recommended in this experiment.

  • This research utilized twelve standard public NASA datasets extracted from the PROMISE archive. However, how the proposed FSCOA approach behaves on real-world project datasets remains largely unexplored.

  • The behaviour of a fault forecasting model largely depends on the selected applications, the applied classification methods, and the quality of the datasets.53 This experiment employed four supervised learning classifiers (DT, KNN, NB, and QDA) to evaluate the feature subsets chosen from the software defect datasets by popular optimization algorithms, namely DE, PSO, ACO, and GA, alongside the suggested FSCOA approach. However, the performance of the proposed approach may vary when it is combined with other meta-heuristic approaches and classifiers.

  • To gauge the efficacy of the proposed FSCOA approach, this study relied on accuracy, a well-known evaluation criterion, in addition to fitness error. Several other performance metrics could be applied to examine the impact of the proposed approach on the defect prediction model more precisely. Likewise, the study in its current form used only two widely used statistical tests to establish the model’s validity, which may restrict the statistical findings of the suggested defect prediction model.

Conclusion

This study proposed a novel FS approach, referred to as FSCOA, based on a meta-heuristic technique called the Chernobyl Disaster Optimizer (CDO), to select the finest traits from a software defect dataset and thereby significantly enhance the predictive accuracy of the defect forecasting model. The proposed FSCOA technique mimics the disruption of a nuclear reactor core to determine the best attributes while carefully discarding irrelevant or insignificant ones. This study investigated the impact of the proposed FSCOA approach on twelve publicly available NASA datasets, taken from the PROMISE archive, using four widely used classifiers (DT, KNN, NB, and QDA). The work was intended to enhance the classification performance of the defect prediction model using the optimal features and, in addition, to compare the predictive behaviour of the proposed FSCOA approach with existing FS techniques, namely FSDE, FSPSO, FSACO, and FSGA. The experimental data suggested that the proposed FSCOA technique improved the predictive performance of the defect forecasting model. Further, the statistical validity of the proposed FSCOA-based forecasting model was investigated using the Friedman test, whose outcome showed that at least two models were significantly different, leading to the rejection of the null hypothesis and necessitating the Holm post hoc test. In this regard, the experimental findings suggested that the proposed FSCOA approach outperformed the studied FS procedures when selecting the optimal set of features. However, the behaviour of the proposed FSCOA approach may vary across datasets and classifiers. In the future, we aim to expand the scope of this research by employing real-world project datasets. We also intend to investigate the efficiency of the suggested FSCOA approach by increasing the number and variety of classifiers, especially ensemble classifiers, by employing more optimization algorithms for feature selection, and by exploring other widely used performance measures.
