ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Article

Toward Effective Educational Supervision in Yemen: A Hybrid Fuzzy Delphi and Clustering Analysis of Technical Barriers

[version 1; peer review: awaiting peer review]
PUBLISHED 22 Sep 2025
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS AWAITING PEER REVIEW

Abstract

Objectives

Comprehensive educational supervision is essential for ensuring quality teaching, fostering professional development, and supporting institutional capacity building. However, its implementation encounters numerous structural, technical, and human resource challenges. This study aimed to identify, validate, rank, and cluster the technical barriers affecting comprehensive educational supervision in Amanat Al Asimah, Yemen. This aligns with national reform goals by offering strategic insights to improve supervisory systems, thereby enhancing teaching quality, institutional performance, and educational resilience in fragile contexts

Methods

This study employed a three-phase mixed-methods approach. Initially, a literature review identified 11 key barriers to effective supervision. These were validated using the Fuzzy Delphi Method (FDM), involving 16 experienced educational supervisors to assess the consensus and suitability of the items. Subsequently, a quantitative survey targeting 370 teachers was conducted to evaluate their perceived severity. Fuzzy set theory was used to aggregate and defuzzify the responses, generating crisp scores for prioritization. Finally, K-means clustering was applied to segment the barriers based on their impacts.

Results

FDM analysis confirmed the validity of all 11 identified barriers, with a domain-level threshold of 0.093 and an average expert consensus of 90%, indicating strong agreement. The fuzzy set-based evaluation highlighted three top-priority challenges: weak supervisory competencies, limited ability to develop effective supervisory plans, and poor supervisor-teacher relationships. K-means clustering grouped the barriers into three segments: one high-priority barrier, seven moderate-priority concerns, and three low-priority issues. Notably, weak supervisory competencies emerged as the most critical barrier, isolated in a high-priority cluster.

Conclusion

These findings provide evidence-based guidance for policy and strategic interventions aimed at enhancing the effectiveness of supervision systems in fragile educational settings. The study concludes with recommendations for strengthening supervisory competencies, improving resource allocation, and fostering trust-based supervisor-teacher relationships, thereby contributing to the quality of education and institutional resilience in Yemen.

Keywords

educational supervision; Fuzzy Delphi Method; Yemen; fuzzy set theory; K-means ; clustering; supervisory competencies

1. Introduction

The current global shifts in science and education have significantly impacted various educational dimensions, including systems and supervision processes. These transformations have compelled educators to critically examine educational structures and reaffirm the teacher’s role as a central pillar within the system.1 Within this context, educational supervision has emerged as a pivotal component of educational management, fundamental to the broader educational framework that encompasses both teaching and learning.2 Educational supervision serves as a cornerstone for ensuring educational quality and promoting continuous improvement. It plays a vital role in enhancing teacher performance by offering professional guidance, fostering reflective practice, and aligning teaching with curriculum standards.3 Historically limited to classroom observation, the scope of supervision has expanded to include mentorship, collaborative problem-solving, and the development of targeted professional growth plans.4

In today’s dynamic educational landscape—shaped by digital transformation and evolving pedagogical expectations—supervisors are instrumental in helping educators integrate technology, maintain digital ethics, and implement innovative teaching strategies suited to 21st-century learners.5 Beyond instructional support, effective supervision also nurtures teachers’ emotional well-being and professional development, ensuring that education remains inclusive, adaptive, and student-centered.6 Modern supervisory models now emphasize collaborative learning environments and constructive feedback mechanisms, thereby cultivating a culture of continuous improvement.7 As Okafor et al. note, effective supervision not only enhances instructional practices but also strengthens institutional effectiveness by fostering an environment conducive to both teacher and student success.8 Given its multifaceted nature, educational supervision continues to evolve—requiring ongoing research and innovation to meet the complex demands of contemporary education.

The success of educational supervision hinges on several critical indicators. Supervisors must offer clear directives and align instructional practices with institutional objectives.9 Emphasis should be placed on implementing quality assurance mechanisms that align with global standards while addressing local educational contexts.9 Continuous professional development is essential in fostering a culture that supports quality teaching and lifelong learning.10,11

Moreover, supervision becomes most effective when teachers actively engage with feedback, thereby promoting a culture of accountability and improvement.12 Effective planning is equally important—supervisors must ensure that their approach is strategic, goal-oriented, and systematic.13,14 Competent supervisors are characterized by strong instructional leadership, effective communication, accurate observation, and constructive feedback delivery.15,16 Building trust-based professional relationships between supervisors and teachers further enhances collaboration and facilitates open communication.17,18 Additionally, the integration of modern technology and the availability of financial and learning resources significantly contribute to the success of supervision practices.19 These factors collectively support a dynamic, responsive, and impactful supervision system.

Ultimately, educational supervision focuses on actively involving teachers in the learning environment to enhance teaching methods and improve student achievement. It plays a central role in overseeing the educational process by integrating instructional activities and continuously evaluating institutional outcomes.2,20 Through identifying challenges and working collaboratively to resolve them, supervision ensures the implementation of appropriate actions that drive educational advancement.

Numerous studies indicate that educational supervision, in its current state, faces significant challenges that hinder its effectiveness. Including inadequate training programs,10 limited planning,21 weak supervisory competencies,15 low teacher acceptance of modern methods,22 strained relationships,18 technological gaps,19 and insufficient financial, infrastructural, and learning resources.2,23 To overcome these barriers, studies recommend structured professional development, strategic planning, competence-based supervisor selection,16 fostering collaborative cultures,24 integrating digital tools,25 securing adequate funding,26 and providing supportive environments to strengthen supervision’s impact.

Comprehensive educational supervision is a fundamental approach employed in schools to assess and respond to their challenges, needs, and educational and social realities. It involves the continuous presence of educational supervisors within school environments to gain firsthand insights into the teaching and learning processes both inside and outside the classroom. Through this approach, supervisors are able to diagnose the real conditions of the educational system—including the roles of teachers, learners, school leadership, and the broader learning environment. However, in Yemen, the ability to implement effective supervision has been severely undermined by ongoing conflict and political instability. The education system, which once showed progress with enrollment rates increasing from 71.3% to 97.5% between 1999 and 2013, has since experienced significant setbacks. By the 2015/2016 academic year, around 1.8 million children were out of school, and over 1,100 schools had been either destroyed or repurposed for emergency use.27 These structural disruptions have weakened the foundation of educational supervision. Furthermore, the quality of education has declined significantly, with reports of curriculum manipulation for ideological purposes and widespread issues such as the absence of teacher training and salary payments.28 In response to these profound challenges, Yemen’s National Vision for Building the Modern Yemeni State and the national research priorities emphasize the urgent need for comprehensive education reform.

In line with these priorities, this study aims to analyze the technical challenges facing educational supervision in schools located in Amanat Al Asimah and explore viable strategies to improve the efficiency of educational processes. The focus is on supporting national goals by modernizing supervisory practices and ensuring they contribute meaningfully to sustainable development and institutional capacity building. The study seeks to identify, rank, and cluster these challenges in schools within Amanat Al Asimah, Yemen, while exploring viable strategies to enhance the efficiency of educational supervision processes.

The significance of this study lies in its alignment with Yemen’s National Vision for Building a Modern Yemeni State and national research priorities. It contributes to modernizing supervisory practices in a conflict-affected context, supports sustainable development, and enhances institutional capacity in Yemen’s education sector. Additionally, it provides insights into addressing teacher-related challenges and improving the quality of education.

Methodologically, the study employs a mixed-methods approach that includes a literature review to identify key barriers to effective supervision, the Fuzzy Delphi Method to validate and prioritize these barriers, a quantitative survey of teachers to assess their perceptions of supervisory challenges, and fuzzy set theory and K-means clustering analysis to categorize and prioritize the identified barriers.

The findings of this study could inform targeted policy interventions to enhance supervisory effectiveness, guide resource allocation for improving educational supervision, support capacity-building initiatives for supervisors, and contribute to developing a responsive supervision framework aligned with national development objectives.

2. Methods

Multi-Criteria Decision-Making (MCDM) methods have emerged as vital analytical tools in a wide range of fields, including health, sustainability, tourism, healthcare, engineering, education and financial risk assessment.29–39 The integration of MCDM methods with fuzzy logic and clustering techniques provides a robust framework for addressing complex decisions under uncertainty.29–33 Fuzzy-based MCDM approaches effectively capture expert ambiguity and contextual nuances, making them particularly suitable for fields such as health, education, and sustainability.35,38–40 The Fuzzy Delphi Method (FDM), enhanced by fuzzy set theory, improves expert consensus and facilitates clearer selection and prioritization of variables.35,38,41,42

Meanwhile, machine learning (ML) has significantly advanced real-world decision-making through predictive modeling, risk analysis, and pattern recognition.43–48 Its applications span various domains, including healthcare,43 big data analytics,44 manufacturing quality control,45 and infrastructure forecasting.46 ML also supports academic evaluation through web mining47 and Industry 4.0 applications.48

Recent research increasingly integrates MCDM, fuzzy logic, and unsupervised ML techniques—particularly K-means clustering—to enhance decision-making by segmenting barriers and identifying strategic priorities.29–33 This hybrid approach has been successfully applied in health security,29–33 ecotourism,39 urban resilience assessment,49 and education,40,50 offering scalable, context-sensitive solutions to complex policy and planning challenges.

Building on these methodological advancements and empirical applications, the present study adopts a similar integrative approach to explore and prioritize barriers in the context of educational supervision in Yemen.

In the present study, the Fuzzy Delphi Method (FDM) was employed as a preliminary phase to identify and validate the most contextually relevant barriers to effective educational supervision in Yemen. The primary purpose of the FDM was not to evaluate or resolve the barriers but to systematically screen and refine a set of candidate variables based on expert consensus for use in subsequent analyses. This was followed by a quantitative teacher survey assessing perceptions of supervisory challenges and the application of fuzzy set theory and K-means clustering to categorize and prioritize the identified barriers. Our analysis was conducted through a structured, three-phase process, with each stage clearly outlined and supported by a rationale for the chosen methodology.

2.1 Technical challenges in comprehensive educational supervision – Literature review

A substantial body of research underscores the significance of educational supervision in enhancing instructional quality, fostering professional development, and aligning teaching practices with institutional and national objectives. Despite its critical role, several technical challenges impede the effectiveness of educational supervision, particularly in fragile contexts such as Yemen. A key issue identified in the literature is teachers’ lack of interest in the instructions and directives provided by supervisors. This disinterest undermines efforts to implement improvements in teaching practices and reflects broader issues of disengagement and accountability.9,12,51,52 This is due to the weak capacity of some educational supervisors to develop effective supervisory plans, which are essential for guiding systematic, goal-oriented oversight.13,14,53,54 Without proper planning, supervision can become inconsistent, fragmented, and reactive. Another challenge is the limited ability to develop and implement effective training programs, which are vital for equipping teachers with up-to-date skills and aligning instructional methods with evolving standards.10,21

The literature also highlights the lack of supervisory competencies among some educational supervisors, including deficiencies in instructional leadership, communication, and feedback delivery.15,16,19 Such gaps diminish the credibility and impact of the supervision. Equally critical is the resistance among some teachers to modern supervision methods such as collaborative evaluation and peer feedback. This resistance can limit the effectiveness of reforms intended to make supervision more participatory and data-driven.22,24,55–57 Moreover, weak or distant professional relationships between supervisors and teachers hinder open communication, reduce trust, and lead to ineffective feedback cycles.17,18,58,59

Technological and infrastructural deficiencies also pose significant obstacles. The absence or inadequacy of modern technologies and digital tools, such as learning management systems, video conferencing platforms, and data analytics software, limits the efficiency and adaptability of supervision practices.19,25,59,60 Furthermore, the lack of financial resources allocated to supervision hampers planning, professional development, transportation, and the acquisition of the necessary tools.8,23,61 A closely related issue is the inadequacy of supportive learning resources in schools, such as libraries, science laboratories, Internet access, and computers. These are essential for both effective teaching and meaningful supervisory assessment.19,62–64

Additionally, the lack of dedicated training and rehabilitation halls limits opportunities for continuous professional development, and the absence of a specialized office for managing supervision compromises the coordination and institutionalization of supervisory activities.2

2.2 Phase 2-assessing the suitability of research variables using the fuzzy delphi method.

The Fuzzy Delphi Method (FDM) is a widely used technique for assessing the suitability of research indicators in measuring specific phenomena within defined environments because of its ability to combine expert judgment with the flexibility of fuzzy logic.35,39,40 Often, determining the relevance of indicators requires subjective input from domain experts, whose perspectives may vary or be influenced by uncertainties arising from complex contextual conditions.65 FDM addresses this challenge by allowing experts to provide assessments using linguistic terms, which are then converted into fuzzy numbers, enabling the method to handle ambiguity and imprecision more effectively.35,65 Compared to the traditional Delphi method,41 which often requires multiple rounds of consultation to reach a consensus and may lead to misinterpretation or loss of expert interest due to repetitive rounds, FDM enhances efficiency by minimizing the number of iterations needed. It also reduces the risk of bias introduced through forced modifications of expert opinions and lowers the overall cost and time required for data collection.35,39

Additionally, the FDM increases survey response rates, improves the reliability of expert feedback, and effectively resolves issues related to non-consensus cases. This helps make evaluations more consistent and clear, keeping expert opinions genuine while encouraging agreement and realism in finding relevant indicators. The method further provides a systematic approach to filter and validate indicators by applying a consensus threshold, ensuring the retention of only the most relevant and context-appropriate variables. Owing to its adaptability across domains and environments, FDM enables experts to assess indicator suitability based on theoretical soundness and practical relevance to the target context.35,39,42 Overall, by merging expert knowledge, fuzzy logic, and a structured consensus mechanism, the Fuzzy Delphi Method (FDM) enhances the robustness, credibility, and contextual relevance of selected indicators, making it particularly valuable in complex evaluation frameworks, such as the assessment of educational barriers in low-income countries and other context-sensitive settings.

In this study, the following FDM procedural steps were implemented42,65:

  • • Step 1: Identification of Key Evaluation Barriers through Literature Review

Based on the literature reviewed in Section 2.1, 11 critical technical barriers impacting the effectiveness of educational supervision were identified. These include (B1) teachers’ disinterest in supervisors’ directives, (B2) Limited capability of certain educational supervisors in creating effective supervisory plans, (B3) Limited capability of certain educational supervisors in creating and implementing effective supervisory training programs, (B4) Weak competencies of educational supervisors in supervision, (B5) limited teacher acceptance of modern supervisory methods, (B6) poor relationships between supervisors and teachers, (B7) lack of access to modern technologies, (B8) inadequate financial allocations for supervision activities, (B9) insufficient availability of learning resources in schools, (B10) Absence of training and rehabilitation halls, and (B11) lack of dedicated offices for educational supervision. These barriers represent the foundational factors proposed for expert evaluation, as addressing them is essential to improve the quality of comprehensive educational supervision and ensure its alignment with Yemen’s broader educational development objectives.

  • • Step 2: Expert Selection and Data Collection

In accordance with FDM guidelines, this study purposively selected a panel of experts to ensure homogeneity, as recommended in previous research.39 A total of 16 educational supervisors were chosen based on their direct involvement in educational supervision and extensive experience—each with over 10 years in the field. A structured questionnaire was used to collect data, consisting of 11 barriers to effective supervision. Fifteen printed questionnaires were distributed and collected manually. Experts were asked to rate the relevance and contextual suitability of each barrier within the Yemeni educational environment using a five-point Likert scale (1 = strongly disagree, 5 = strongly agree). The collected responses (see Table 1) were then converted into fuzzy numbers using the conversion scale proposed by.42,65( Table 2).

  • • Step 3: Fuzzy Data Analysis

    • ā—‹ Step 3.1: Aggregation of Fuzzy Ratings

Table 1. Experts’ scores using a five-point likert scale.

Expert NoB1B2B3B4B5B6B7B8B9B10 B11
E1 45454455544
E2 45555554534
E3 45555545544
E4 45454455544
E5 44545555534
E6 54445545544
E7 45454455544
E8 45555555544
E9 55455555543
E10 45454555544
E11 44545555544
E12 55454445543
E13 45454555534
E14 45555455544
E15 45454455544
E16 45555555544

Table 2. Mapping likert to fuzzy scoring scale.

Likert scale scoringLinguistic variableFuzzy Scale scoring
m1m2 m3
5 Highly Agree0.60.81
4 Agree0.40.60.8
3 Moderately/Not sure0.20.40.6
2 Not Agree00.20.4
1 Highly Not Agree000.2

In fuzzy set theory, each element is characterized by a pair: the value and its degree of membership in the fuzzy set, defined within the [0,1] interval. A membership function quantifies this degree, with values closer to 1 indicating a stronger inclusion in the fuzzy set. As detailed in Table 2, fuzzy values were used to represent the uncertain expert responses (see Table 3).

Table 3. Experts’ scores using a fuzzy rating scale.

Expert NoB1B2… .B11
m1m2m3m1m2m3… .m1m2 m3
E1 0.40.60.80.60.61… .0.40.60.8
E2 0.40.60.80.60.81… .0.40.60.8
E3 0.40.60.80.60.81… .0.40.60.8
E4 0.40.60.80.60.81… .0.40.60.8
E5 0.40.60.80.40.60.8… .0.40.60.8
E6 0.60.810.40.60.8… .0.40.60.8
E7 0.40.60.80.60.81… .0.40.60.8
E8 0.40.60.80.60.81… .0.40.60.8
E9 0.60.810.60.81… .0.20.40.6
E10 0.40.60.80.60.81… .0.40.60.8
E11 0.40.60.80.40.60.8… .0.40.60.8
E12 0.60.810.60.81… .0.20.40.6
E13 0.40.60.80.60.81… .0.40.60.8
E14 0.40.60.80.60.81… .0.40.60.8
E15 0.40.60.80.60.81… .0.40.60.8
E16 0.40.60.80.60.81… .0.40.60.8
AFR 0.4380.6380.8380.5630.7500.963… .0.3750.5750.775
CS 0.6380.758… .0.575

The average fuzzy rating (AFR) for each variable (barrier z) was then calculated across all n experts using the following formula:

(1)
wz~=(az.bz.cz)=(1nāˆ‘r=1narz.1nāˆ‘r=1nbrz.1nāˆ‘r=1ncrz)
  • ā—‹ Step 3.2: Defuzzification

The fuzzy values for each variable were then converted into crisp scores (CS) (See Table 3) using the centroid method (center of gravity), defined as:

(2)
Dz=(az.bz.cz)3
  • • Step 4: Evaluation of Domain Acceptability

    • ā—‹ Step 4.1: Distance Calculation

As shown in Table 4, the deviation between each expert’s fuzzy rating and the aggregated fuzzy score was computed using the following formula:

(3)
Drz=(azāˆ’arz)2+(bzāˆ’brz)2+(czāˆ’crz)23
  • ā—‹ Step 4.2: Threshold Value of Each barrier

Table 4. Deviation calculation of expert fuzzy ratings versus aggregated scores.

Expert No B1 B2 B3 B4 B5 B6 B7 B8 B9 B10 B11
E1 0.10.20.10.10.20.20.10.00.00.10.0
E2 0.10.10.20.10.10.10.10.30.00.20.0
E3 0.10.10.20.10.10.10.20.00.00.10.0
E4 0.10.10.10.10.20.20.10.00.00.10.0
E5 0.10.20.20.20.10.10.10.00.00.20.0
E6 0.20.20.10.20.10.10.20.00.00.10.0
E7 0.10.10.10.10.20.20.10.00.00.10.0
E8 0.10.10.20.10.10.10.10.00.00.10.0
E9 0.20.10.10.10.10.10.10.00.00.10.3
E10 0.10.10.10.10.20.10.10.00.00.10.0
E11 0.10.20.20.20.10.10.10.00.00.10.0
E12 0.20.10.10.10.20.20.20.00.00.10.3
E13 0.10.10.10.10.20.10.10.00.00.20.0
E14 0.10.10.20.10.10.20.10.00.00.10.0
E15 0.10.10.10.10.20.20.10.00.00.10.0
E16 0.10.10.20.10.10.10.10.00.00.10.0
Threshold Value 0.0930.1040.1500.0930.1500.1430.0930.0360.0000.0930.067
Domain-Level Threshold 0.093 < 0.2

For each variable, a threshold value was calculated by averaging these distances:

(4)
Thz=1nāˆ‘r=1nDrz
  • ā—‹ Step 4.3: Domain-Level Threshold

The overall threshold for the entire domain of variables was then determined by averaging the threshold values of all factors:

(5)
Thdomain=1nāˆ‘z=1mThz

A domain and its variables were considered valid if their threshold values were ≤ 0.2.42,65 Barriers with threshold values below this criterion were deemed acceptable for further analysis. In this study, As shown in Table 4, the overall domain threshold was calculated as 0.093, while all individual variable threshold values were found to be less than 0.2.

  • • Step 5: Assessing Expert Consensus

To determine expert consensus on each factor, the number of experts (n) whose ratings were within a fuzzy distance of ≤ 0.2 from the average fuzzy rating ( Ez) was calculated (see Table 5). The percentage of these experts EAz was defined as:

(6)
EAz=Ezn%

Table 5. Determining expert consensus through fuzzy distance metrics.

Expert nB1B2B3B4B5B6B7B8B9B10 B11
Number of experts, d ≤ 0.2 1313161316161315161314
Percentage of Experts (%), d ≤ 0.2 81811008110010081941008188
Average 90%

Consensus was measured by assessing the proportion of experts whose ratings deviated from the group average by less than or equal to 0.2, as indicated in Equation (6). A consensus threshold of 75% was adopted. All 11 barriers met or exceeded this threshold. Subsequently, the final crisp scores were used to rank the barriers in terms of priority, with any factor scoring below 0.4 excluded from further consideration.

  • • Step 6: Final Evaluation and Variable Selection

After confirming the validity of the evaluation domain and identifying variables with sufficient expert consensus, the final step involved analyzing the crisp scores of the accepted barriers. These scores were used to rank the barriers in order of priority, with any variable scoring below 0.4 excluded from further consideration. However, as presented in the results section, all 11 barriers achieved scores above the threshold and were therefore accepted and retained for further analysis.

2.3 Barrier prioritization in educational supervision via fuzzy set theory and cluster segmentation

To investigate the technical barriers impacting comprehensive educational supervision in schools in Sana’a, Yemen, a structured questionnaire was developed. The items in this instrument were derived from key barriers previously identified and validated through the Fuzzy Delphi Method. The questionnaire aimed to capture teachers’ perceptions of the supervisory challenges within their institutional contexts.

Step 1: Sample of the Study

A descriptive quantitative approach was employed, aligning with the study’s objective to assess systemic supervisory barriers from the teachers’ perspective. The study population comprised approximately 7,000 teachers working in schools that implemented comprehensive educational supervision in the capital city. Following Krejcie and Morgan’s sampling guidelines, a sample of 370 teachers was selected, surpassing the recommended minimum of 364 for a population of this size. This sample size ensured representativeness and allowed for statistical generalization to the broader population.

Step 2: Research Instrument Design

To gather the necessary data, a structured questionnaire was designed based on the study objectives, relevant literature, validated prior instruments, and the Fuzzy Delphi procedures outlined in Section 2.2. The instrument consisted of 11 items, each corresponding to a specific technical barrier affecting the effectiveness of comprehensive educational supervision. A five-point Likert scale was used to measure participants’ perceptions of each barrier, with response options ranging from ā€œVery Lowā€ (1 point) to ā€œVery Highā€ (5 points). The full scale included very low (1), low (2), moderate (3), high (4), and very high (5).

This structure enabled respondents to express their views clearly and allowed for the consistent quantification of data for analysis.

Step 3: Data Collection Procedures

The questionnaire was distributed across four educational districts in Amanat Al-asima Sana’a, where comprehensive educational supervision is currently being practiced. These districts were selected from among the ten official educational zones in the city of Amanat Al-Asima, Yemen.

Teachers from private schools that implemented comprehensive supervision practices were also included. Completed questionnaires were collected through school administrations and returned directly to the researcher.

Step 4: Aggregation and Defuzzification of Responses

The collected responses were converted into fuzzy numbers using the predefined scale (see Table 1), enabling the representation of uncertainty in respondents’ judgments. The average fuzzy rating for each barrier was calculated across all respondents using Equation 1. These fuzzy values were then defuzzified into crisp scores using the centroid (center of gravity) method, as described in Equation 2 (see Table 6).

Table 6. Respondents’ scores using a fuzzy rating scale.

Respondent NoB1B2… .B11
m1 m2 m3 m1 m2 m3 … . m1 m2 m3
R1 000.20.60.61… .00.20.4
R2 00.20.400.60.4… .00.20.4
R3 0.60.810.60.81… .0.20.40.6
R4 0.20.40.60.20.40.6… .000.2
R5 000.20.40.40.8… .000.2
R6 0.40.60.800.20.4… .0.40.60.8
R7 0.20.40.60.40.40.8… .0.60.81
R8 0.40.60.80.40.40.8… .0.40.60.8
R9 0.20.40.60.20.40.6… .0.20.40.6
R10 0.20.40.60.20.60.6… .0.20.40.6
R11 0.20.40.60.20.40.6… .0.40.60.8
R12 0.40.60.80.40.40.8… .0.40.60.8
R13 000.2000.2… .000.2
… .. … ..… ..… ..… ..… ..… ..… ..… ..… ..… ..
R369 0.40.60.80.40.20.8… .0.40.60.8
R370 0.60.810.60.61… .0.60.81
AFR 0.2350.4150.6150.2570.4210.648… .0.2560.4260.626
CS 0.4220.442… .0.436

Following the evaluation of the barriers and the computation of the average score for each based on respondents’ input, the aggregated scores were used to rank and categorize the barriers according to their impact on the effectiveness of comprehensive educational supervision.

Step 5. Clustering Analysis and Barrier Prioritization Using K-Means

The aim of the clustering process was to categorize the identified barriers into distinct groups based on their combined impact. This approach allows for a deeper understanding of the relative effects of these factors on the success of comprehensive educational supervision and enables the targeted prioritization of interventions. Clustering is a fundamental unsupervised machine learning technique that organizes unlabeled data into distinct clusters based on inherent similarities, such as proximity or density metrics.29–32

Unlike supervised learning, which relies on labeled outputs for training, clustering reveals hidden patterns without predefined categories, making it essential for exploratory analysis. By uncovering latent structures, clustering aids in hypothesis generation and provides valuable insights into strategic decision-making.66 Among the various clustering techniques, K-means is particularly favored for its simplicity, computational efficiency, and versatility across disciplines.29,30

The algorithm partitions n data points into k clusters by minimizing the variance within each cluster. It follows an iterative process29–33: (1) randomly initializing k centroids; (2) assigning each data point to the nearest centroid; (3) recalculating centroids based on updated cluster memberships; and (4) repeating these steps until the model converges or reaches a predetermined iteration limit.

To determine the optimal number of clusters, the Elbow Method was employed using the Total Sum of Squared Errors (SSE) as a performance metric (see Figure 1). The SSE was computed for k values ranging from one to five. A significant reduction in SSE was observed between k = 1 (0.0055724) and k = 3 (0.0006534), indicating an improved model fit. Beyond k = 3, further reductions in SSE were minimal, suggesting diminishing returns. The inflection point at k = 3, known as the ā€œelbow,ā€ indicates an optimal balance between simplicity and explanatory power. Consequently, k = 3 was selected as the ideal number of clusters.

ab966fb1-c819-49a5-858a-debb1312c09a_figure1.gif

Figure 1. Elbow Method results for determining the optimal number of clusters.

The plot illustrates the correlation between the within-cluster sum of squares (WCSS) and the number of clusters (k).

The final clustering model classified the barriers into a three-tier system based on their aggregated impacts. Segment 3 represents a high-priority danger zone, encompassing barriers with high average ratings from respondents, indicating significant concerns and severity. Segment 2 reflects a moderate level of concern, where the perceived impact of the barriers is notable but less critical. Segment 1 comprises low-priority or minimal-risk barriers, characterized by relatively low average ratings and limited influence on the effectiveness of comprehensive education supervision. This structured grouping enhances the interpretability of the results and provides a practical framework for prioritizing interventions and developing targeted policy responses.

2.4. Ethical approval and consent to participate

This study adhered to the principles of the Declaration of Helsinki and the national guidelines for research involving human participants. Official permission to conduct the study was secured from the Office of Education, Bani-Alhareth, Amanat Al Asimah, Republic of Yemen (Approval Reference No.: [72-9-24), authorizing data collection from participating schools. Participants received clear information about the study’s objectives and procedures and were assured that their participation was entirely voluntary, with the right to withdraw at any time, without consequences. Written informed consent was obtained from all participants before data collection, and no minors were included in this study.

3. Results

3.1 Fuzzy delphi method: Domain acceptability and expert agreement

The evaluation of domain acceptability aimed to assess the consistency of expert responses across all the identified barriers. The process began by calculating the distance between each expert’s fuzzy rating and the aggregated fuzzy score for each barrier. This deviation indicates how closely an individual expert’s judgment aligns with the group’s average perception. The mean of these deviations for each barrier was then computed to establish the threshold value. The results in Table 4 show that all 11 barriers had threshold values below the established cutoff of 0.2, ranging from 0.000 (for B9) to 0.150 (for B3 and B5). These findings suggest a high level of agreement among the experts for each variable. Furthermore, the domain-level threshold, derived by averaging the individual barrier thresholds, was calculated as 0.093, which is well below the acceptable upper limit. This confirmed the internal consistency of the expert evaluations and validated the overall domain of barriers for further analysis using the Fuzzy Delphi Method. Following the confirmation of domain validity, the focus shifted to assessing the expert consensus. This was achieved by identifying the number of experts whose ratings deviated from the group average by no more than 0.2 for each barrier. The percentage of experts who met this condition was calculated. The results in Table 5 reveal that all 11 barriers met or exceeded the minimum consensus threshold of 75%, with agreement levels ranging from 81% to 100%. Specifically, barriers B3, B5, B6, and B9 achieved full consensus (100%), while the lowest consensus rate was 81% for barriers B1, B2, B4, B7, and B10. The overall average expert agreement across all barriers was 90%, indicating a robust level of shared perceptions among the panel. Based on these results, all 11 barriers were accepted and retained for the final prioritization phase using defuzzified crisp scores.

In the final stage of the fuzzy Delphi analysis, the validated barriers were assessed and ranked according to their defuzzified crisp scores ( Table 7). These scores reflect the central tendency of expert judgments converted from fuzzy numbers into precise quantitative values using the centroid method. A threshold score of 0.4 was established to filter out low-priority barriers; any variable scoring below this cutoff was omitted from the final list. The results indicated that all 11 barriers achieved crisp scores well above the minimum threshold, underscoring their significance in the context of educational supervision challenges in Yemen’s medical schools. Among them, the barrier ā€œinsufficient availability of learning resources in schoolsā€ (B9) was ranked highest, with a crisp score of 0.800, followed by ā€œinadequate financial allocations for supervisionā€ (B8) at 0.788, and both ā€œweak competencies of educational supervisorsā€ (B4) and ā€œlack of access to modern technologiesā€ (B7) tied with a score of 0.763. The lowest-ranking yet accepted variables included ā€œlack of dedicated offices for educational supervisionā€ (B11) and ā€œabsence of training and rehabilitation hallsā€ (B10), scoring 0.575 and 0.563, respectively. This ranking provides a clear prioritization of the most pressing barriers and serves as a critical reference point for targeted policy interventions and resource allocation to enhance the effectiveness of comprehensive educational supervision.

Table 7. Defuzzified crisp scores and ranking of educational supervision barriers based on expert evaluation.

BarrierThe average fuzzy rating for each variableRank Decision
B1 0.6389Accepted
B2 0.7585Accepted
B3 0.6888Accepted
B4 0.7633Accepted
B5 0.7137Accepted
B6 0.7256Accepted
B7 0.7633Accepted
B8 0.7882Accepted
B9 0.8001Accepted
B10 0.56311Accepted
B11 0.57510Accepted

3.2. Assessment, ranking and clustering of technical supervision barriers

The sample included a diverse group of educators varying in gender, academic qualifications, years of experience, and school type. Specifically, 30% of the respondents (n = 111) were male and 70% (n = 259) were female. Regarding educational attainment, 40.8% (n = 151) held a high school diploma, 57.8% (n = 214) had a bachelor’s degree, and 1.4% (n = 5) had a master’s diploma or higher. In terms of professional experience, 32.4% (n = 120) had 1–5 years of teaching experience, 24.1% (n = 89) had 6–10 years, and 43.5% (n = 161) had over 10 years.

The questionnaire was distributed almost evenly across school types, with 51.4% of respondents (n = 190) from public schools and 48.6% (n = 180) from private ones. These demographic characteristics ensured that the collected data reflected a broad range of perspectives, enhancing the credibility and depth of the subsequent analysis of the perceived technical barriers.

The demographic data of the sample ( Table 8) indicate a predominantly female teaching workforce, with most holding at least a bachelor’s degree. A significant portion of respondents (43.5%) had more than ten years of teaching experience, suggesting that the dataset included insights from educators with substantial professional exposure. Additionally, the nearly equal distribution of respondents across public and private schools enhances the generalizability of the findings, allowing comparisons between different institutional contexts. Table 8 presents the full distribution of demographic variables among the sample:

Table 8. Demographic distribution of the study sample.

VariableCategoryFrequency Percentage
Gender Male11130.00%
Female25970.00%
Qualification High School15140.80%
Bachelor’s Degree21457.80%
Master’s Degree51.40%
Experience 1–5 years12032.40%
6–10 years8924.10%
More than 10 years16143.50%
School Type Public19051.40%
Private18048.60%

Table 9 presents the average fuzzy ratings for the various barriers impacting the efficacy of comprehensive educational supervision, along with their respective ranks. This evaluation, derived from the respondents’ input, offers a thorough overview of the challenges encountered in educational supervision. Table 10 illustrates the distribution of these barriers, which are categorized into three segments based on their perceived severity and impact. A total of 11 barriers were assessed, culminating in an overall sum of 100.0%. The table also includes the Sum of Squared Errors (SSE) for each segment. This distribution underscores the varying levels of concern associated with each segment, providing a clear overview of the challenges that require future attention. Segment 3, containing only one barrier, accounted for 9.1% of the total and reflected high-priority concerns perceived as critical threats to effective field supervision. In contrast, Segment 2 comprises seven barriers, representing 63.6% of the total, highlighting moderate-impact issues that significantly affect the supervision process of the internship. Segment 1 included three barriers, accounting for 27.3% of the total, indicating low-priority or minimal risk concerns that required less immediate attention. These barriers are characterized by relatively low average ratings and limited influence on the effectiveness of comprehensive educational supervision.

Table 9. Ranking of barriers impacting the effectiveness of educational supervision based on respondents’ input.

BarrierThe average fuzzy rating for each variableRank Segment
B10.42262
B20.44222
B30.42552
B40.46713
B50.39891
B60.44032
B70.389111
B80.391101
B90.41782
B100.42172
B110.43642

Table 10. Distribution and segmentation of barriers impacting educational supervision.

SegmentNumber of barriersPercentage % SSE/Segment
Segment 1 327.3%0.000047
Segment 2 763.6%0.000607
Segment 3 19.1%0.000000
TOTAL 11100.0%0.00065

4. Discussion

The Fuzzy Delphi Method (FDM) process confirmed the appropriateness and relevance of all 11 barriers to effective educational supervision. Each barrier met the necessary consensus and threshold criteria, demonstrating that the selected variables were not only theoretically grounded in the literature but also validated by expert opinion as reflective of the real-world challenges facing Yemen’s supervision system.

The high level of consensus among experts (average agreement of 90%) and a low domain-level threshold (0.093) indicate a consistent and strong agreement across the expert panel. These outcomes enhance the methodological rigor and credibility of barrier selection processes. Notably, the high crisp scores obtained for several barriers, particularly those related to resource limitations (e.g., insufficient learning materials and inadequate financial support), highlight their practical significance. These challenges are not just perceived concerns but are deeply embedded in the operational dynamics of the educational supervision system. Even barriers with relatively lower average ratings, such as the lack of training halls or dedicated office space, were retained in the final model, emphasizing their contextual importance despite being perceived as less urgent issues. The use of FDM in this study served as an effective filtering and validation mechanism, ensuring that only the most contextually relevant and expert-validated barriers were included in subsequent analytical phases. This is especially critical in complex decision-making environments, where aligning empirical evidence with expert insights enhances the focus, reliability, and applicability of advanced evaluation models, such as MCDM techniques or policy simulations. By incorporating expert judgment under conditions of uncertainty and applying structured fuzzy logic principles, the method strengthens the foundation for deeper investigation and prioritization [39,42,65].

Following FDM validation, ranking and clustering analyses ( Tables 9 and 10) provided further insights into how these barriers vary in severity and impact. Segmentation into three groups enabled the strategic categorization of challenges, offering a clearer roadmap for targeted interventions and policy reform.

The analysis identified a critical barrier in educational supervision: the weak competencies of educational supervisors (B4). This barrier, despite representing only 9.1% of total concerns, received the highest average score (0.467) and ranked first among all barriers, emphasizing its importance. Consistent with the existing literature,67–69 this finding highlights the vital role of competent supervision in teacher development, instructional improvement, and overall school enhancement. Competent supervisors possess essential skills in instructional leadership, communication, observation, feedback delivery, and the ability to analyze teaching practices and design appropriate interventions.15,16 Weak supervisory competencies can undermine the entire supervisory process, leading to unclear guidance, ineffective support, and hindered teacher development,70 potentially compromising educational quality and impeding systemic improvement. This analysis emphasizes the need for immediate and sustained action to enhance supervisory competencies, as they form the foundation of effective educational supervision and improve teaching and learning outcomes.16–18 To address this critical issue, policymakers and educational institutions must prioritize comprehensive professional development programs for supervisors, focusing on continuous training, mentorship programs, tailored performance evaluation systems, and the development of instructional leadership capabilities.15,17,18,68,69 Strengthening supervisory competencies is likely to have far-reaching effects, potentially mitigating the interconnected barriers within the educational system.

Furthermore, segment 2, comprising seven barriers and representing the largest portion (63.6%), included various moderate-impact issues influencing daily supervision operations. The most significant barrier, B2: Limited capability in creating effective supervisory plans (average score: 0.442), highlights the necessity of enhanced strategic planning skills among supervisors. This aligns with the understanding that well-developed supervisory plans are essential for effective supervision, providing structure, clear goals, and direction.53,54 Effective supervisory plans ensure a systematic rather than a reactive process, aligning supervision with instructional improvement. These plans must be goal-oriented, realistic, adaptable, time-bound, and developed collaboratively with the teachers. The ability to create and implement strong supervisory plans is central to ensuring purposeful and focused supervision aligned with educational goals.13,14 Conversely, poorly planned supervision can be inconsistent or unclear, limiting its impact on teacher development and student learning. Equally concerning is B6: Poor relationships between supervisors and teachers (0.440), which points to underlying communication and trust issues. Effective supervision relies not only on technical competence but also on interpersonal relationships that foster a supportive and collaborative school environment. A professional relationship built on trust and communication encourages teacher openness, responsiveness to feedback, and collaboration.18,59

In contrast, weak relationships or lack of trust may result in resistance and reduce the impact of supervisory efforts. Additional barriers in this segment include B11: Lack of dedicated offices for educational supervision (0.436) and B3: Limited capability in implementing training programs (0.425). These barriers indicate both infrastructural deficiencies and limited pedagogical leadership, suggesting a need for resource investment and capacity building. Well-designed and timely training programs directly enhance teachers’ instructional skills, promote consistency in instructional quality, and foster a culture of professional growth.10,11 The distribution of barriers in Segment 2 underscores the need for a comprehensive improvement strategy that addresses both relational and structural challenges. Enhancing supervisors’ abilities across three key dimensions—planning, relational engagement, and training delivery—is foundational to improving the overall effectiveness of supervision systems.

This approach would involve the socioeconomic status: Strengthening planning capabilities to improve the quality, consistency, and outcomes of the educational supervision process; developing meaningful relationships between supervisors and teachers to create a supportive and collaborative environment is essential; improving the design and implementation of impactful training programs to enhance teacher performance and professional development is crucial.

By addressing these interconnected aspects, supervisors can provide more meaningful feedback, manage their time effectively, and identify key areas for improvement.13 This comprehensive approach would significantly enhance the quality and impact of educational supervision, ultimately benefiting the development of teachers and student learning outcomes.

Finally, segment 1, comprising three barriers and representing 27.3% of the total, included concerns that were considered low-priority or minimal risk. These barriers are marked by relatively low average ratings and limited impact on the effectiveness of comprehensive educational supervision. Despite being rated as lower priority, the barriers in Segment 1 highlight structural limitations that could negatively affect the supervision process if not addressed. For example, limited teacher acceptance of modern supervisory methods can stifle innovation and diminish the effectiveness of such interventions. When teachers resist collaborative evaluation, peer feedback, or data-informed coaching, it undermines the culture of trust and continuous improvement that is essential for effective supervision.22 Similarly, inadequate financial allocations for supervision activities restrict supervisors’ ability to conduct school visits, attend professional development programs, or acquire necessary materials, resulting in poorly planned and under-resourced supervision systems.8,23 Additionally, the lack of access to modern technologies limits the use of digital tools for classroom observation, feedback, training and communication. Without these tools, supervision becomes less interactive, data-informed, and responsive.19,60 While these barriers may not have the most immediate impact, their cumulative effect over time can impede broader reform efforts, stifle innovation, and exacerbate other challenges in educational supervision. Therefore, continuous monitoring and proactive mitigation of lower-ranked issues are crucial for sustaining long-term improvements.

The low average ratings and limited influence of these barriers indicate that resources and efforts should primarily focus on addressing more critical issues in comprehensive educational supervision. Nevertheless, it remains important to monitor these low-priority barriers, as they could potentially affect the overall effectiveness of supervision if neglected for an extended period.

In summary, the integrated FDM and clustering analysis offered a multilayered understanding of the barriers hindering educational supervision in Yemen. The findings emphasize the urgency of enhancing supervisory competencies and highlight the need to address moderate-impact barriers related to planning, relationships, infrastructure, and training. Low-priority issues should not be ignored, as their future impact may become more significant in a changing educational landscape. Collectively, these insights provide a strategic framework for policymakers, educational leaders, and stakeholders to allocate resources, design interventions, and improve supervisory effectiveness, ultimately contributing to more robust and equitable educational outcomes

5. Conclusion

This study systematically investigated the technical barriers undermining the effectiveness of comprehensive educational supervision in Amanat Al Asimah, Yemen, using a robust mixed-methods framework. Eleven key challenges were identified, validated through the Fuzzy Delphi Method, and prioritized using fuzzy set theory and K-means clustering. The Fuzzy Delphi results demonstrated strong expert consensus, with a domain-level threshold of 0.093 and an average agreement of 90%, affirming the contextual relevance of the selected barriers. Through clustering analysis, the barriers were segmented into three priority levels. Segment 3, comprising a single high-priority issue—weak supervisory competencies—emerged as the most critical concern, underscoring the central role of professional capacity in enabling effective supervision. Segment 2 contained seven moderate-priority barriers, including planning deficiencies, strained supervisor–teacher relationships, and inadequate training infrastructure. Segment 1 grouped three low-priority barriers, such as resistance to modern supervision methods and technological or financial constraints, which, though less urgent, could accumulate into significant challenges if left unaddressed.

These findings carry important implications for policymakers, educational leaders, and planners operating in fragile and resource-limited contexts. The study offers a data-driven roadmap for prioritizing reforms, emphasizing the urgent need to invest in supervisory training, institutional planning, and resource allocation. The segmentation of barriers provides a practical framework for designing targeted, phased interventions. Additionally, the approach used in this study can be replicated or adapted to other conflict-affected or low-income settings, contributing to broader efforts in educational reform and capacity building.

However, the study also faces several limitations. It was geographically restricted to Amanat Al Asimah, potentially limiting generalizability across Yemen’s diverse regions. Furthermore, it focused primarily on teachers’ and supervisors’ perspectives, excluding voices from students, principals, and policymakers. Also, it concentrated on technical barriers, leaving out sociopolitical or cultural dimensions that may influence supervisory effectiveness. Future research should expand geographically, incorporate multiple stakeholders, and explore non-technical factors influencing educational supervision. Longitudinal and experimental designs could also evaluate how barriers evolve and how proposed solutions perform over time. Comparative studies across similar contexts could provide valuable benchmarks and inform regional or global supervisory strategies.

In summary, this study highlights the urgency of strengthening supervisory competencies and institutional frameworks to enhance the quality and effectiveness of educational supervision in Yemen. By integrating expert validation, teacher feedback, and advanced clustering techniques, it presents a scalable and evidence-based framework that can inform strategic planning, policy development, and educational resilience in post-crisis and low-resource environments

Ethics and consent

This study adhered to the principles of the Declaration of Helsinki and the national guidelines for research involving human participants. Official permission to conduct the study was secured from the Office of Education, Bani-Alhareth, Amanat Al Asimah, Republic of Yemen (Approval Reference No.: [72-9-24), authorizing data collection from participating schools. Participants received clear information about the study’s objectives and procedures and were assured that their participation was entirely voluntary, with the right to withdraw at any time, without consequences. Written informed consent was obtained from all participants before data collection, and no minors were included in this study.

Declaration of generative AI and AI-assisted technologies in the writing process

During the preparation of this work the author(s) used [DeepSeek v3, Paperpal, Quillbot, and ChatGPT] for language refinement and structure. After using this tools, the author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the publication.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 22 Sep 2025
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Alshameri RGA, Nasser AA, Abdullah AH et al. Toward Effective Educational Supervision in Yemen: A Hybrid Fuzzy Delphi and Clustering Analysis of Technical Barriers [version 1; peer review: awaiting peer review]. F1000Research 2025, 14:956 (https://doi.org/10.12688/f1000research.167498.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status:
AWAITING PEER REVIEW
AWAITING PEER REVIEW
?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 22 Sep 2025
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.