Keywords
educational supervision; Fuzzy Delphi Method; Yemen; fuzzy set theory; K-means ; clustering; supervisory competencies
Comprehensive educational supervision is essential for ensuring quality teaching, fostering professional development, and supporting institutional capacity building. However, its implementation encounters numerous structural, technical, and human resource challenges. This study aimed to identify, validate, rank, and cluster the technical barriers affecting comprehensive educational supervision in Amanat Al Asimah, Yemen. This aligns with national reform goals by offering strategic insights to improve supervisory systems, thereby enhancing teaching quality, institutional performance, and educational resilience in fragile contexts
This study employed a three-phase mixed-methods approach. Initially, a literature review identified 11 key barriers to effective supervision. These were validated using the Fuzzy Delphi Method (FDM), involving 16 experienced educational supervisors to assess the consensus and suitability of the items. Subsequently, a quantitative survey targeting 370 teachers was conducted to evaluate their perceived severity. Fuzzy set theory was used to aggregate and defuzzify the responses, generating crisp scores for prioritization. Finally, K-means clustering was applied to segment the barriers based on their impacts.
FDM analysis confirmed the validity of all 11 identified barriers, with a domain-level threshold of 0.093 and an average expert consensus of 90%, indicating strong agreement. The fuzzy set-based evaluation highlighted three top-priority challenges: weak supervisory competencies, limited ability to develop effective supervisory plans, and poor supervisor-teacher relationships. K-means clustering grouped the barriers into three segments: one high-priority barrier, seven moderate-priority concerns, and three low-priority issues. Notably, weak supervisory competencies emerged as the most critical barrier, isolated in a high-priority cluster.
These findings provide evidence-based guidance for policy and strategic interventions aimed at enhancing the effectiveness of supervision systems in fragile educational settings. The study concludes with recommendations for strengthening supervisory competencies, improving resource allocation, and fostering trust-based supervisor-teacher relationships, thereby contributing to the quality of education and institutional resilience in Yemen.
educational supervision; Fuzzy Delphi Method; Yemen; fuzzy set theory; K-means ; clustering; supervisory competencies
The current global shifts in science and education have significantly impacted various educational dimensions, including systems and supervision processes. These transformations have compelled educators to critically examine educational structures and reaffirm the teacherās role as a central pillar within the system.1 Within this context, educational supervision has emerged as a pivotal component of educational management, fundamental to the broader educational framework that encompasses both teaching and learning.2 Educational supervision serves as a cornerstone for ensuring educational quality and promoting continuous improvement. It plays a vital role in enhancing teacher performance by offering professional guidance, fostering reflective practice, and aligning teaching with curriculum standards.3 Historically limited to classroom observation, the scope of supervision has expanded to include mentorship, collaborative problem-solving, and the development of targeted professional growth plans.4
In todayās dynamic educational landscapeāshaped by digital transformation and evolving pedagogical expectationsāsupervisors are instrumental in helping educators integrate technology, maintain digital ethics, and implement innovative teaching strategies suited to 21st-century learners.5 Beyond instructional support, effective supervision also nurtures teachersā emotional well-being and professional development, ensuring that education remains inclusive, adaptive, and student-centered.6 Modern supervisory models now emphasize collaborative learning environments and constructive feedback mechanisms, thereby cultivating a culture of continuous improvement.7 As Okafor et al. note, effective supervision not only enhances instructional practices but also strengthens institutional effectiveness by fostering an environment conducive to both teacher and student success.8 Given its multifaceted nature, educational supervision continues to evolveārequiring ongoing research and innovation to meet the complex demands of contemporary education.
The success of educational supervision hinges on several critical indicators. Supervisors must offer clear directives and align instructional practices with institutional objectives.9 Emphasis should be placed on implementing quality assurance mechanisms that align with global standards while addressing local educational contexts.9 Continuous professional development is essential in fostering a culture that supports quality teaching and lifelong learning.10,11
Moreover, supervision becomes most effective when teachers actively engage with feedback, thereby promoting a culture of accountability and improvement.12 Effective planning is equally importantāsupervisors must ensure that their approach is strategic, goal-oriented, and systematic.13,14 Competent supervisors are characterized by strong instructional leadership, effective communication, accurate observation, and constructive feedback delivery.15,16 Building trust-based professional relationships between supervisors and teachers further enhances collaboration and facilitates open communication.17,18 Additionally, the integration of modern technology and the availability of financial and learning resources significantly contribute to the success of supervision practices.19 These factors collectively support a dynamic, responsive, and impactful supervision system.
Ultimately, educational supervision focuses on actively involving teachers in the learning environment to enhance teaching methods and improve student achievement. It plays a central role in overseeing the educational process by integrating instructional activities and continuously evaluating institutional outcomes.2,20 Through identifying challenges and working collaboratively to resolve them, supervision ensures the implementation of appropriate actions that drive educational advancement.
Numerous studies indicate that educational supervision, in its current state, faces significant challenges that hinder its effectiveness. Including inadequate training programs,10 limited planning,21 weak supervisory competencies,15 low teacher acceptance of modern methods,22 strained relationships,18 technological gaps,19 and insufficient financial, infrastructural, and learning resources.2,23 To overcome these barriers, studies recommend structured professional development, strategic planning, competence-based supervisor selection,16 fostering collaborative cultures,24 integrating digital tools,25 securing adequate funding,26 and providing supportive environments to strengthen supervisionās impact.
Comprehensive educational supervision is a fundamental approach employed in schools to assess and respond to their challenges, needs, and educational and social realities. It involves the continuous presence of educational supervisors within school environments to gain firsthand insights into the teaching and learning processes both inside and outside the classroom. Through this approach, supervisors are able to diagnose the real conditions of the educational systemāincluding the roles of teachers, learners, school leadership, and the broader learning environment. However, in Yemen, the ability to implement effective supervision has been severely undermined by ongoing conflict and political instability. The education system, which once showed progress with enrollment rates increasing from 71.3% to 97.5% between 1999 and 2013, has since experienced significant setbacks. By the 2015/2016 academic year, around 1.8 million children were out of school, and over 1,100 schools had been either destroyed or repurposed for emergency use.27 These structural disruptions have weakened the foundation of educational supervision. Furthermore, the quality of education has declined significantly, with reports of curriculum manipulation for ideological purposes and widespread issues such as the absence of teacher training and salary payments.28 In response to these profound challenges, Yemenās National Vision for Building the Modern Yemeni State and the national research priorities emphasize the urgent need for comprehensive education reform.
In line with these priorities, this study aims to analyze the technical challenges facing educational supervision in schools located in Amanat Al Asimah and explore viable strategies to improve the efficiency of educational processes. The focus is on supporting national goals by modernizing supervisory practices and ensuring they contribute meaningfully to sustainable development and institutional capacity building. The study seeks to identify, rank, and cluster these challenges in schools within Amanat Al Asimah, Yemen, while exploring viable strategies to enhance the efficiency of educational supervision processes.
The significance of this study lies in its alignment with Yemenās National Vision for Building a Modern Yemeni State and national research priorities. It contributes to modernizing supervisory practices in a conflict-affected context, supports sustainable development, and enhances institutional capacity in Yemenās education sector. Additionally, it provides insights into addressing teacher-related challenges and improving the quality of education.
Methodologically, the study employs a mixed-methods approach that includes a literature review to identify key barriers to effective supervision, the Fuzzy Delphi Method to validate and prioritize these barriers, a quantitative survey of teachers to assess their perceptions of supervisory challenges, and fuzzy set theory and K-means clustering analysis to categorize and prioritize the identified barriers.
The findings of this study could inform targeted policy interventions to enhance supervisory effectiveness, guide resource allocation for improving educational supervision, support capacity-building initiatives for supervisors, and contribute to developing a responsive supervision framework aligned with national development objectives.
Multi-Criteria Decision-Making (MCDM) methods have emerged as vital analytical tools in a wide range of fields, including health, sustainability, tourism, healthcare, engineering, education and financial risk assessment.29ā39 The integration of MCDM methods with fuzzy logic and clustering techniques provides a robust framework for addressing complex decisions under uncertainty.29ā33 Fuzzy-based MCDM approaches effectively capture expert ambiguity and contextual nuances, making them particularly suitable for fields such as health, education, and sustainability.35,38ā40 The Fuzzy Delphi Method (FDM), enhanced by fuzzy set theory, improves expert consensus and facilitates clearer selection and prioritization of variables.35,38,41,42
Meanwhile, machine learning (ML) has significantly advanced real-world decision-making through predictive modeling, risk analysis, and pattern recognition.43ā48 Its applications span various domains, including healthcare,43 big data analytics,44 manufacturing quality control,45 and infrastructure forecasting.46 ML also supports academic evaluation through web mining47 and Industry 4.0 applications.48
Recent research increasingly integrates MCDM, fuzzy logic, and unsupervised ML techniquesāparticularly K-means clusteringāto enhance decision-making by segmenting barriers and identifying strategic priorities.29ā33 This hybrid approach has been successfully applied in health security,29ā33 ecotourism,39 urban resilience assessment,49 and education,40,50 offering scalable, context-sensitive solutions to complex policy and planning challenges.
Building on these methodological advancements and empirical applications, the present study adopts a similar integrative approach to explore and prioritize barriers in the context of educational supervision in Yemen.
In the present study, the Fuzzy Delphi Method (FDM) was employed as a preliminary phase to identify and validate the most contextually relevant barriers to effective educational supervision in Yemen. The primary purpose of the FDM was not to evaluate or resolve the barriers but to systematically screen and refine a set of candidate variables based on expert consensus for use in subsequent analyses. This was followed by a quantitative teacher survey assessing perceptions of supervisory challenges and the application of fuzzy set theory and K-means clustering to categorize and prioritize the identified barriers. Our analysis was conducted through a structured, three-phase process, with each stage clearly outlined and supported by a rationale for the chosen methodology.
A substantial body of research underscores the significance of educational supervision in enhancing instructional quality, fostering professional development, and aligning teaching practices with institutional and national objectives. Despite its critical role, several technical challenges impede the effectiveness of educational supervision, particularly in fragile contexts such as Yemen. A key issue identified in the literature is teachersā lack of interest in the instructions and directives provided by supervisors. This disinterest undermines efforts to implement improvements in teaching practices and reflects broader issues of disengagement and accountability.9,12,51,52 This is due to the weak capacity of some educational supervisors to develop effective supervisory plans, which are essential for guiding systematic, goal-oriented oversight.13,14,53,54 Without proper planning, supervision can become inconsistent, fragmented, and reactive. Another challenge is the limited ability to develop and implement effective training programs, which are vital for equipping teachers with up-to-date skills and aligning instructional methods with evolving standards.10,21
The literature also highlights the lack of supervisory competencies among some educational supervisors, including deficiencies in instructional leadership, communication, and feedback delivery.15,16,19 Such gaps diminish the credibility and impact of the supervision. Equally critical is the resistance among some teachers to modern supervision methods such as collaborative evaluation and peer feedback. This resistance can limit the effectiveness of reforms intended to make supervision more participatory and data-driven.22,24,55ā57 Moreover, weak or distant professional relationships between supervisors and teachers hinder open communication, reduce trust, and lead to ineffective feedback cycles.17,18,58,59
Technological and infrastructural deficiencies also pose significant obstacles. The absence or inadequacy of modern technologies and digital tools, such as learning management systems, video conferencing platforms, and data analytics software, limits the efficiency and adaptability of supervision practices.19,25,59,60 Furthermore, the lack of financial resources allocated to supervision hampers planning, professional development, transportation, and the acquisition of the necessary tools.8,23,61 A closely related issue is the inadequacy of supportive learning resources in schools, such as libraries, science laboratories, Internet access, and computers. These are essential for both effective teaching and meaningful supervisory assessment.19,62ā64
Additionally, the lack of dedicated training and rehabilitation halls limits opportunities for continuous professional development, and the absence of a specialized office for managing supervision compromises the coordination and institutionalization of supervisory activities.2
The Fuzzy Delphi Method (FDM) is a widely used technique for assessing the suitability of research indicators in measuring specific phenomena within defined environments because of its ability to combine expert judgment with the flexibility of fuzzy logic.35,39,40 Often, determining the relevance of indicators requires subjective input from domain experts, whose perspectives may vary or be influenced by uncertainties arising from complex contextual conditions.65 FDM addresses this challenge by allowing experts to provide assessments using linguistic terms, which are then converted into fuzzy numbers, enabling the method to handle ambiguity and imprecision more effectively.35,65 Compared to the traditional Delphi method,41 which often requires multiple rounds of consultation to reach a consensus and may lead to misinterpretation or loss of expert interest due to repetitive rounds, FDM enhances efficiency by minimizing the number of iterations needed. It also reduces the risk of bias introduced through forced modifications of expert opinions and lowers the overall cost and time required for data collection.35,39
Additionally, the FDM increases survey response rates, improves the reliability of expert feedback, and effectively resolves issues related to non-consensus cases. This helps make evaluations more consistent and clear, keeping expert opinions genuine while encouraging agreement and realism in finding relevant indicators. The method further provides a systematic approach to filter and validate indicators by applying a consensus threshold, ensuring the retention of only the most relevant and context-appropriate variables. Owing to its adaptability across domains and environments, FDM enables experts to assess indicator suitability based on theoretical soundness and practical relevance to the target context.35,39,42 Overall, by merging expert knowledge, fuzzy logic, and a structured consensus mechanism, the Fuzzy Delphi Method (FDM) enhances the robustness, credibility, and contextual relevance of selected indicators, making it particularly valuable in complex evaluation frameworks, such as the assessment of educational barriers in low-income countries and other context-sensitive settings.
In this study, the following FDM procedural steps were implemented42,65:
Based on the literature reviewed in Section 2.1, 11 critical technical barriers impacting the effectiveness of educational supervision were identified. These include (B1) teachersā disinterest in supervisorsā directives, (B2) Limited capability of certain educational supervisors in creating effective supervisory plans, (B3) Limited capability of certain educational supervisors in creating and implementing effective supervisory training programs, (B4) Weak competencies of educational supervisors in supervision, (B5) limited teacher acceptance of modern supervisory methods, (B6) poor relationships between supervisors and teachers, (B7) lack of access to modern technologies, (B8) inadequate financial allocations for supervision activities, (B9) insufficient availability of learning resources in schools, (B10) Absence of training and rehabilitation halls, and (B11) lack of dedicated offices for educational supervision. These barriers represent the foundational factors proposed for expert evaluation, as addressing them is essential to improve the quality of comprehensive educational supervision and ensure its alignment with Yemenās broader educational development objectives.
In accordance with FDM guidelines, this study purposively selected a panel of experts to ensure homogeneity, as recommended in previous research.39 A total of 16 educational supervisors were chosen based on their direct involvement in educational supervision and extensive experienceāeach with over 10 years in the field. A structured questionnaire was used to collect data, consisting of 11 barriers to effective supervision. Fifteen printed questionnaires were distributed and collected manually. Experts were asked to rate the relevance and contextual suitability of each barrier within the Yemeni educational environment using a five-point Likert scale (1 = strongly disagree, 5 = strongly agree). The collected responses (see Table 1) were then converted into fuzzy numbers using the conversion scale proposed by.42,65( Table 2).
Likert scale scoring | Linguistic variable | Fuzzy Scale scoring | ||
---|---|---|---|---|
m1 | m2 | m3 | ||
5 | Highly Agree | 0.6 | 0.8 | 1 |
4 | Agree | 0.4 | 0.6 | 0.8 |
3 | Moderately/Not sure | 0.2 | 0.4 | 0.6 |
2 | Not Agree | 0 | 0.2 | 0.4 |
1 | Highly Not Agree | 0 | 0 | 0.2 |
In fuzzy set theory, each element is characterized by a pair: the value and its degree of membership in the fuzzy set, defined within the [0,1] interval. A membership function quantifies this degree, with values closer to 1 indicating a stronger inclusion in the fuzzy set. As detailed in Table 2, fuzzy values were used to represent the uncertain expert responses (see Table 3).
The average fuzzy rating (AFR) for each variable (barrier z) was then calculated across all n experts using the following formula:
The fuzzy values for each variable were then converted into crisp scores (CS) (See Table 3) using the centroid method (center of gravity), defined as:
As shown in Table 4, the deviation between each expertās fuzzy rating and the aggregated fuzzy score was computed using the following formula:
For each variable, a threshold value was calculated by averaging these distances:
The overall threshold for the entire domain of variables was then determined by averaging the threshold values of all factors:
A domain and its variables were considered valid if their threshold values were ⤠0.2.42,65 Barriers with threshold values below this criterion were deemed acceptable for further analysis. In this study, As shown in Table 4, the overall domain threshold was calculated as 0.093, while all individual variable threshold values were found to be less than 0.2.
To determine expert consensus on each factor, the number of experts (n) whose ratings were within a fuzzy distance of ⤠0.2 from the average fuzzy rating ( was calculated (see Table 5). The percentage of these experts was defined as:
Expert n | B1 | B2 | B3 | B4 | B5 | B6 | B7 | B8 | B9 | B10 | B11 |
---|---|---|---|---|---|---|---|---|---|---|---|
Number of experts, d ⤠0.2 | 13 | 13 | 16 | 13 | 16 | 16 | 13 | 15 | 16 | 13 | 14 |
Percentage of Experts (%), d ⤠0.2 | 81 | 81 | 100 | 81 | 100 | 100 | 81 | 94 | 100 | 81 | 88 |
Average | 90% |
Consensus was measured by assessing the proportion of experts whose ratings deviated from the group average by less than or equal to 0.2, as indicated in Equation (6). A consensus threshold of 75% was adopted. All 11 barriers met or exceeded this threshold. Subsequently, the final crisp scores were used to rank the barriers in terms of priority, with any factor scoring below 0.4 excluded from further consideration.
After confirming the validity of the evaluation domain and identifying variables with sufficient expert consensus, the final step involved analyzing the crisp scores of the accepted barriers. These scores were used to rank the barriers in order of priority, with any variable scoring below 0.4 excluded from further consideration. However, as presented in the results section, all 11 barriers achieved scores above the threshold and were therefore accepted and retained for further analysis.
To investigate the technical barriers impacting comprehensive educational supervision in schools in Sanaāa, Yemen, a structured questionnaire was developed. The items in this instrument were derived from key barriers previously identified and validated through the Fuzzy Delphi Method. The questionnaire aimed to capture teachersā perceptions of the supervisory challenges within their institutional contexts.
Step 1: Sample of the Study
A descriptive quantitative approach was employed, aligning with the studyās objective to assess systemic supervisory barriers from the teachersā perspective. The study population comprised approximately 7,000 teachers working in schools that implemented comprehensive educational supervision in the capital city. Following Krejcie and Morganās sampling guidelines, a sample of 370 teachers was selected, surpassing the recommended minimum of 364 for a population of this size. This sample size ensured representativeness and allowed for statistical generalization to the broader population.
Step 2: Research Instrument Design
To gather the necessary data, a structured questionnaire was designed based on the study objectives, relevant literature, validated prior instruments, and the Fuzzy Delphi procedures outlined in Section 2.2. The instrument consisted of 11 items, each corresponding to a specific technical barrier affecting the effectiveness of comprehensive educational supervision. A five-point Likert scale was used to measure participantsā perceptions of each barrier, with response options ranging from āVery Lowā (1 point) to āVery Highā (5 points). The full scale included very low (1), low (2), moderate (3), high (4), and very high (5).
This structure enabled respondents to express their views clearly and allowed for the consistent quantification of data for analysis.
Step 3: Data Collection Procedures
The questionnaire was distributed across four educational districts in Amanat Al-asima Sanaāa, where comprehensive educational supervision is currently being practiced. These districts were selected from among the ten official educational zones in the city of Amanat Al-Asima, Yemen.
Teachers from private schools that implemented comprehensive supervision practices were also included. Completed questionnaires were collected through school administrations and returned directly to the researcher.
Step 4: Aggregation and Defuzzification of Responses
The collected responses were converted into fuzzy numbers using the predefined scale (see Table 1), enabling the representation of uncertainty in respondentsā judgments. The average fuzzy rating for each barrier was calculated across all respondents using Equation 1. These fuzzy values were then defuzzified into crisp scores using the centroid (center of gravity) method, as described in Equation 2 (see Table 6).
Following the evaluation of the barriers and the computation of the average score for each based on respondentsā input, the aggregated scores were used to rank and categorize the barriers according to their impact on the effectiveness of comprehensive educational supervision.
Step 5. Clustering Analysis and Barrier Prioritization Using K-Means
The aim of the clustering process was to categorize the identified barriers into distinct groups based on their combined impact. This approach allows for a deeper understanding of the relative effects of these factors on the success of comprehensive educational supervision and enables the targeted prioritization of interventions. Clustering is a fundamental unsupervised machine learning technique that organizes unlabeled data into distinct clusters based on inherent similarities, such as proximity or density metrics.29ā32
Unlike supervised learning, which relies on labeled outputs for training, clustering reveals hidden patterns without predefined categories, making it essential for exploratory analysis. By uncovering latent structures, clustering aids in hypothesis generation and provides valuable insights into strategic decision-making.66 Among the various clustering techniques, K-means is particularly favored for its simplicity, computational efficiency, and versatility across disciplines.29,30
The algorithm partitions n data points into k clusters by minimizing the variance within each cluster. It follows an iterative process29ā33: (1) randomly initializing k centroids; (2) assigning each data point to the nearest centroid; (3) recalculating centroids based on updated cluster memberships; and (4) repeating these steps until the model converges or reaches a predetermined iteration limit.
To determine the optimal number of clusters, the Elbow Method was employed using the Total Sum of Squared Errors (SSE) as a performance metric (see Figure 1). The SSE was computed for k values ranging from one to five. A significant reduction in SSE was observed between k = 1 (0.0055724) and k = 3 (0.0006534), indicating an improved model fit. Beyond k = 3, further reductions in SSE were minimal, suggesting diminishing returns. The inflection point at k = 3, known as the āelbow,ā indicates an optimal balance between simplicity and explanatory power. Consequently, k = 3 was selected as the ideal number of clusters.
The plot illustrates the correlation between the within-cluster sum of squares (WCSS) and the number of clusters (k).
The final clustering model classified the barriers into a three-tier system based on their aggregated impacts. Segment 3 represents a high-priority danger zone, encompassing barriers with high average ratings from respondents, indicating significant concerns and severity. Segment 2 reflects a moderate level of concern, where the perceived impact of the barriers is notable but less critical. Segment 1 comprises low-priority or minimal-risk barriers, characterized by relatively low average ratings and limited influence on the effectiveness of comprehensive education supervision. This structured grouping enhances the interpretability of the results and provides a practical framework for prioritizing interventions and developing targeted policy responses.
This study adhered to the principles of the Declaration of Helsinki and the national guidelines for research involving human participants. Official permission to conduct the study was secured from the Office of Education, Bani-Alhareth, Amanat Al Asimah, Republic of Yemen (Approval Reference No.: [72-9-24), authorizing data collection from participating schools. Participants received clear information about the studyās objectives and procedures and were assured that their participation was entirely voluntary, with the right to withdraw at any time, without consequences. Written informed consent was obtained from all participants before data collection, and no minors were included in this study.
The evaluation of domain acceptability aimed to assess the consistency of expert responses across all the identified barriers. The process began by calculating the distance between each expertās fuzzy rating and the aggregated fuzzy score for each barrier. This deviation indicates how closely an individual expertās judgment aligns with the groupās average perception. The mean of these deviations for each barrier was then computed to establish the threshold value. The results in Table 4 show that all 11 barriers had threshold values below the established cutoff of 0.2, ranging from 0.000 (for B9) to 0.150 (for B3 and B5). These findings suggest a high level of agreement among the experts for each variable. Furthermore, the domain-level threshold, derived by averaging the individual barrier thresholds, was calculated as 0.093, which is well below the acceptable upper limit. This confirmed the internal consistency of the expert evaluations and validated the overall domain of barriers for further analysis using the Fuzzy Delphi Method. Following the confirmation of domain validity, the focus shifted to assessing the expert consensus. This was achieved by identifying the number of experts whose ratings deviated from the group average by no more than 0.2 for each barrier. The percentage of experts who met this condition was calculated. The results in Table 5 reveal that all 11 barriers met or exceeded the minimum consensus threshold of 75%, with agreement levels ranging from 81% to 100%. Specifically, barriers B3, B5, B6, and B9 achieved full consensus (100%), while the lowest consensus rate was 81% for barriers B1, B2, B4, B7, and B10. The overall average expert agreement across all barriers was 90%, indicating a robust level of shared perceptions among the panel. Based on these results, all 11 barriers were accepted and retained for the final prioritization phase using defuzzified crisp scores.
In the final stage of the fuzzy Delphi analysis, the validated barriers were assessed and ranked according to their defuzzified crisp scores ( Table 7). These scores reflect the central tendency of expert judgments converted from fuzzy numbers into precise quantitative values using the centroid method. A threshold score of 0.4 was established to filter out low-priority barriers; any variable scoring below this cutoff was omitted from the final list. The results indicated that all 11 barriers achieved crisp scores well above the minimum threshold, underscoring their significance in the context of educational supervision challenges in Yemenās medical schools. Among them, the barrier āinsufficient availability of learning resources in schoolsā (B9) was ranked highest, with a crisp score of 0.800, followed by āinadequate financial allocations for supervisionā (B8) at 0.788, and both āweak competencies of educational supervisorsā (B4) and ālack of access to modern technologiesā (B7) tied with a score of 0.763. The lowest-ranking yet accepted variables included ālack of dedicated offices for educational supervisionā (B11) and āabsence of training and rehabilitation hallsā (B10), scoring 0.575 and 0.563, respectively. This ranking provides a clear prioritization of the most pressing barriers and serves as a critical reference point for targeted policy interventions and resource allocation to enhance the effectiveness of comprehensive educational supervision.
The sample included a diverse group of educators varying in gender, academic qualifications, years of experience, and school type. Specifically, 30% of the respondents (n = 111) were male and 70% (n = 259) were female. Regarding educational attainment, 40.8% (n = 151) held a high school diploma, 57.8% (n = 214) had a bachelorās degree, and 1.4% (n = 5) had a masterās diploma or higher. In terms of professional experience, 32.4% (n = 120) had 1ā5 years of teaching experience, 24.1% (n = 89) had 6ā10 years, and 43.5% (n = 161) had over 10 years.
The questionnaire was distributed almost evenly across school types, with 51.4% of respondents (n = 190) from public schools and 48.6% (n = 180) from private ones. These demographic characteristics ensured that the collected data reflected a broad range of perspectives, enhancing the credibility and depth of the subsequent analysis of the perceived technical barriers.
The demographic data of the sample ( Table 8) indicate a predominantly female teaching workforce, with most holding at least a bachelorās degree. A significant portion of respondents (43.5%) had more than ten years of teaching experience, suggesting that the dataset included insights from educators with substantial professional exposure. Additionally, the nearly equal distribution of respondents across public and private schools enhances the generalizability of the findings, allowing comparisons between different institutional contexts. Table 8 presents the full distribution of demographic variables among the sample:
Table 9 presents the average fuzzy ratings for the various barriers impacting the efficacy of comprehensive educational supervision, along with their respective ranks. This evaluation, derived from the respondentsā input, offers a thorough overview of the challenges encountered in educational supervision. Table 10 illustrates the distribution of these barriers, which are categorized into three segments based on their perceived severity and impact. A total of 11 barriers were assessed, culminating in an overall sum of 100.0%. The table also includes the Sum of Squared Errors (SSE) for each segment. This distribution underscores the varying levels of concern associated with each segment, providing a clear overview of the challenges that require future attention. Segment 3, containing only one barrier, accounted for 9.1% of the total and reflected high-priority concerns perceived as critical threats to effective field supervision. In contrast, Segment 2 comprises seven barriers, representing 63.6% of the total, highlighting moderate-impact issues that significantly affect the supervision process of the internship. Segment 1 included three barriers, accounting for 27.3% of the total, indicating low-priority or minimal risk concerns that required less immediate attention. These barriers are characterized by relatively low average ratings and limited influence on the effectiveness of comprehensive educational supervision.
Barrier | The average fuzzy rating for each variable | Rank | Segment |
---|---|---|---|
B1 | 0.422 | 6 | 2 |
B2 | 0.442 | 2 | 2 |
B3 | 0.425 | 5 | 2 |
B4 | 0.467 | 1 | 3 |
B5 | 0.398 | 9 | 1 |
B6 | 0.440 | 3 | 2 |
B7 | 0.389 | 11 | 1 |
B8 | 0.391 | 10 | 1 |
B9 | 0.417 | 8 | 2 |
B10 | 0.421 | 7 | 2 |
B11 | 0.436 | 4 | 2 |
The Fuzzy Delphi Method (FDM) process confirmed the appropriateness and relevance of all 11 barriers to effective educational supervision. Each barrier met the necessary consensus and threshold criteria, demonstrating that the selected variables were not only theoretically grounded in the literature but also validated by expert opinion as reflective of the real-world challenges facing Yemenās supervision system.
The high level of consensus among experts (average agreement of 90%) and a low domain-level threshold (0.093) indicate a consistent and strong agreement across the expert panel. These outcomes enhance the methodological rigor and credibility of barrier selection processes. Notably, the high crisp scores obtained for several barriers, particularly those related to resource limitations (e.g., insufficient learning materials and inadequate financial support), highlight their practical significance. These challenges are not just perceived concerns but are deeply embedded in the operational dynamics of the educational supervision system. Even barriers with relatively lower average ratings, such as the lack of training halls or dedicated office space, were retained in the final model, emphasizing their contextual importance despite being perceived as less urgent issues. The use of FDM in this study served as an effective filtering and validation mechanism, ensuring that only the most contextually relevant and expert-validated barriers were included in subsequent analytical phases. This is especially critical in complex decision-making environments, where aligning empirical evidence with expert insights enhances the focus, reliability, and applicability of advanced evaluation models, such as MCDM techniques or policy simulations. By incorporating expert judgment under conditions of uncertainty and applying structured fuzzy logic principles, the method strengthens the foundation for deeper investigation and prioritization [39,42,65].
Following FDM validation, ranking and clustering analyses ( Tables 9 and 10) provided further insights into how these barriers vary in severity and impact. Segmentation into three groups enabled the strategic categorization of challenges, offering a clearer roadmap for targeted interventions and policy reform.
The analysis identified a critical barrier in educational supervision: the weak competencies of educational supervisors (B4). This barrier, despite representing only 9.1% of total concerns, received the highest average score (0.467) and ranked first among all barriers, emphasizing its importance. Consistent with the existing literature,67ā69 this finding highlights the vital role of competent supervision in teacher development, instructional improvement, and overall school enhancement. Competent supervisors possess essential skills in instructional leadership, communication, observation, feedback delivery, and the ability to analyze teaching practices and design appropriate interventions.15,16 Weak supervisory competencies can undermine the entire supervisory process, leading to unclear guidance, ineffective support, and hindered teacher development,70 potentially compromising educational quality and impeding systemic improvement. This analysis emphasizes the need for immediate and sustained action to enhance supervisory competencies, as they form the foundation of effective educational supervision and improve teaching and learning outcomes.16ā18 To address this critical issue, policymakers and educational institutions must prioritize comprehensive professional development programs for supervisors, focusing on continuous training, mentorship programs, tailored performance evaluation systems, and the development of instructional leadership capabilities.15,17,18,68,69 Strengthening supervisory competencies is likely to have far-reaching effects, potentially mitigating the interconnected barriers within the educational system.
Furthermore, segment 2, comprising seven barriers and representing the largest portion (63.6%), included various moderate-impact issues influencing daily supervision operations. The most significant barrier, B2: Limited capability in creating effective supervisory plans (average score: 0.442), highlights the necessity of enhanced strategic planning skills among supervisors. This aligns with the understanding that well-developed supervisory plans are essential for effective supervision, providing structure, clear goals, and direction.53,54 Effective supervisory plans ensure a systematic rather than a reactive process, aligning supervision with instructional improvement. These plans must be goal-oriented, realistic, adaptable, time-bound, and developed collaboratively with the teachers. The ability to create and implement strong supervisory plans is central to ensuring purposeful and focused supervision aligned with educational goals.13,14 Conversely, poorly planned supervision can be inconsistent or unclear, limiting its impact on teacher development and student learning. Equally concerning is B6: Poor relationships between supervisors and teachers (0.440), which points to underlying communication and trust issues. Effective supervision relies not only on technical competence but also on interpersonal relationships that foster a supportive and collaborative school environment. A professional relationship built on trust and communication encourages teacher openness, responsiveness to feedback, and collaboration.18,59
In contrast, weak relationships or lack of trust may result in resistance and reduce the impact of supervisory efforts. Additional barriers in this segment include B11: Lack of dedicated offices for educational supervision (0.436) and B3: Limited capability in implementing training programs (0.425). These barriers indicate both infrastructural deficiencies and limited pedagogical leadership, suggesting a need for resource investment and capacity building. Well-designed and timely training programs directly enhance teachersā instructional skills, promote consistency in instructional quality, and foster a culture of professional growth.10,11 The distribution of barriers in Segment 2 underscores the need for a comprehensive improvement strategy that addresses both relational and structural challenges. Enhancing supervisorsā abilities across three key dimensionsāplanning, relational engagement, and training deliveryāis foundational to improving the overall effectiveness of supervision systems.
This approach would involve the socioeconomic status: Strengthening planning capabilities to improve the quality, consistency, and outcomes of the educational supervision process; developing meaningful relationships between supervisors and teachers to create a supportive and collaborative environment is essential; improving the design and implementation of impactful training programs to enhance teacher performance and professional development is crucial.
By addressing these interconnected aspects, supervisors can provide more meaningful feedback, manage their time effectively, and identify key areas for improvement.13 This comprehensive approach would significantly enhance the quality and impact of educational supervision, ultimately benefiting the development of teachers and student learning outcomes.
Finally, segment 1, comprising three barriers and representing 27.3% of the total, included concerns that were considered low-priority or minimal risk. These barriers are marked by relatively low average ratings and limited impact on the effectiveness of comprehensive educational supervision. Despite being rated as lower priority, the barriers in Segment 1 highlight structural limitations that could negatively affect the supervision process if not addressed. For example, limited teacher acceptance of modern supervisory methods can stifle innovation and diminish the effectiveness of such interventions. When teachers resist collaborative evaluation, peer feedback, or data-informed coaching, it undermines the culture of trust and continuous improvement that is essential for effective supervision.22 Similarly, inadequate financial allocations for supervision activities restrict supervisorsā ability to conduct school visits, attend professional development programs, or acquire necessary materials, resulting in poorly planned and under-resourced supervision systems.8,23 Additionally, the lack of access to modern technologies limits the use of digital tools for classroom observation, feedback, training and communication. Without these tools, supervision becomes less interactive, data-informed, and responsive.19,60 While these barriers may not have the most immediate impact, their cumulative effect over time can impede broader reform efforts, stifle innovation, and exacerbate other challenges in educational supervision. Therefore, continuous monitoring and proactive mitigation of lower-ranked issues are crucial for sustaining long-term improvements.
The low average ratings and limited influence of these barriers indicate that resources and efforts should primarily focus on addressing more critical issues in comprehensive educational supervision. Nevertheless, it remains important to monitor these low-priority barriers, as they could potentially affect the overall effectiveness of supervision if neglected for an extended period.
In summary, the integrated FDM and clustering analysis offered a multilayered understanding of the barriers hindering educational supervision in Yemen. The findings emphasize the urgency of enhancing supervisory competencies and highlight the need to address moderate-impact barriers related to planning, relationships, infrastructure, and training. Low-priority issues should not be ignored, as their future impact may become more significant in a changing educational landscape. Collectively, these insights provide a strategic framework for policymakers, educational leaders, and stakeholders to allocate resources, design interventions, and improve supervisory effectiveness, ultimately contributing to more robust and equitable educational outcomes
This study systematically investigated the technical barriers undermining the effectiveness of comprehensive educational supervision in Amanat Al Asimah, Yemen, using a robust mixed-methods framework. Eleven key challenges were identified, validated through the Fuzzy Delphi Method, and prioritized using fuzzy set theory and K-means clustering. The Fuzzy Delphi results demonstrated strong expert consensus, with a domain-level threshold of 0.093 and an average agreement of 90%, affirming the contextual relevance of the selected barriers. Through clustering analysis, the barriers were segmented into three priority levels. Segment 3, comprising a single high-priority issueāweak supervisory competenciesāemerged as the most critical concern, underscoring the central role of professional capacity in enabling effective supervision. Segment 2 contained seven moderate-priority barriers, including planning deficiencies, strained supervisorāteacher relationships, and inadequate training infrastructure. Segment 1 grouped three low-priority barriers, such as resistance to modern supervision methods and technological or financial constraints, which, though less urgent, could accumulate into significant challenges if left unaddressed.
These findings carry important implications for policymakers, educational leaders, and planners operating in fragile and resource-limited contexts. The study offers a data-driven roadmap for prioritizing reforms, emphasizing the urgent need to invest in supervisory training, institutional planning, and resource allocation. The segmentation of barriers provides a practical framework for designing targeted, phased interventions. Additionally, the approach used in this study can be replicated or adapted to other conflict-affected or low-income settings, contributing to broader efforts in educational reform and capacity building.
However, the study also faces several limitations. It was geographically restricted to Amanat Al Asimah, potentially limiting generalizability across Yemenās diverse regions. Furthermore, it focused primarily on teachersā and supervisorsā perspectives, excluding voices from students, principals, and policymakers. Also, it concentrated on technical barriers, leaving out sociopolitical or cultural dimensions that may influence supervisory effectiveness. Future research should expand geographically, incorporate multiple stakeholders, and explore non-technical factors influencing educational supervision. Longitudinal and experimental designs could also evaluate how barriers evolve and how proposed solutions perform over time. Comparative studies across similar contexts could provide valuable benchmarks and inform regional or global supervisory strategies.
In summary, this study highlights the urgency of strengthening supervisory competencies and institutional frameworks to enhance the quality and effectiveness of educational supervision in Yemen. By integrating expert validation, teacher feedback, and advanced clustering techniques, it presents a scalable and evidence-based framework that can inform strategic planning, policy development, and educational resilience in post-crisis and low-resource environments
This study adhered to the principles of the Declaration of Helsinki and the national guidelines for research involving human participants. Official permission to conduct the study was secured from the Office of Education, Bani-Alhareth, Amanat Al Asimah, Republic of Yemen (Approval Reference No.: [72-9-24), authorizing data collection from participating schools. Participants received clear information about the studyās objectives and procedures and were assured that their participation was entirely voluntary, with the right to withdraw at any time, without consequences. Written informed consent was obtained from all participants before data collection, and no minors were included in this study.
During the preparation of this work the author(s) used [DeepSeek v3, Paperpal, Quillbot, and ChatGPT] for language refinement and structure. After using this tools, the author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the publication.
Figshare: Toward Effective Educational Supervision in Yemen: A Hybrid Fuzzy Delphi and Clustering Analysis of Technical Barriers. Doi: https://doi.org/10.6084/m9.figshare.29483738.v4.71
The project contains the following underlying data:
All data, and processing results related to this study are presented in this file. This file integrates the application of the Fuzzy Delphi method to assess the consensus and suitability of the items, validated by 16 experienced educational experts. It also contains the collected and processed data from a quantitative survey targeting 370 teachers, along with the Fuzzy Set Theory processes for data aggregation. Additionally, it illustrates the K-means clustering results for segmenting the barriers based on their impacts. All this data is merged into a single Excel-based tool. This source also includes the values behind the results reported. This file also includes the values behind the measures reported in all analysis and discussion sections, as well as the values used to construct tables and figures.
This supplementary resource also provides detailed support for replicating the studyās methods and results.
- Supplementary files (copy of Ethical approval, Consent Form for participants and Questionnaires for data collection) (Original Arabic Version)
- Supplementary Files ā English Translation of Original Documents
This data are publicly available and can be accessed through the following repository (https://figshare.com/articles/dataset/Supplementary_material_-_Toward_Effective_Educational_Supervision_in_Yemen-A_Hybrid_Fuzzy_Delphi_ and_Clustering_Analysis_of_Technical_Barriers/29483738?file=56000291) and archived via [https://doi.org/10.6084/m9.figshare.29483738.v4]71).
Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)