Keywords
sample size, rapid review, study design, educational research
This article is included in the Datta Meghe Institute of Higher Education and Research collection.
A rapid review of the published literature was conducted with the goal of evaluating sample size determination in educational research. The sample size, represented by the letter “n,” is a key factor in this research because it specifies the number of participants who represent the target population. Although various studies defining the processes for calculating sample sizes have been published in the literature, there is still much uncertainty. It is vital to understand that there is no single all-encompassing method for determining sample sizes across study designs. Instead, different study designs call for different approaches to determining sample numbers.
Information was retrieved from the databases in accordance with updated PRISMA recommendations. Keywords were used to retrieve the relevant articles from two databases (Google Scholar and PubMed). The articles were selected through thorough scrutiny and application of inclusion and exclusion criteria.
Seven articles were selected from the 9282 retrieved articles. The enrolled studies were compared in relation to their methods, objectives, and outcomes.
The evaluation of the seven studies as a whole concluded that testing any novel approach essentially requires 24.24 participants (rounded to 25) in each group. The median sample size for simulation-based educational research was 30. Further research is required to establish a proper sample size determination based on a single universal formula for all types of designs.
I have made corrections suggested by the reviewer related to the title, objectives, and explanation related to the search strategy, clearly stating the research question and insights.
The term “sample size” describes the number of subjects or observations that make up a study; ‘n’ is typically used to represent this number. The size of a sample affects two statistical properties: 1) the accuracy of estimates and 2) the study's ability to draw inferences.1
Clinical research studies can be categorized as surveys, experiments, observational studies, and other types. Many different factors are involved in excellent research planning. The first step is to define the practical issue. Choosing the relevant participants and controls, as well as the experimental or observational units, is the second stage.
The inclusion and exclusion criteria must be carefully defined and should account for any potential variables that could affect the measurements and units being observed. The study design must be precise, and the procedures must follow the best technique currently available. Based on these considerations, the study's sample size needs to be appropriate for its goals and potential variability. The sample must be “large enough” for an effect of the expected scientific importance to reach statistical significance. At the same time, it is crucial that the study sample not be “too big,” in which case even a statistically significant effect of minor scientific import could be found.2 Sample size also has economic significance: resources may be wasted in an insufficient study because it may not yield valuable results, whereas an excessively large study consumes more resources than required. The sample size of a study involving human or animal subjects is also a crucial ethical concern, because a poorly planned experiment exposes participants to potentially hazardous procedures without contributing new information.3,4 Therefore, calculating the power and sample size is crucial in the design of clinical research. Countless studies printed in national and international journals have been found to disclose sample size estimations incorrectly or to use smaller samples than necessary, which reduced their power.1,2
There is still much confusion despite the fact that numerous studies clarifying the methods of sample size computation have been published in the existing literature. It is crucial to realize that there is no single universal formula for calculating the sample sizes for all study designs. Instead, different study designs require different methods to calculate sample sizes.3,4 The present study was conducted with the objective of providing insights, addressing the challenges, and offering recommendations regarding sample size determination in educational research.
To conduct this rapid review, the Preferred Reporting Items for Systematic reviews and Meta-Analyses search extension (PRISMA-S) criteria were used to guide the search.5 The search strategy aimed to identify relevant articles addressing sample size determination in educational research. The researchers searched databases including PubMed, Embase, and the Cochrane Library; additionally, hand-searching of reference lists and citation tracking was conducted to identify further relevant articles. Search terms related to sample size (e.g., sample size calculation, sample size determination) and educational research (educational studies, pedagogic studies) were used to retrieve data from the databases. To perform the search effectively and refine the results, the search strings were constructed using Boolean operators (AND, OR). The inclusion criteria comprised free full-text, unlocked articles, pertinent terminology and information, and English-language publication. The exclusion criteria covered abstracts, locked articles and journals, data lacking relevance, and languages other than English. Two authors screened the titles and data of the extracted articles to determine eligibility. Data from the extracted articles were documented in a standardized format in a Microsoft Excel sheet. The included articles were assessed using appropriate criteria, such as methodological rigor, relevance to the research question, and transparency of reporting. Finally, all the data were synthesized for common themes, relevance, challenges, and recommendations related to sample size assessment. The principal investigator carried out the entire review planning process, which was authorized by the other authors. The presentation of the entire search is shown in Figure 1.
Seven studies were selected from the 9282 articles retrieved from Google Scholar and PubMed, with the application of stringent inclusion and exclusion criteria. All the information related to the articles is shown in Table 1.
Number of article | Author name | Article type | Objectives | Method | Conclusion |
---|---|---|---|---|---|
1 | McConnell et al.6 | Editorial | The purpose of this editorial was to discuss sample size calculation in the context of a medical research intervention. | To teach nursing and anaesthetic colleagues about programmed intermittent epidural bolus analgesia, the author created a scenario in which they planned to accomplish their goal of estimating the required sample size. To this end, they developed a questionnaire and weekly tests to evaluate their coworkers' understanding of the novel method and the efficacy of the intervention. | The formula produced n = 24.24, rounded up to 25 in each group, for a total sample size of 50 students. It is extremely important to use effect size when estimating the sample size. |
2 | Staffa et al.7 | Review | The purpose of the study, which was conducted by paediatric surgeons, was to disseminate a method for selecting a sample size to identify an effect that would have therapeutic significance through the interpretation and validation of the findings. | Using various instances, the authors applied a five-step technique to validate the sample size and statistical power analyses: defining the primary outcome of interest and the expected effect size and power; identifying the relevant statistic and statistical test to be taken into account; conducting the necessary calculations to obtain the required sample size using software or a reference table; and making a formal power and sample size declaration for the publication, grant application, or project proposal. | The suitable statistical test to employ for sample size calculation depends on the type of data, the clinical hypothesis, and its applications. |
3 | Dreyhaupt et al.8 | Review | The study was performed to describe the implementation and general principles of cluster randomization, and to outline the general aspects of using cluster randomization in prospective two-arm comparative educational research. | The study compared individual randomization with the cluster randomization technique in educational research to evaluate the reduction of systematic bias. It also demonstrated the general principles, implementation, and aspects of cluster randomization in a prospective two-arm study. | Studies that involve cluster randomization require a considerably larger sample size and more complex methods for calculation. |
4 | Cook et al.9 | Systematic review | The study was conducted to determine study power across a range of effect sizes by re-analysing a meta-analysis of simulation-based education. | The authors re-analysed 897 studies and the results of simulation-based education to determine study power across a range of effect sizes. | The median sample size for the 627 no-intervention comparisons was found to be 25, whereas the median sample size for comparisons between different simulation groups was found to be 30. |
5 | Agnihotram 201810 | Review | This article focuses on the determination of the minimal sample size for a variety of objectives, providing a quick overview of the statistical methods employed in various research study phases. | The author discussed the various steps for estimating the sample size. | The study found that the sample size formula was based on the primary research purpose, conclusions, variables, statistical analysis planned, number of groups, and sampling technique. |
6 | Ferreira et al.11 | Review | By using objective methodologies as the standard, the study intended to validate a priori hypotheses and sample sizes for evaluating the intensity and duration of physical activity in a paediatric population. | Data from electronic databases were searched; physical activity intensity was measured by questionnaire and duration was measured by accelerometer. | The study indicated weak to moderate agreement between subjective and objective approaches for determining the intensity and duration of physical activity. Additionally, assessments of the stability of method-to-method agreement were provided by sample sizes of 50 to 99 subjects. |
7 | Guo et al.12 | Review | The goal of the study was to determine the sample size for two independent groups with equal and unequal unknown variances when power and differential cost were both taken into account. | In this study, the Welch approximate test was applied to derive various sample size allocation ratios by minimizing the total cost or, equivalently, maximizing statistical power; two types of hypotheses, superiority and equivalence of two means, were used for sample size planning. | The sample size formula proposed in this study should be used whenever a cost factor is involved and the population variances are unknown and unequal. |
Research in health science education is expanding. Emerging educational research relies on relevant conceptual frameworks, reliable research techniques, and important discoveries.13,14 Prior reviews have shown that many educational research articles employ small sample sizes, and that researchers rarely take into account the expected effect size, plan the sample size in advance, or describe the actual precision when evaluating the results.9,15,16
According to the definition of statistical power, it is “the likelihood that the null hypothesis will be rejected in the sample if the observed effect in the population is equal to the effect size”.17 In other words, power is the probability that a study will uncover a real, statistically significant effect. Studies with higher power are preferable because lower-power studies may miss potentially important connections. A power of 90% is ideal, and 80% is typically considered the minimum. Power is affected by the sample size (the number of observations), the effect size (the magnitude of the effect), and the risk of a type I error (the likelihood of recognising a “significant” difference when there is none, represented by alpha).9,18
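The dependence of power on sample size, effect size, and alpha can be made concrete with the standard normal approximation for a two-sided, two-group comparison of means. The sketch below is illustrative only; the function name and example numbers are assumptions, not taken from the reviewed studies:

```python
from statistics import NormalDist

def two_sample_power(d, n_per_group, alpha=0.05):
    """Approximate power of a two-sided, two-group comparison of means.

    Uses the normal approximation power = Phi(d * sqrt(n/2) - z_{1-alpha/2}),
    where d is the standardized effect size (Cohen's d).
    """
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    return NormalDist().cdf(d * (n_per_group / 2) ** 0.5 - z_crit)

# With 25 participants per group and a large effect (d = 0.8),
# power comes out at roughly 0.81, just above the conventional 80% floor.
print(round(two_sample_power(0.8, 25), 2))
```

Shrinking d, n, or alpha while holding the others fixed lowers the result, which is exactly the trade-off described above.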
The study used a convenience sample of primary research in simulation-based education to examine sample size determination in education. First, most research in the sample only had the power to find effects of moderate to large magnitude (SMD ≥ 0.8), while other studies only had the power to find effects of immensely large magnitude (≥ 2 standard deviations). Most of the negative studies, that is, those that did not find a statistically relevant difference, had very broad confidence intervals (CI), signifying the possibility of large and likely important differences. This second discovery is connected to the first point: in these trials, the lack of a statistically relevant outcome did not establish the superiority or equivalence of the interventions under study.9
In one study, the author aimed to present sample size calculations in the context of medical educational interventions and focused on computing sample sizes to compare distinct groups where the result was a continuous (interval or ratio) dependent variable, such as in interventional designs. The criteria for forecasting the sample group, such as the relevance factor, preferred statistical significance, predicted difference in score, and approximate evaluation variation (which may be estimated from previous studies), were discussed in order to determine the number of participants required to assess the effects of an intervention on a specific outcome or the association between variables.6,19 Interventions in education frequently concentrate on changing latent conceptions, which are theoretical and cannot be readily seen or quantified. This causes the validated scales to vary, changing how the outcome measures are calculated. The educational researcher advocated the use of effect size in determining the sample size. The study design often affects the relationship between larger effect sizes and smaller sample sizes. Effect sizes are accordingly categorized as “small,” “medium,” and “large” for values of 0.20, 0.50, and 0.80, respectively.20 Finally, the meta-analysis revealed that the required sample size for each group was 24.24.21,22
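The per-group figure quoted above can be approximated with the standard normal-approximation formula n = 2(z₁₋α/₂ + z_power)²/d². This is a sketch, not the authors' exact computation: with Cohen's large effect (d = 0.8), 5% significance, and 80% power it yields about 24.5 per group (rounded up to 25), close to but not exactly the 24.24 reported, whose precise value depends on the means and standard deviations the authors assumed.

```python
from math import ceil
from statistics import NormalDist

def n_per_group(d, alpha=0.05, power=0.80):
    """Per-arm sample size for comparing two means (normal approximation):
    n = 2 * (z_{1-alpha/2} + z_{power})**2 / d**2.
    """
    z = NormalDist().inv_cdf
    return 2 * (z(1 - alpha / 2) + z(power)) ** 2 / d ** 2

raw = n_per_group(0.8)   # about 24.5 before rounding
print(ceil(raw))         # 25 per group, i.e. 50 participants in total
```

Because n scales with 1/d², halving the expected effect size roughly quadruples the required sample.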
Further, the authors also discussed errors to avoid. The first is treating sample size estimation as simply small, medium, or large, which fails to account for the accuracy of the evaluation tool and the sample characteristics.23 Second, unless necessary, researchers should avoid creating new institute-specific assessment instruments, because these must be validated for accuracy and reliability before use in interventional studies.24 Third, the prospective dropout and attrition rates must be considered.25 Finally, researchers should avoid equating the effect size with its true significance and should employ a confidence interval that conveys the precision of the sample and effect sizes.6
The objective of the report was to establish the optimal number of subjects for a study during the planning stage, with sufficient patients to resolve the most clinically important questions, using statistical power calculations. It evaluated the sample size that must be randomised to each arm in order to achieve the standard 80% or 90% power to find a clinically meaningful effect in randomised controlled trials, which frequently use parallel group designs. The need for a control arm, statistical comparability, structural equality, and resemblance of management conditions and observations are among the themes that the author elaborated on as being essential for educational research investigations. If an educational research study exhibits these traits and the test arm's success is significantly greater than that of the control arm, the difference cannot be the result of coincidence. Cluster randomization is usually performed for non-therapeutic interventions such as prevention programs, healthcare programs, and training programs. A cluster may contain from two to thousands of individuals, and educational research may also involve different cluster sizes.8
Minimizing or reducing contamination bias is the fundamental reason for performing cluster-randomized studies. Observations inside clusters are typically more comparable to one another than observations from distinct clusters, creating a unique data structure known as statistical dependency. The effective sample size of a cluster-randomized study is less than the actual sample size (i.e., the number of enrolled students), which has an impact on the sample size computation. Consequently, it is inappropriate to use typical methods that presume the statistical independence of all observations to determine the sample size for cluster-randomized investigations.8
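The gap between enrolled and effective sample size is usually quantified with the design effect, DEFF = 1 + (m − 1)·ICC, where m is the cluster size and ICC the intracluster correlation. The numbers below are hypothetical, chosen only to illustrate the inflation:

```python
def design_effect(cluster_size, icc):
    """Variance inflation due to cluster randomization: 1 + (m - 1) * ICC."""
    return 1 + (cluster_size - 1) * icc

def effective_n(enrolled_n, cluster_size, icc):
    """Effective sample size: enrolled subjects divided by the design effect."""
    return enrolled_n / design_effect(cluster_size, icc)

# Hypothetical trial: 10 classes of 30 students each, ICC = 0.05.
# DEFF = 1 + 29 * 0.05 = 2.45, so the 300 enrolled students carry the
# information of only ~122 independent observations; conversely, a sample
# size planned for individual randomization must be inflated by ~2.45.
print(round(effective_n(300, 30, 0.05)))
```

Even a small ICC produces a large inflation when clusters are big, which is why cluster-randomized educational studies need markedly larger samples.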
The purpose of the study, which was conducted by paediatric surgeons, was to disseminate a method for selecting a sample size to identify an effect that would have therapeutic significance through the interpretation and validation of the findings. Using a five-step approach, the minimum sample size necessary to ensure sufficient power and accurate interpretation of the study's findings can be calculated.7 The achievable sample size for assessing a significant effect on the basis of research or primary data must be justified using a power calculation. The research sample size should have adequate statistical power to identify clinically meaningful effects in scientific investigations.7,26 The sample size of the prior control group determined the statistical power. To compare the two groups effectively, comparisons must be made with a historical control group that is comparable to the research group, for which data on assessed confounders are available. The suggested five-step approach can be used with any type of data or study design, although power and sample size primers do not provide examples for every possible research circumstance. The fundamental objective of the primers was to compare two treatment groups. However, due to multiplicity and multiple testing, there is a higher risk of false-positive results (Type I error) when comparing more than two groups.27,28
Guo et al. used two different types of hypotheses, taking into account sample size planning factors such as superiority/non-inferiority and equivalence of two means. When population variances are unknown, no exact sample size can be found through the traditional sample size formula, and the resulting sample size must be large enough to meet the required level of significance and the probability of a correct decision (power). The cost constraint depends on the two experimental goals for a given level of α and power 1−β, i.e., the allocation with minimal total cost, and the allocation ratios are a function of the unit cost ratio and the standard deviations.12
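A classical result consistent with this cost argument (though not necessarily the exact formula Guo et al. derive for the Welch test) is that minimizing total cost at fixed power leads to the allocation ratio n₁/n₂ = (σ₁/σ₂)·√(c₂/c₁). The function and numbers below are an illustrative sketch under that assumption:

```python
def allocation_ratio(sd1, sd2, cost1, cost2):
    """Cost-optimal allocation n1/n2 = (sd1/sd2) * sqrt(cost2/cost1):
    sample the more variable and cheaper-to-recruit group more heavily."""
    return (sd1 / sd2) * (cost2 / cost1) ** 0.5

# Hypothetical: group 1 is twice as variable and costs half as much per
# subject, so it should receive about 2 * sqrt(2) subjects per group-2 subject.
print(round(allocation_ratio(2.0, 1.0, 1.0, 2.0), 2))
```

When variances and unit costs are equal the ratio collapses to 1, recovering the familiar equal-allocation design.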
Historically, three methods have been employed to determine the sample size. The first is an interval strategy, where the confidence level is high (e.g., 95%) and the sampling error between the true parameter and its estimate is kept to a preset modest amount, say 3 percent. Since there is no hypothesis testing involved in this method, no threshold of significance is required. The second is a hypothesis-related approach in which both the null and alternative hypotheses must be precisely specified beforehand to detect a significant difference between the parameters under study while simultaneously meeting the required level of significance (Type I error rate) and the desired power (probability of correctly accepting the specified alternative). The third strategy uses an “indifference zone,” where populations that perform better than the others are placed in a zone where they are more likely to be chosen correctly.29
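The interval strategy described first can be sketched for a proportion: with confidence level 1 − α and margin of error e, n = z²·p(1 − p)/e², using the conservative p = 0.5 when no prior estimate exists. This example reuses the 95%/3% figures quoted above; it is a generic illustration, not a calculation from the cited reference:

```python
from math import ceil
from statistics import NormalDist

def n_for_proportion(margin, confidence=0.95, p=0.5):
    """Precision-based sample size for a proportion: z^2 * p * (1-p) / margin^2.
    p = 0.5 is the conservative default, since it maximizes p * (1 - p)."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return ceil(z ** 2 * p * (1 - p) / margin ** 2)

# 95% confidence, 3% sampling error, no prior estimate of the proportion:
print(n_for_proportion(0.03))   # 1068 subjects
```

Note how the requirement grows with the square of the desired precision, with no significance threshold involved anywhere in the calculation.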
“A priori” literally translates from Latin as “what comes before”; a priori hypotheses are a fundamental part of the scientific method since they are created based on assumptions.30 From these assumptions, three hypotheses were inferred. With reference to objective methodologies, the aim of this systematic review was to offer proof for a priori hypotheses and sample sizes for evaluating the intensity and duration of physical activity in a pediatric population. The results of the systematic review suggest that the degree of agreement between subjective and objective measures for determining the intensity and duration of physical exercise should be assumed to be modest to moderate.11
Currently, there are no data to support an a priori assumption regarding how well the different methods of assessment agree. To select a sample size, attain precision, or have sufficient power to reject a false null hypothesis, a robust a priori hypothesis is necessary. Cost and feasibility, which are frequently the true drivers of the sample size, cannot be disregarded by researchers. Nonetheless, typical power calculations yield specific sample sizes only by making precise assumptions. This study’s results indicate that, for assessing nearly all physical activity intensity and duration parameters, a sample size of 50–99 subjects offers stable agreement between subjective and objective approaches. The degree of uniformity displayed in each (often non-representative) sample studied, the accuracy of the subjective method created for a target sample, and the inadequacy of the correlation coefficient for detecting agreement issues are all potential explanations for stable agreement in this sample size interval. Additionally, studies with small samples showed higher levels of variability in the range of findings, perhaps as a result of their inferior design compared to studies with larger samples.31
The “vibration of effects” diminishes the reliability of the agreement measures in samples with fewer than 50 respondents. For studies with samples of 100 or more persons, the study suggests that the decreased reliability of the agreement measures is driven primarily by researchers overlooking the exaggerated effects that arise when findings from small-sample trials are carried forward.32 Methodologically superior systematic evaluations addressing the agreement between subjective and objective measures for assessing physical activity have frequently found low methodological quality in the included studies.32–35
The COSMIN checklist, which was employed in the cited study, identified the absence of an a priori hypothesis and a small sample size (n = 50) as the primary factors affecting the methodological standard of the retrieved studies. These factors were followed by a lack of data regarding missing subjects and the way in which missing data were handled. The authors argued that questionnaires, diaries, and/or logs that received low ratings in methodological quality evaluations are ineffective tools for gathering subjective data.32
The sample size depended on the degree of heterogeneity when the analysis was performed by multiple investigators and teams. Moreover, studies with limited data showed higher levels of variability in the range of findings, perhaps as a result of their inferior design compared to studies with larger samples.33,34
A statistician is essential to determine the number of subjects and analyse the final results of the entire investigation. To perform a suitable, well-defined study that produces rational and trustworthy implications that can be applied to the sample population, it is crucial for the investigator to understand the fundamentals of analytical methods. Clinicians can use statistics to extract crucial information from empirical data, which improves patient care. Statistical notions must be considered from the initial planning stage to the final reporting phase. In general, there are two sorts of sample size estimation problems: the sample size for (a) an estimation study and (b) a hypothesis-testing, or comparison, study.10
When performing an estimation study, the researcher is interested in estimating the quantity of one or more parameters, such as the mean haemoglobin level or arthritis prevalence. In studies that test hypotheses, researchers are interested in comparing population characteristics at one or more time points, or characteristics of two or more populations; for instance, they might compare the prevalence of arthritis between two populations before and after the administration of an intervention. A researcher should select a larger number of people if they want the estimation in their study to be more precise, because as the required accuracy grows (i.e., the margin of error shrinks), the minimum sample size necessary increases. Likewise, estimating a parameter at a confidence level above 95% requires a sample size greater than that needed at the 95% level. In studies testing hypotheses, the computation of the sample size aims to obtain the desired power for detecting a difference that is therapeutically or experimentally significant at a predetermined significance level.35
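The precision/sample-size trade-off for an estimation study can be sketched for a mean: n = (z·σ/E)², where E is the acceptable margin of error. The haemoglobin numbers below are hypothetical, used only to show how halving the margin roughly quadruples the requirement:

```python
from math import ceil
from statistics import NormalDist

def n_to_estimate_mean(sd, margin, confidence=0.95):
    """Sample size to estimate a population mean to within +/- margin:
    n = (z * sd / margin)**2, rounded up."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return ceil((z * sd / margin) ** 2)

# Hypothetical: SD of haemoglobin = 1.5 g/dL, desired margin = 0.3 g/dL.
print(n_to_estimate_mean(1.5, 0.3))    # 97 subjects
# Halving the margin to 0.15 g/dL roughly quadruples the requirement:
print(n_to_estimate_mean(1.5, 0.15))   # 385 subjects
```

Raising the confidence level (e.g. to 99%) enlarges z and hence n, mirroring the point made in the text.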
Statistically, there are various methods, tests, and formulas for estimating the sample size required to perform research and other relevant studies. However, research establishing a single appropriate number for performing any study is still lacking, beyond conventions such as a pilot study enrolling 12 participants in each group for a particular trial.36
This review highlights the diverse approaches and procedures used in many studies to determine the appropriate sample sizes for educational research. It also emphasizes the difficulties in defining the main outcome, choosing the right statistical tests, and taking effect size and statistical power into consideration. This article fosters a critical comprehension of the results and their relevance in various research contexts. Along with the recommendations, key considerations include defining research objectives, selecting appropriate study designs, and ensuring adequate statistical power. Through adherence to PRISMA-S guidelines for rapid reviews, the article emphasizes the importance of transparency and rigor in the review process. This commitment to methodological rigor enhances the credibility and trustworthiness of the insights presented.
The review suggests that the sample size should be considered as early as possible in the research phase to gather more insightful background that will fundamentally have a stronger influence on pedagogic application. All types of research investigations require the determination of sample size, and selecting the appropriate formula is essential. A suitable sample size formula should be chosen according to the study's main goal, outcome variable, study plan, intended statistical analysis, study groups, and sampling procedure. The sample population needed for a study is determined by a variety of variables, including the feasibility of the study, its power, the accuracy of the calculated value, its analytical relevance and confidence level, its ability to detect a clinically significant difference, and other factors, such as financial support, workforce, subject availability, and time. Studies involving cluster randomization require a larger sample size and a complex method for calculations. The sample size for testing any new method basically required 24.24 (rounded to 25) members in each group. The median sample size for simulation-based educational research was 30. Further research is needed on the appropriate sample size and on a universal single formula covering every study design.
All data underlying the results are available as part of the article and no additional source data are required.