<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.172015.3</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Closing the Gap in Early Mathematics: Domain and Cognitive Insights from TIMSS 2023 in South Africa and Singapore</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 3; peer review: 2 approved with reservations, 1 not approved]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Mokgwathi</surname>
                        <given-names>Mathelela Steyn</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-8085-7447</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Early Childhood Education, University of South Africa, Pretoria, Gauteng, South Africa</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:steyn.mokgwathi@gmail.com">steyn.mokgwathi@gmail.com</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>3</day>
                <month>4</month>
                <year>2026</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2025</year>
            </pub-date>
            <volume>14</volume>
            <elocation-id>1209</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>27</day>
                    <month>2</month>
                    <year>2026</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 Mokgwathi MS</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/14-1209/pdf"/>
            <abstract>
                <sec>
                    <title>Background</title>
                    <p>South Africa continues to underperform in primary mathematics, yet overall mean comparisons obscure where achievement gaps are most concentrated. This study examines domain-specific patterns of mathematics achievement using data from the Trends in International Mathematics and Science Study (TIMSS) 2023, benchmarking South African Grade 5 learners against Singaporean Grade 4 learners assessed on the same TIMSS Grade 4 mathematics framework.</p>
                </sec>
                <sec>
                    <title>Methods</title>
                    <p>A quantitative secondary analysis was conducted using nationally representative TIMSS 2023 datasets comprising 10,424 South African learners from 285 schools and 6,530 Singaporean learners from 181 schools. South Africa assessed learners in Grade 5 using the TIMSS Grade 4 mathematics instruments, consistent with TIMSS procedures where curriculum exposure aligns more closely with the benchmark framework. Mathematics achievement was analysed across content domains (number, measurement and geometry, data) and cognitive domains (knowing, applying, reasoning). Weighted estimates, replicate-based variance estimation, and effect sizes (Cohen's d) were computed in accordance with IEA technical guidelines. The analysis is descriptive and comparative rather than causal.</p>
                </sec>
                <sec>
                    <title>Results</title>
                    <p>South African learners performed substantially below the international centre point across all domains. The largest content-domain gap relative to Singapore was observed in Measurement and Geometry, indicating pronounced weaknesses in spatial reasoning. The largest cognitive-domain gap occurred in Knowing, reflecting fragile foundational fluency. Although applying was relatively stronger within the South African profile, it remained substantially below benchmark levels. Achievement gaps were therefore concentrated in foundational and spatial domains rather than uniformly distributed.</p>
                </sec>
                <sec>
                    <title>Conclusions</title>
                    <p>Concentrated achievement gaps in foundational knowledge and geometry characterise the underperformance of primary mathematics in South Africa. Strengthening procedural fluency, prioritising spatial reasoning, and improving curriculum alignment between intended, implemented, and assessed domains are highlighted as key areas for focused attention. The findings provide a structured diagnostic profile that may inform targeted, domain-specific curriculum and pedagogical considerations aimed at improving equitable foundational learning outcomes.</p>
                </sec>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>Trends in International Mathematics and Science Study (TIMSS)</kwd>
                <kwd>Mathematics Achievement</kwd>
                <kwd>Content and Cognitive Domains</kwd>
                <kwd>South Africa</kwd>
                <kwd>Singapore</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The author(s) declared that no grants were involved in supporting this work.</funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 2</title>
                <p>This revised version strengthens the conceptual clarity, methodological transparency, and analytical precision of the original article. First, the introduction has been restructured to more clearly identify the specific achievement gaps motivating the study. The purpose of disaggregating performance across TIMSS content and cognitive domains is now articulated earlier, and the "achievement gap" is explicitly operationalised in the results section. Second, the research questions have been refined to improve focus and coherence. The empirical analysis of content and cognitive domains is clearly separated from the interpretive discussion of curriculum and pedagogical implications. Third, the conceptual framework has been streamlined to enhance theoretical parsimony. The manuscript now integrates two complementary frameworks only: the TIMSS domain structure and curriculum alignment theory. Figure 1 has been redesigned accordingly to reflect this dual structure. Fourth, the Methodology section has been improved by adding more technical information about plausible values, how to estimate variance using replication, and the fact that the comparative design is descriptive and not meant to show cause. The rationale for benchmarking South African Grade 5 learners against Singaporean Grade 4 learners is clarified in terms of instrument equivalence within the TIMSS framework. Lastly, the Results, Discussion, and Conclusion sections have been changed to better match the real-world results.&#x00a0;Domain-specific achievement gaps are now explicitly identified, and interpretive claims have been moderated to ensure that conclusions remain proportionate to the descriptive comparative design. These revisions enhance the rigour, coherence, and transparency of the study while preserving its core empirical contribution.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec id="sec5" sec-type="intro">
            <title>1. Introduction</title>
            <p>Mathematics achievement at the primary school level has profound implications for learners&#x2019; future participation in education, work, and society. Basic mathematics skills support higher-order thinking, problem-solving, and lifelong learning, all of which are central to academic success and broader social participation, as consistently demonstrated in international large-scale assessment research (
                <xref ref-type="bibr" rid="ref22">Mullis et al., 2020</xref>; 
                <xref ref-type="bibr" rid="ref34">von Davier et al., 2024</xref>). The Trends in International Mathematics and Science Study (TIMSS) has become a key tool for comparing math achievement around the world. It shows how different countries' curricula, teaching methods, and education systems get students ready for the mental challenges of math. South Africa has participated in TIMSS since 1995 and, despite modest gains over successive cycles, continues to rank among the lowest-performing education systems internationally (
                <xref ref-type="bibr" rid="ref22">Mullis et al., 2020</xref>; 
                <xref ref-type="bibr" rid="ref27">Reddy &amp; Hannan, 2019</xref>; 
                <xref ref-type="bibr" rid="ref36">Zuze et al., 2018</xref>).</p>
            <p>Much of the existing research focuses on Grade 9 data, emphasising long-term learning deficits and systemic inequities (
                <xref ref-type="bibr" rid="ref19">Mensah &amp; Baidoo-Anu, 2022</xref>; 
                <xref ref-type="bibr" rid="ref27">Reddy &amp; Hannan, 2019</xref>). Far less attention has been given to the primary phase, particularly Grade 5, where learners consolidate foundational numeracy and begin transitioning from concrete to abstract reasoning. Empirical studies focusing specifically on Grade 5 domain-level analysis remain limited, particularly within the South African context. The TIMSS 2023 results show that South African Grade 5 students got an average score of 362, while Singapore's Grade 4 students got an average score of 615 (TIMSS, 2023). This 250-point difference, despite Singaporean learners being a grade lower, signals substantial foundational weaknesses in South Africa&#x2019;s mathematics education system. More specifically, this disparity reflects structured achievement gaps across both mathematical content domains and cognitive domains, rather than a uniform decline across all aspects of mathematics learning.</p>
            <p>Preliminary TIMSS 2023 results indicate that these gaps are particularly concentrated in measurement and geometry, as well as in the knowing cognitive domain, suggesting fragile foundations in spatial reasoning and procedural fluency, which are critical for overall mathematical competence and problem-solving skills. Within TIMSS 2023, while the international benchmarking grades are 4 and 8, some education systems assess an adjacent grade using the same internationally calibrated instruments when curriculum alignment indicates that the benchmark framework better reflects learners&#x2019; opportunity to learn. In this study, South Africa&#x2019;s Grade 5 cohort is analysed because it was assessed using the TIMSS Grade 4 mathematics instrument and framework, enabling benchmarking with Singapore&#x2019;s Grade 4 results on a common measurement scale (
                <xref ref-type="bibr" rid="ref22">Mullis et al., 2020</xref>; 
                <xref ref-type="bibr" rid="ref34">von Davier et al., 2024</xref>). The comparison therefore does not involve different curricular standards, but rather two cohorts assessed on the same TIMSS Grade 4 framework and reporting scale, ensuring measurement equivalence.</p>
            <p>National assessments such as the Annual National Assessments (ANA), now discontinued, and systemic evaluation reports have consistently highlighted low levels of mathematics achievement among learners, yet they rarely examine how performance is distributed across specific content and cognitive domains. TIMSS distinguishes between three content domains (numbers, measurement and geometry, and data) and three cognitive domains (knowing, applying, and reasoning), enabling a more diagnostic analysis of learner performance. In this study, the term &#x201c;achievement gap&#x201d; refers to measurable differences in scale scores relative to the TIMSS international centre point and a high-performing benchmark system assessed on the same framework. The focus is therefore diagnostic and comparative rather than causal. Research from high-performing systems such as Singapore demonstrates that consistent curriculum alignment, spiral progression, and scaffolded teaching support balanced development of knowledge, application, and reasoning (
                <xref ref-type="bibr" rid="ref6">Choy &amp; Dindyal, 2024</xref>; 
                <xref ref-type="bibr" rid="ref14">Low &amp; Wong, 2021</xref>; 
                <xref ref-type="bibr" rid="ref21">Morony, 2023</xref>; 
                <xref ref-type="bibr" rid="ref22">Mullis et al., 2020</xref>), while South African studies point to persistent challenges in geometry, reasoning, and teacher content knowledge (
                <xref ref-type="bibr" rid="ref17">Maqoqa, 2024</xref>; 
                <xref ref-type="bibr" rid="ref32">Taylor, 2021</xref>).</p>
            <p>Compounded by curriculum overload and large class sizes, these challenges limit opportunities for formative assessment and conceptual engagement. Improving mathematics achievement therefore requires more than curriculum reform; it depends on strengthening coherence between curriculum design, teacher professional development, and classroom practice. Understanding how learners engage with content and cognitive demands is essential for identifying areas requiring targeted instructional support and pedagogical innovation.</p>
            <p>This study therefore examines South African Grade 5 learners&#x2019; mathematics achievement in TIMSS 2023, disaggregated across content and cognitive domains and benchmarked against Singapore&#x2019;s Grade 4 performance on the same TIMSS Grade 4 framework. By identifying where achievement gaps are most pronounced, the study provides a domain-specific diagnostic profile to inform curriculum alignment and pedagogical reform. It contributes in three key ways: first, by focusing on the under-researched area of upper primary mathematics learning; second, by disaggregating performance across content and cognitive domains to identify patterns of strength and weakness; and third, by linking these patterns to curriculum and pedagogical implications aimed at strengthening foundational knowledge, geometry instruction, and teacher professional development.</p>
            <sec id="sec6">
                <title>1.1 Research Questions</title>
                <p>

                    <list list-type="order">
                        <list-item>
                            <label>1.</label>
                            <p>What is the achievement profile of South Africa&#x2019;s Grade 5 learners across the TIMSS 2023 Grade 4 mathematics content domains (number, measurement and geometry, and data)?</p>
                        </list-item>
                        <list-item>
                            <label>2.</label>
                            <p>What is the achievement profile of South Africa&#x2019;s Grade 5 learners across the TIMSS 2023 Grade 4 mathematics cognitive domains (knowing, applying, and reasoning)?</p>
                        </list-item>
                        <list-item>
                            <label>3.</label>
                            <p>When benchmarked against Singapore&#x2019;s Grade 4 learners on the TIMSS Grade 4 framework, where are the largest achievement gaps across content and cognitive domains, and what are their implications for curriculum and pedagogy?</p>
                        </list-item>
                    </list>
                </p>
            </sec>
            <sec id="sec7">
                <title>1.2 Literature Review: Benchmarking South Africa&#x2019;s Grade 5 Results on the TIMSS Primary Mathematics Assessment against Singapore&#x2019;s Grade 4</title>
                <p>

                    <bold>Benchmarking with TIMSS 2023</bold>
                </p>
                <p>The Trends in International Mathematics and Science Study (TIMSS) is a global standard for measuring how well primary and secondary school students do in math. Singapore was consistently ranked as the top achiever, with learners assessed at Grade 4 achieving an overall average of 615 points, while South Africa, assessed at Grade 5, scored 362 points. This means that Singaporean learners who are on average a year younger still outperform South African learners by more than 250 points (
                    <xref ref-type="bibr" rid="ref34">von Davier et al., 2024</xref>). The magnitude of this achievement gap underscores the need to analyse not just overall scores but also performances across content domains (numbers, measurements, geometry, and data) and cognitive domains (knowing, applying, and reasoning) to understand how curricula and teaching practices shape outcomes.</p>
                <p>

                    <bold>Curriculum Alignment and Content Domains</bold>
                </p>
                <p>Singapore&#x2019;s mathematical curriculum is internationally recognised for its coherence and spiral structure, which involves systematically revisiting concepts at increasing levels of complexity. In TIMSS 2023, Singapore scored 613 in numbers, 619 in measurement and geometry, and 616 in data, while South Africa scored 362, 353, and 362, respectively. 
                    <xref ref-type="bibr" rid="ref17">Maqoqa (2024)</xref> and 
                    <xref ref-type="bibr" rid="ref30">Tachie (2020)</xref> reported that the largest achievement gap is in measurement and geometry (266 points), an area long identified as a &#x201c;blind spot&#x201d; in South African classrooms. These gaps suggest that South African learners struggle with reasoning and geometric domains, while Singaporean learners benefit from early exposure to concrete manipulatives, reasoning, and visual models that build conceptual skills and knowledge.</p>
                <p>

                    <bold>Cognitive Demands: Knowing, Applying, and Reasoning</bold>
                </p>
                <p>TIMSS distinguishes between three cognitive domains: knowing, applying, and reasoning. Singapore&#x2019;s Grade 4 learners achieved 624 in knowing, 615 in applying, and 609 in reasoning, whereas South Africa&#x2019;s Grade 5 learners scored 357, 366, and 363. This result reveals a profound weakness in knowing (&#x2013;267 points compared to Singapore), which reflects learners&#x2019; difficulties with knowing domains. South African learners performed slightly better in the applying domain (366) relative to their average, suggesting that when knowledge is available, learners can engage in routine applications. The reasoning domain, on the other hand, shows a persistent weakness. The result means that South African learners are not being prepared for non-routine, multi-step problem-solving tasks, which is a strong point of the Singaporean system.</p>
            </sec>
            <sec id="sec8">
                <title>1.3 Instructional and Structural Factors</title>
                <p>Several systemic factors reinforce these disparities. According to 
                    <xref ref-type="bibr" rid="ref18">Meier and West (2020)</xref>, South Africa&#x2019;s classrooms often suffer from overcrowding, with class sizes averaging over 50 learners, which hinders formative feedback and personalised support. Teacher content and pedagogical knowledge remain uneven, particularly in geometry and measurement (
                    <xref ref-type="bibr" rid="ref3">Bhagwonparsadh &amp; Pule, 2024</xref>; 
                    <xref ref-type="bibr" rid="ref32">Taylor, 2021</xref>). In contrast, Singapore invests heavily in sustained teacher development, smaller class sizes, and instructional leadership, creating a conducive learning environment where consistent teaching and learning take place. Furthermore, although South Africa&#x2019;s curriculum aims for comprehensive coverage, it has faced criticism for being &#x201c;overloaded&#x201d; and not allowing adequate time for the mastery of fundamental skills (
                    <xref ref-type="bibr" rid="ref20">Milne &amp; Mhlolo, 2021</xref>). By contrast, Singapore&#x2019;s Concrete&#x2013;Pictorial&#x2013;Abstract (CPA) approach deliberately scaffolds learning so that conceptual understanding precedes abstraction, enabling positive learner achievement in higher-order reasoning (
                    <xref ref-type="bibr" rid="ref13">Leong et al., 2015</xref>; 
                    <xref ref-type="bibr" rid="ref15">Lutfi &amp; Dasari, 2024</xref>).</p>
            </sec>
            <sec id="sec9">
                <title>1.4 Curriculum&#x2013;Cognitive Alignment in International Research</title>
                <p>International evidence further illustrates how disparities between curriculum objectives and classroom practices shape academic learner achievement. 
                    <xref ref-type="bibr" rid="ref35">Y&#x0131;lmaz et al. (2021)</xref> found that, while mathematics curricula emphasised reasoning, textbook activities leaned more toward application, creating a mismatch between intended and taught cognitive emphases. Similarly, Bulut and Ta&#x015f;p&#x0131;nar-&#x015f;ener (2023) reported that the application domain is most frequently prioritised in the secondary mathematics curriculum, disregarding the primary mathematics curriculum, whereas the emphasis placed on knowledge and reasoning differs across grade levels. In primary schools, 
                    <xref ref-type="bibr" rid="ref24">Pertiwi and Wahidin (2020)</xref> showed that fourth-grade assessments are dominated by number-related content, with far less attention given to geometry or data activities. These results reflect TIMSS&#x2019;s framework, where knowing entails factual recall and procedural fluency, applying involves transferring knowledge to structured contexts, and reasoning requires non-routine problem-solving and critical thinking (
                    <xref ref-type="bibr" rid="ref23">Peduk &amp; Ate&#x015f;, 2019</xref>). Importantly, the study also found that content domains exert a more positive effect on mathematics learner achievement than cognitive domains. This body of research provides an explanatory lens for South Africa&#x2019;s TIMSS 2023 mathematics learner achievement. The relative strengths of South African Grade 5 learners in the application domain, alongside their persistent weaknesses in knowledge and reasoning, reflect the misalignment noted by several studies internationally, where instruction and textbooks emphasise routine application but fail to develop the basic knowledge and reasoning capacity required for learner progression. In contrast, Singapore&#x2019;s balanced curriculum and pedagogy demonstrate how alignment across content and cognitive domains fosters sustained learner achievement.</p>
            </sec>
            <sec id="sec10">
                <title>1.5 TIMSS: A Diagnostic Instrument</title>
                <p>TIMSS provides a structured empirical basis for analysing domain-specific performance patterns, beyond broad structural explanations. The comparison with Singapore shows that South Africa&#x2019;s curriculum is superficially similar in content but not in cognitive expectations, especially when it comes to knowing and reasoning. This mismatch leaves learners underprepared for both academic progression and broader applications of mathematics in their everyday lives. The fact that Singapore&#x2019;s Grade 4 learners significantly outperform South Africa&#x2019;s Grade 5 learners further shows that the gap in mathematics achievement among learners is not simply attributable to learner age or exposure but to differences in curriculum coherence, cognitive scaffolding, and teacher training and development.</p>
                <p>The TIMSS 2023 results highlight not just the scale of South Africa&#x2019;s underperformance but also its domain-specific and cognitive weaknesses. While Singapore demonstrates how curriculum coherence and sustained teacher training and development can support an understanding of teaching and learning across content and cognitive domains, South Africa&#x2019;s challenges are rooted in weak foundations, gaps in geometry and measurement, and systemic barriers in teacher training, development, and the classroom environment. Addressing these challenges requires targeted curriculum reforms in early-grade numeracy, curriculum streamlining, and teacher professional development, with a particular emphasis on geometry, critical thinking (reasoning), and basic knowledge. Context-sensitive adaptations, rather than comprehensive importation of Singapore&#x2019;s model, offer a pathway for South Africa to strengthen learner trajectories in mathematics.</p>
            </sec>
            <sec id="sec11">
                <title>1.6 Conceptual Framework</title>
                <p>This study is guided by the TIMSS conceptual model, which distinguishes between mathematical content domains (numbers, measurements, geometry, and data) and cognitive domains (knowing, applying, and reasoning) (
                    <xref ref-type="bibr" rid="ref22">Mullis et al., 2020</xref>). These cognitive domains broadly align with established constructs of mathematical proficiency, including conceptual understanding, procedural fluency, and adaptive reasoning (
                    <xref ref-type="bibr" rid="ref43">Findell et al., 2001</xref>). Together, these dimensions provide a diagnostic lens for examining not only what learners are expected to know but also how they engage with mathematical tasks at increasing levels of cognitive demand. In light of South Africa&#x2019;s historically low mathematics achievement, this framework facilitates the identification of domain-specific deficiencies in foundational knowledge, procedural proficiency, and higher-order reasoning, which are frequently obscured by overall performance averages. Within this study, the TIMSS framework functions as the primary analytical structure, guiding the disaggregation of achievement results and the identification of domain-specific achievement gaps.</p>
                <p>The study employs curriculum alignment theory, which emphasises the coherence between curricular intentions, classroom enactment, and assessment demands, to interpret these patterns (
                    <xref ref-type="bibr" rid="ref26">Porter, 2002</xref>). Alignment theory posits that meaningful learning gains are most likely when the intended curriculum, the implemented curriculum, and the attained curriculum operate in concert. In this study, curriculum alignment theory serves as the principal interpretive lens, enabling explanation of why certain content and cognitive domains exhibit more pronounced achievement gaps than others. International evidence from high-performing systems such as Singapore illustrates how strong alignment is achieved through a spiral curriculum that revisits mathematical concepts at progressively higher levels of complexity, supported by the Concrete&#x2013;Pictorial&#x2013;Abstract (CPA) approach that scaffolds learning from concrete representations to abstract reasoning (
                    <xref ref-type="bibr" rid="ref13">Leong et al., 2015</xref>; 
                    <xref ref-type="bibr" rid="ref15">Lutfi &amp; Dasari, 2024</xref>). In contrast, research on South African primary mathematics reveals an overloaded curriculum, limited instructional time for mastery of foundational domains, and persistent weaknesses in geometry and spatial reasoning, compounded by gaps in teacher content knowledge and pedagogical confidence (
                    <xref ref-type="bibr" rid="ref17">Maqoqa, 2024</xref>; 
                    <xref ref-type="bibr" rid="ref32">Taylor, 2021</xref>). Situating South Africa&#x2019;s Grade 5 TIMSS 2023 performance against Singapore&#x2019;s Grade 4 results therefore provides a comparative curriculum lens for examining how differences in sequencing, pacing, and instructional coherence shape progression from basic knowledge to application and reasoning. To ensure conceptual clarity and avoid theoretical fragmentation, the study is grounded in only two complementary frameworks. The TIMSS model structures the measurement and domain disaggregation, while curriculum alignment theory structures the interpretation of domain-specific disparities. This dual framework ensures both diagnostic precision and explanatory coherence without theoretical fragmentation.</p>
                <p>
                    <xref ref-type="fig" rid="f1">
Figure 1</xref> Conceptual framework integrating the TIMSS domain structure and Curriculum Alignment Theory guiding the diagnostic and interpretive analysis.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>
Figure 1. </label>
                    <caption>
                        <title>Adapted conceptual framework.</title>
                    </caption>
                    <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/196841/aecdd10d-03cb-414d-986a-a75288a21b8b_figure1.gif"/>
                </fig>
                <p>
                    <xref ref-type="fig" rid="f1">
Figure 1</xref> illustrates the streamlined conceptual framework, showing how the TIMSS domain structure and curriculum alignment theory interact to guide the analysis. Guided by this framework, the study&#x2019;s methodology operationalises these dimensions through a secondary analysis of the TIMSS 2023 data. The TIMSS framework informs the analysis of content and cognitive domains, while curriculum alignment theory shapes the interpretation of performance patterns in relation to instructional coherence, learning progression, and assessment demands. The following section therefore outlines the research design, participants, data sources, analytical procedures, and ethical considerations employed in the study.</p>
            </sec>
        </sec>
        <sec id="sec12">
            <title>2. Methodology</title>
            <sec id="sec13">
                <title>2.1 Research Design</title>
                <p>This study employed a quantitative secondary data analysis design, using data from the Trends in International Mathematics and Science Study (TIMSS) 2023. This design is particularly appropriate for investigating the mathematics achievement of South African Grade 5 learners across content and cognitive domains for several reasons. First, TIMSS provides a large, internationally standardised dataset that is both rigorous in design and nationally representative, making it suitable for examining learners&#x2019; performance patterns with a high degree of reliability. Second, secondary analysis enables the use of TIMSS&#x2019;s robust psychometric procedures, including item response theory and plausible values, which strengthen the validity of inferences about learners&#x2019; achievement. Third, the TIMSS framework allows for meaningful international benchmarking, making it possible to situate South Africa&#x2019;s performance in relation to high-performing education systems such as Singapore. Guided by the TIMSS assessment framework, the analysis focused on three content domains (numbers, measurement, geometry, and data) and three cognitive domains (knowing, applying, and reasoning). This two-fold focus not only enabled a diagnostic assessment of learners&#x2019; strengths and weaknesses within the South African context but also provided comparative insights to inform curriculum and policy reform globally. Importantly, the South African sample reported here reflects the TIMSS 2023 Grade 5 administration using the Grade 4 mathematics assessment instruments and framework. Within TIMSS procedures, some education systems assess an adjacent grade when curriculum alignment indicates that the benchmark framework better reflects learners&#x2019; opportunity to learn. Consequently, the comparison undertaken in this study is not between different grade curricula but between two cohorts assessed on the same TIMSS Grade 4 mathematics framework and common international reporting scale. The comparison does not assume equivalence in grade-level progression; rather, it examines relative performance under a common TIMSS measurement framework calibrated to ensure comparability of scale scores. The analysis therefore benchmarks Grade 5 in South Africa against Grade 4 in Singapore on an equivalent instrument, enabling comparison of performance patterns across TIMSS content and cognitive domains on the same international reporting scale (
                    <xref ref-type="bibr" rid="ref22">Mullis et al., 2020</xref>; 
                    <xref ref-type="bibr" rid="ref34">von Davier et al., 2024</xref>). It is important to emphasise that this design is descriptive and comparative rather than causal. The study identifies structured achievement gaps across domains but does not infer grade-level learning gains or causal mechanisms.</p>
            </sec>
            <sec id="sec14">
                <title>2.2 Participants</title>
                <p>The South African TIMSS 2023 primary grade sample comprised 10,424 Grade 5 learners from 285 schools who were assessed using the TIMSS Grade 4 mathematics instruments, while Singapore assessed 6,530 Grade 4 learners from 181 schools using the same Grade 4 mathematics assessment framework (
                    <xref ref-type="bibr" rid="ref9">Department of Basic Education, 2024</xref>; 
                    <xref ref-type="bibr" rid="ref34">von Davier et al., 2024</xref>). The International Association for the Evaluation of Educational Achievement (IEA), in collaboration with Statistics Canada, used a two-stage stratified cluster sampling methodology to guarantee nationally representative estimates (
                    <xref ref-type="bibr" rid="ref29">Siegel &amp; Foy, 2024</xref>). In the first stage, schools were selected with probabilities proportional to their size, and in the second stage, intact Grade 5 classes were sampled. The stratification variables included the school sector (public or private), the language of instruction, the geographic region, socioeconomic indicators, the degree of urbanisation, and prior academic achievements (Ibid.). The use of stratified cluster sampling and population weights ensures that reported statistics represent national achievement distributions rather than sample-level estimates.</p>
            </sec>
            <sec id="sec15">
                <title>2.3 Data Collection and Analysis</title>
                <p>Data for this study were drawn from the TIMSS 2023 mathematics assessment and associated contextual background questionnaires administered to participating learners, teachers, and schools. The TIMSS mathematics achievement is reported on an internationally standardised scale with a centre point of 500 and a standard deviation of 100, enabling valid comparisons across countries and education systems. The mathematics assessment comprised 183 items distributed across three content domains, namely number (94 items), measurement and geometry (49 items), and data (40 items), as well as three cognitive domains, namely knowing (58 items), applying (85 items), and reasoning (40 items) (
                    <xref ref-type="bibr" rid="ref28">Reynolds, 2024</xref>). Each learner completed one assessment booklet, with achievement estimates derived using item response theory and reported as plausible values. This design supports reliable population-level estimation while minimising respondents&#x2019; burden (
                    <xref ref-type="bibr" rid="ref45">von Davier, 2020</xref>).</p>
                <p>The analysis focused on South Africa&#x2019;s Grade 5 results, and Singapore&#x2019;s Grade 4 performance was used as an international benchmark to contextualise domain-specific patterns of mathematics achievement. This comparison is consistent with TIMSS procedures, as both cohorts were assessed using the same Grade 4 mathematics framework and instruments, calibrated on a common international scale. Weighted descriptive statistics were computed in accordance with International Association for the Evaluation of Educational Achievement (IEA) guidelines to account for TIMSS&#x2019;s two-stage stratified cluster sampling design. Sampling weights were applied to ensure nationally representative estimates and to correct for unequal probabilities of selection and non-response, thereby reducing bias in cross-national comparisons (
                    <xref ref-type="bibr" rid="ref29">Siegel &amp; Foy, 2024</xref>). All analyses incorporated the full set of plausible values, and estimates were combined using Rubin's rules in accordance with TIMSS technical procedures. To assess differences in performance across content and cognitive domains, weighted mean scores were compared and tested for statistical significance at the 1% level to account for multiple comparisons. In addition to significance testing, effect sizes (Cohen&#x2019;s 
                    <italic toggle="yes">d</italic>) were calculated to evaluate the practical magnitude of observed differences between domains and between South Africa and Singapore. Variance estimation incorporated the TIMSS replicate weights to account for the two-stage stratified cluster sampling design. Standard errors and confidence intervals were computed using these replicate weights, ensuring appropriate estimation of sampling variability. The combined use of statistical significance and effect size estimation enabled a more substantively meaningful interpretation of performance gaps, beyond reliance on mean differences alone.</p>
                <p>Although TIMSS 2023 collects extensive contextual information through learner, teacher, school, and curriculum questionnaires, the present study prioritised a diagnostic comparison of domain-specific achievement patterns. Consequently, contextual variables such as socioeconomic status, language of instruction, school resources, and teacher characteristics were not incorporated into multivariate or multilevel models in the main analysis. The exclusion of these variables from inferential modelling was deliberate, given that the primary objective was to identify and quantify achievement gaps across content and cognitive domains rather than to estimate explanatory or causal effects. Instead, these variables were used interpretively in the discussion to contextualise the observed performance trends. This analytic decision reflects a staged research logic: descriptive domain disaggregation precedes explanatory modelling. Such an approach is consistent with established practices in large-scale assessment research, where descriptive diagnostics and explanatory modelling are viewed as complementary rather than competitive strategies (
                    <xref ref-type="bibr" rid="ref40">OECD, 2019</xref>; 
                    <xref ref-type="bibr" rid="ref42">Rutkowski &amp; Delandshere, 2016</xref>).</p>
                <p>To ensure appropriate estimation of statistical uncertainty, all achievement comparisons were based on the full set of TIMSS plausible values. Estimates were combined using Rubin&#x2019;s rules, as prescribed in IEA technical documentation, to account for measurement imputation variability inherent in plausible value methodology. Variance estimation was conducted in accordance with IEA technical guidelines for complex survey data. Weighted means were calculated using student sampling weights, and standard errors were derived using the TIMSS replicate weights to account for the two-stage stratified cluster sampling design.</p>
                <p>Statistical inference was undertaken using these replicate-based variance estimates. Effect sizes (Cohen&#x2019;s 
                    <italic toggle="yes">d</italic>) were reported alongside significance tests to provide an educationally meaningful interpretation of observed differences beyond statistical significance alone. Confidence intervals are reported in the supplementary materials to enhance transparency and facilitate replication of domain-level comparisons.</p>
            </sec>
            <sec id="sec16">
                <title>2.4 Ethical Considerations</title>
                <p>This study is based on secondary analysis of TIMSS 2023 restricted-use datasets provided by the IEA. The data contain no personal identifiers and were collected under strict international ethical protocols during the original administration. Because this research involved secondary analysis of anonymised data, no institutional ethics approval was required. There was no formal request to use the dataset from the IEA since the data is available in the public domain, and all analyses adhered to its guidelines for responsible data use.</p>
            </sec>
        </sec>
        <sec id="sec17" sec-type="results">
            <title>3. Results</title>
            <p>This section presents the empirical results addressing Research Questions 1 and 2. Research Question 1 examines South Africa&#x2019;s achievement profile across TIMSS mathematics content domains, while Research Question 2 examines achievement across cognitive domains. Research Question 3, concerning curriculum and pedagogical implications, is addressed in the Discussion section. For the purposes of this study, an &#x201c;achievement gap&#x201d; is operationally defined as (a) the scale score distance from the TIMSS international centre point of 500, and (b) the benchmark scale score difference between South African Grade 5 learners and Singaporean Grade 4 learners assessed on the same TIMSS Grade 4 mathematics framework. Effect sizes (Cohen&#x2019;s 
                <italic toggle="yes">d</italic>) are used to indicate the magnitude of these benchmark disparities.</p>
            <sec id="sec18">
                <title>3.1 Content Domain Achievement</title>
                <p>In response to Research Question 1, the analysis reveals that South African Grade 5 learners perform substantially below Singaporean Grade 4 learners across all three TIMSS content domains: Number, Measurement and Geometry, and Data. These differences constitute substantial achievement gaps across all content domains rather than isolated performance variations. Relative to the TIMSS international centre point (500), South Africa&#x2019;s mean performance in each content domain reflects a substantial negative deviation, indicating systemic underperformance rather than isolated topic weaknesses. The largest benchmark achievement gap is observed in Measurement and Geometry. This domain exhibits the greatest scale score distance between South Africa and Singapore and the largest effect size (
                    <italic toggle="yes">d</italic> = 2.66), indicating an extremely large practical disparity. 
                    <xref ref-type="table" rid="T1">
Table 1</xref> reports the weighted mean scores and effect sizes for South African Grade 5 learners and Singaporean Grade 4 learners across the three TIMSS mathematics content domains.</p>
                <table-wrap id="T1" orientation="portrait" position="float">
                    <label>
Table 1. </label>
                    <caption>
                        <title>Mean Scale Scores and Effect Sizes in Mathematics Content Domains, TIMSS 2023</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Content domain</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Singapore (Grade 4)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
South Africa (Grade 5)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Gap</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
Cohen&#x2019;s 
                                    <italic toggle="yes">d</italic>
</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Numbers</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">613</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">362</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">251</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.51</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Measurement &amp; Geometry</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">619</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">353</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">266</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.66</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Data</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">616</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">362</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">254</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.54</td>
                            </tr>
                        </tbody>
                    </table>
                    <table-wrap-foot>
                        <p>

                            <italic toggle="yes">Note.</italic> All estimates are weighted. Standard errors and confidence intervals were computed using TIMSS plausible values and replicate weights and are reported in the supplementary materials.</p>
                    </table-wrap-foot>
                </table-wrap>
                <p>All three domains reveal large effect sizes (
                    <italic toggle="yes">d</italic> &gt; 2.5), signifying deep and systematic achievement gaps across the mathematics curriculum. The concentration of the largest gaps in measurement and geometry indicates that spatial reasoning, geometric visualisation, and conceptual understanding in this domain represent the most acute areas of weakness within the South African achievement profile. The characteristics of this gap indicate constraints in students' capacity to interact with spatial and relational concepts that necessitate conceptual integration rather than mere procedural recall. Importantly, while performance is low across domains, the relative ordering of domain means indicates that the gap is not uniform; it is most severe in Measurement and Geometry and comparatively less pronounced, though still substantial, in Number and Data.</p>
            </sec>
            <sec id="sec19">
                <title>3.2 Cognitive Domain Achievement</title>
                <p>Addressing Research Question 2, the results indicate that South African learners demonstrate markedly lower performance than Singaporean learners across all three cognitive domains: knowing, applying, and reasoning. These results reveal pronounced cognitive achievement gaps across all levels of mathematical thinking. The Knowing domain exhibits the largest cognitive achievement gap, with an effect size of 
                    <italic toggle="yes">d</italic> = 2.67. This domain captures factual recall, procedural fluency, and basic computational knowledge. The magnitude of this disparity indicates a profound benchmark gap in foundational mathematical knowledge.</p>
                <table-wrap id="T2" orientation="portrait" position="float">
                    <label>
Table 2. </label>
                    <caption>
                        <title>Mean Scale Scores and Effect Sizes in Mathematics Cognitive Domains, TIMSS 2023.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Cognitive domain</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Singapore (Grade 4)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
South Africa (Grade 5)</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Gap</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
Cohen&#x2019;s 
                                    <italic toggle="yes">d</italic>
</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Knowing</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">624</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">357</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">267</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.67</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Applying</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">615</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">366</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">249</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.49</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Reasoning</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">609</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">363</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">246</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.46</td>
                            </tr>
                        </tbody>
                    </table>
                    <table-wrap-foot>
                        <p>

                            <italic toggle="yes">Note.</italic> All estimates are weighted. Standard errors and confidence intervals were computed using TIMSS plausible values and replicate weights and are reported in the supplementary materials.</p>
                    </table-wrap-foot>
                </table-wrap>
                <p>While performance in the applying domain (mean = 366) is relatively stronger than in knowing or reasoning within the South African profile, it remains substantially below both the international centre point and the Singapore benchmark. This indicates that even where relative strengths exist, significant achievement gaps persist when evaluated against international standards. This pattern indicates a differentiated achievement profile in which routine procedural engagement is comparatively more accessible than foundational fluency or non-routine reasoning. The reasoning domain also reflects a large achievement gap relative to Singapore. The combined pattern across cognitive domains indicates that foundational knowledge deficits are accompanied by substantial limitations in higher-order reasoning, with the largest disparity concentrated at the most basic cognitive level (knowing). This layered structure of gaps suggests that weaknesses in foundational knowledge constrain progression to more complex forms of mathematical reasoning, which may ultimately hinder students' overall mathematical performance and their ability to solve advanced problems effectively.</p>
            </sec>
            <sec id="sec20">
                <title>3.3 Item-Level Illustrations</title>
                <p>To further illustrate the domain-specific patterns identified in Research Questions 1 and 2, selected released TIMSS items are used to demonstrate how content and cognitive disparities manifest at the task level. In the Knowing domain (Number), more than 90 percent of Singaporean learners correctly recalled basic multiplication facts, compared to fewer than 40 percent of South African learners. This example illustrates a substantial foundational achievement gap in procedural fluency. In the applying domain (data), routine tasks such as interpreting a simple bar chart were accessible to a proportion of South African learners, indicating some competence in structured and familiar problem contexts. However, in the reasoning domain (geometry), where items required multi-step reasoning with angle relationships, fewer than 20 percent of South African learners responded correctly, compared with a majority of Singaporean learners. This reflects a pronounced achievement gap in tasks requiring conceptual reasoning and spatial integration. These illustrative items reinforce the quantitative results by showing that the largest cognitive and content disparities are concentrated in foundational recall and spatial reasoning tasks requiring conceptual integration rather than routine execution alone. Collectively, these examples highlight how curriculum exposure and classroom instructional practices shape learners&#x2019; preparedness to engage with domain-specific cognitive demands.</p>
            </sec>
            <sec id="sec21">
                <title>3.4 Visualising the Gaps</title>
                <p>
                    <xref ref-type="fig" rid="f2">
Figure 2</xref> illustrates the comparative performance of South African Grade 5 and Singaporean Grade 4 learners across the three TIMSS content domains. The visualisation confirms the consistently lower achievement of South African learners, with the largest gap observed in Measurement and Geometry. This pattern aligns with the tabulated results and underscores the concentration of content-domain disparities in spatial reasoning and geometric understanding. By presenting scale score distances graphically, 
                    <xref ref-type="fig" rid="f2">
Figure 2</xref> highlights the distribution and magnitude of domain-specific achievement gaps rather than suggesting uniform underperformance across all content areas.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>
Figure 2. </label>
                    <caption>
                        <title>Comparative performance of South African Grade 5 and Singaporean Grade 4 learners across TIMSS content domains (Numbers, Measurement &amp; Geometry, and Data).</title>
                    </caption>
                    <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/196841/aecdd10d-03cb-414d-986a-a75288a21b8b_figure2.gif"/>
                </fig>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>
Figure 3. </label>
                    <caption>
                        <title>Comparative performance of South African Grade 5 and Singaporean Grade 4 learners across TIMSS cognitive domains (Knowing, Applying, and Reasoning).</title>
                    </caption>
                    <graphic id="gr3" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/196841/aecdd10d-03cb-414d-986a-a75288a21b8b_figure3.gif"/>
                </fig>
                <p>
                    <xref ref-type="fig" rid="f3">
Figure 3</xref> presents the comparative performance of South African Grade 5 and Singaporean Grade 4 learners across the TIMSS cognitive domains. The figure clearly indicates that the widest cognitive disparity occurs in the Knowing domain, reflecting substantial weaknesses in foundational fluency. Although the Applying domain appears relatively stronger within the South African profile, it remains substantially below benchmark levels, indicating that relative strength does not imply adequacy relative to international expectations. Together, the figures visually reinforce the empirical finding that achievement gaps are systematic, domain-specific, and concentrated in foundational knowledge and spatial reasoning.</p>
            </sec>
            <sec id="sec22">
                <title>3.5 Summary of Results</title>
                <p>Synthesising the results addressing Research Questions 1 and 2, the results highlight three critical insights into South African learners&#x2019; mathematics achievement. First, geometry and spatial reasoning continue to represent the weakest domain, pointing to enduring structural gaps in both curriculum design and teacher preparation. Second, the severity of deficits in foundational knowledge, as indicated in the knowing domain, hinders learners' advancement to higher-order reasoning tasks. Third, although learners demonstrate relatively positive learner achievement in the applying domain, this potential remains constrained by the absence of solid basic skills and limited opportunities for reasoning skills, which prevents the development of sustained mathematical learner achievement. These results collectively indicate that achievement gaps are not uniform but are concentrated in foundational knowledge and spatial reasoning domains. The benchmark comparison with Singapore highlights the magnitude of these domain-specific disparities and provides a structured reference point for interpreting their educational significance.</p>
            </sec>
        </sec>
        <sec id="sec23" sec-type="discussion">
            <title>4. Discussion</title>
            <p>This section addresses Research Question 3 by interpreting domain-specific and cognitive performance patterns in relation to curriculum alignment, pedagogy, and teacher professional development and outlining implications for improving mathematics achievement in South Africa. The TIMSS 2023 results reaffirm that South African Grade 5 learners perform substantially below the international mathematics benchmark, achieving a mean score of 362, compared with the TIMSS centre point of 500. However, the more analytically significant finding lies not in the overall mean alone but in the concentration and distribution of achievement gaps across specific content and cognitive domains. These comparisons are interpreted within a common TIMSS measurement framework and do not imply equivalence in grade-level progression; rather, they provide a structured benchmark for examining domain-specific achievement patterns. Disaggregating achievement reveals that the largest content-domain gap occurs in measurement and geometry, while the largest cognitive-domain gap occurs in knowing. These results are consistent with prior research highlighting persistent domain-specific weaknesses rather than uniform underperformance across mathematics (
                <xref ref-type="bibr" rid="ref17">Maqoqa, 2024</xref>; 
                <xref ref-type="bibr" rid="ref32">Taylor, 2021</xref>). These domain-specific disparities provide a more precise diagnostic basis for interpreting systemic challenges.</p>
            <sec id="sec10.6">
                <title>4.1 Content-Domain Patterns and Foundational Gaps</title>
                <p>The largest benchmark gap was observed in measurement and geometry, where effect sizes indicated a large disparity relative to Singapore. This result indicates that spatial reasoning, geometric visualisation, and conceptual understanding constitute the most acute areas of weakness in the South African achievement profile. While performance in Number and Data also remains substantially below international expectations, the relative ordering of domain means indicates that underperformance is not uniform across content areas. The disproportionate weakness in measurement and geometry is consistent with national studies identifying geometry as a longstanding area of difficulty in South African classrooms (
                    <xref ref-type="bibr" rid="ref17">Maqoqa, 2024</xref>; 
                    <xref ref-type="bibr" rid="ref32">Taylor, 2021</xref>). These results do not indicate a lack of geometry instruction; instead, they suggest a possible misalignment between the intended curriculum and its implementation in the classroom. Consistent with Curriculum Alignment Theory (
                    <xref ref-type="bibr" rid="ref26">Porter, 2002</xref>), gaps may emerge when intended learning outcomes emphasise conceptual reasoning, while implemented instruction remains focused on procedural routines. This interpretation extends prior research by demonstrating how such misalignments manifest empirically in large-scale assessment data. Strengthening geometry education therefore requires approaches that prioritise visualisation, contextualisation, and progressive abstraction. The empirical evidence indicates that targeted improvement in spatial reasoning should be treated as a priority domain rather than as one component among equally distributed weaknesses.</p>
            </sec>
            <sec id="sec10.7">
                <title>4.2 Cognitive-Domain Performance and Learning Progression</title>
                <p>In further addressing research question 3, the largest cognitive achievement gap was observed in the Knowing domain. South African learners scored substantially lower than Singaporean learners (357 versus 624), with a very large effect size. Because the Knowing domain captures factual recall and procedural fluency, this result reflects a pronounced gap in foundational knowledge. In the TIMSS framework, foundational fluency is the basis for moving on to tasks that require applying and reasoning. The magnitude of the knowing gap is therefore associated with the overall cognitive achievement profile. Although applying represents a relatively stronger domain within the South African distribution, it remains substantially below international benchmarks. These results indicate that learners may engage with familiar, structured procedures, but their capacity to generalise, justify, or reason abstractly appears limited in relation to their foundational fluency levels. This pattern is consistent with prior results that procedural competence without conceptual depth does not reliably support sustained reasoning development (
                    <xref ref-type="bibr" rid="ref38">Mullis &amp; Martin, 2017</xref>). The combined domain pattern therefore reflects a vertically constrained achievement profile, in which weaknesses at the most basic cognitive level are associated with reduced engagement with higher-order reasoning. These results extend prior work by demonstrating how domain-specific disparities in foundational knowledge correspond with differentiated cognitive performance patterns across applying and reasoning domains.</p>
            </sec>
            <sec id="sec10.8">
                <title>4.3 Curriculum Alignment and Cross-National Insights</title>
                <p>In relation to Research Question 3, the benchmark comparison with Singapore illustrates how curriculum alignment may be associated with domain-specific achievement distributions. Singapore&#x2019;s tightly sequenced spiral curriculum and systematic use of the Concrete&#x2013;Pictorial&#x2013;Abstract progression support continuity across content and cognitive domains (
                    <xref ref-type="bibr" rid="ref13">Leong et al., 2015</xref>). This aligns with international evidence that coherent curriculum progression and structured pedagogical scaffolding are associated with balanced development across knowledge, application, and reasoning domains. In contrast, South Africa&#x2019;s curriculum has been described as broad in scope, with limited instructional time for consolidation of foundational domains (
                    <xref ref-type="bibr" rid="ref17">Maqoqa, 2024</xref>; 
                    <xref ref-type="bibr" rid="ref32">Taylor, 2021</xref>). A lack of sufficient depth may constrain cumulative mastery, especially in domains like geometry that require sustained conceptual development. Curriculum Alignment Theory (
                    <xref ref-type="bibr" rid="ref26">Porter, 2002</xref>) indicates that meaningful gains are most likely when the intended, implemented, and assessed curricula are coherently aligned. Therefore, partial misalignments between policy aspirations, classroom practice, and assessment demands may be associated with the observed achievement gaps in knowing, measurement, and geometry. These results extend prior work by indicating that such misalignments correspond with clearly differentiated domain-specific achievement patterns rather than uniform underperformance. This interpretation situates the present results within a broader body of curriculum alignment research, highlighting the consequences of systemic incoherence. Importantly, the comparison does not imply direct causal attribution but rather provides a structured reference point for interpreting domain-specific disparities.</p>
            </sec>
            <sec id="sec10.9">
                <title>4.4 Interpreting Singapore as a Benchmark</title>
                <p>In relation to Research Question 3, Singapore&#x2019;s performance provides a valuable benchmark for examining curriculum coherence and learning progression, but it is essential to recognise the contextual conditions under which it was achieved. Singapore's education system functions within a highly structured and competitive framework, characterised by strict central curriculum oversight, selective teacher recruitment, rigorous professional training, and elevated societal expectations concerning academic performance (
                    <xref ref-type="bibr" rid="ref39">Ng, 2017</xref>). These features are accompanied by early differentiation and sustained academic pressure, which, although associated with high achievement, may also generate stress and equity concerns that are not fully captured in large-scale assessment data (
                    <xref ref-type="bibr" rid="ref44">Tan, 2018</xref>). The value of Singapore as a comparator resides not in direct policy transplantation but in the identification of transferable principles, including curriculum coherence, focused content progression, and pedagogical scaffolding (
                    <xref ref-type="bibr" rid="ref41">Deng et al., 2013</xref>). For South Africa, such an approach implies selective and context-sensitive adaptation rather than wholesale adoption, taking into account systemic capacity, linguistic diversity, and resource constraints (
                    <xref ref-type="bibr" rid="ref46">Spaull &amp; Kotze, 2015</xref>). The benchmark therefore functions analytically, providing scale and distributional reference rather than prescriptive replication.</p>
            </sec>
            <sec id="sec10.10">
                <title>4.5 Implications for Policy and Practice</title>
                <p>In relation to Research Question 3, these results point to a dual challenge. First, persistent weaknesses in number sense, procedural fluency, and spatial reasoning are evident and warrant focused instructional attention. Second, learners&#x2019; relative strength in applying may represent a potential entry point for developing higher-order reasoning, provided that foundational knowledge is more consistently supported and instructional practices are orientated toward conceptual engagement. Addressing these challenges requires coordinated consideration across curriculum design, teacher professional development, and classroom practice. These results align with existing research, which emphasises the importance of integrating foundational skill development with opportunities for conceptual reasoning. Reducing class sizes and increasing specialist support remain important considerations, but they may face short-term financial or political constraints, which could hinder their implementation in the immediate future. Targeted diagnostic assessment, structured small-group instruction, and school-based professional learning communities offer contextually feasible strategies that may support incremental improvement in student learning outcomes within the constraints of existing educational contexts and resources.</p>
            </sec>
            <sec id="sec10.11">
                <title>4.6 Pillars for Strengthening Foundational Mathematics</title>
                <p>

                    <bold>Pillar 1: Strengthening Foundations through Diagnostic Teaching</bold>
                </p>
                <p>The pronounced deficits in the knowing domain indicate that many learners progress without secure mastery of basic mathematical foundations. Early diagnostic assessment in the foundation and intermediate phases can help identify learning gaps before they compound. Adapted support programmes, combined with teacher development in formative assessment and error analysis, can translate diagnostic information into responsive classroom practices.</p>
                <p>

                    <bold>Pillar 2: Enhancing Geometry and Spatial Reasoning</bold>
                </p>
                <p>Given the consistently poor performance in measurement and geometry, professional development should prioritise visual, experiential, and problem-based approaches to spatial reasoning. In resource-constrained contexts, locally available materials and outdoor measurement activities can provide effective entry points, supplemented over time by affordable digital visualisation tools.</p>
                <p>

                    <bold>Pillar 3: Leveraging Application to Cultivate Reasoning</bold>
                </p>
                <p>The comparatively stronger performance in applying suggests an opportunity to scaffold reasoning through contextually relevant problem-solving tasks. Supporting teachers as they design inquiry-based lessons that emphasise explanation, justification, and collaborative reasoning can help bridge the gap between application and higher-order thinking.</p>
            </sec>
            <sec id="sec10.12">
                <title>4.7 System-Level Coherence</title>
                <p>The effectiveness of these priorities depends on systemic alignment across phases. Consistent with Curriculum Alignment Theory, sustainable improvement requires coherence between curriculum intent, classroom enactment, assessment design, and teacher preparation. The domain-specific gaps identified in this study underscore that improvement is unlikely to result from isolated interventions. Instead, targeted strengthening of foundational fluency and spatial reasoning, supported by aligned curriculum and professional development structures, is necessary to shift the achievement profile. When curriculum goals, pedagogical support, and assessment expectations converge, foundational mathematics instruction can move beyond procedural compliance toward deeper conceptual engagement, contributing to more equitable learning trajectories in alignment with Sustainable Development Goal 4.</p>
            </sec>
        </sec>
        <sec id="sec30" sec-type="conclusion">
            <title>5. Conclusion</title>
            <p>This study examined South African Grade 5 learners&#x2019; mathematics achievement in TIMSS 2023, disaggregated by content and cognitive domains and benchmarked against Singapore&#x2019;s Grade 4 performance on the same TIMSS Grade 4 mathematics framework. Rather than relying solely on overall mean comparisons, the study identified the distribution and concentration of achievement gaps across specific domains. Consistent with earlier research (
                <xref ref-type="bibr" rid="ref16">Mabena et al., 2021</xref>; 
                <xref ref-type="bibr" rid="ref32">Taylor, 2021</xref>), the results confirm that South Africa continues to perform substantially below international benchmarks. However, the more diagnostic contribution of this study lies in demonstrating that the largest content-domain gap occurs in measurement and geometry, while the largest cognitive-domain gap occurs in knowing. These domain-specific disparities are most evident in foundational procedural fluency and spatial reasoning within the national achievement profile.</p>
            <p>The results further indicate that South African learners demonstrate comparatively stronger performance in the 'Applying' domain than in 'Knowing' or 'Reasoning' within their internal distribution. Although this relative positioning does not reduce the magnitude of the benchmark gap, it points to structured procedural engagement as a potential area of instructional focus, particularly in enhancing learners' foundational procedural fluency and spatial reasoning skills. Interpreted through the TIMSS domain framework and Curriculum Alignment Theory (
                <xref ref-type="bibr" rid="ref26">Porter, 2002</xref>), the results indicate that achievement gaps are not uniformly distributed but are concentrated in foundational and spatial domains that are critical for progression to higher-order reasoning. The study does not identify causal mechanisms; instead, it offers a structured diagnostic profile that can guide focused curriculum and pedagogical decisions.</p>
            <p>Building on these empirical insights, three interrelated priorities emerge: strengthening foundational fluency through early diagnostic assessments and targeted support, prioritising spatial reasoning and conceptual geometry instruction, and supporting the progression from application to reasoning through tasks requiring explanation and justification. Sustainable improvement is unlikely to result from isolated interventions. Consistent with Curriculum Alignment Theory, meaningful gains are more likely when curriculum design, assessment expectations, classroom enactment, and teacher development operate coherently. Therefore, the results draw attention to the importance of domain-specific reform efforts grounded in empirical achievement profiles rather than generalised prescriptions. By aligning curriculum intent with classroom practice and assessment demands, South Africa is better positioned to strengthen foundational mathematics instruction in ways that promote deeper conceptual engagement and contribute to more equitable learning trajectories in line with Sustainable Development Goal 4.</p>
        </sec>
        <sec id="sec31">
            <title>6. Limitations and Future Research</title>
            <p>While this study offers significant insights into Grade 5 mathematics achievement in South Africa, it is important to acknowledge several limitations. First, the analysis relied on secondary, cross-sectional data from TIMSS 2023, which precludes causal inference. Consequently, the associations identified between content and cognitive domains should be regarded as descriptive rather than causal relationships. While such evidence is valuable for diagnostic and policy-relevant comparison, it cannot on its own establish mechanisms of effect.</p>
            <p>Second, domain-specific subscales, particularly those for knowing and measurement and geometry, are based on fewer test items than the overall mathematics scale, which may reduce measurement precision. Although the use of plausible values enhances the reliability of population-level estimates, it also introduces statistical uncertainty that should be considered when interpreting effect sizes and domain-level differences.</p>
            <p>Third, the use of large-scale assessment data limits direct examination of classroom processes, instructional strategies, and learner experiences that shape achievement. Without qualitative or longitudinal evidence, it is not possible to observe how curriculum intentions are enacted in practice or how learners engage with mathematical tasks over time. Future research would benefit from mixed-methods designs that integrate TIMSS data with classroom observations, teacher interviews, and learner case studies to deepen explanatory insight.</p>
            <p>Finally, although TIMSS sampling ensures national representativeness, it may under-represent learners in marginalised or remote contexts characterised by multigrade teaching, language-of-instruction challenges, and severe resource constraints. Targeted studies and oversampling in such settings could provide a more nuanced perspective on how structural inequalities shape foundational learning trajectories.</p>
            <p>Future research should therefore extend this work through longitudinal, multilevel, and design-based studies that examine how targeted interventions in foundational fluency and spatial reasoning influence learner trajectories over time. Experimental or quasi-experimental evaluations of diagnostic teaching approaches and geometry-focused professional development would provide stronger evidence regarding the mechanisms underlying domain-specific improvement. By integrating descriptive large-scale assessment diagnostics with explanatory and intervention research, a more comprehensive evidence base can be developed to inform curriculum reform, teacher education, and policy innovation aimed at improving mathematics achievement and equity in South Africa.</p>
        </sec>
        <sec id="sec37">
            <title>Author Contributions</title>
            <p>The author, Mathelela Steyn Mokgwathi, is responsible for the conceptualisation, data curation, formal analysis, investigation, methodology, project administration, resources, software, supervision, validation, visualisation, and writing of the original draft, as well as the writing, review, and editing processes.</p>
        </sec>
        <sec id="sec32">
            <title>Declaration of AI Use</title>
            <p>The author affirms that no generative artificial intelligence tools (such as ChatGPT or similar models) were used to produce the academic content, analysis, or interpretations presented in this manuscript. QuillBot (premium) was employed solely for grammar and spelling checks. The author personally reviewed and edited the final manuscript and takes full responsibility for its content and conclusions.</p>
        </sec>
    </body>
    <back>
        <sec id="sec35" sec-type="data-availability">
            <title>Data Availability</title>
            <p>The data that support the results of this article are derived from the publicly accessible Trends in International Mathematics and Science Study (TIMSS) 2023 database, available through the International Association for the Evaluation of Educational Achievement (IEA) at 
                <ext-link ext-link-type="uri" xlink:href="https://timssandpirls.bc.edu">https://timssandpirls.bc.edu</ext-link>. Derived and processed data underlying the statistical analyses, including the variables used to generate the tables, figures, and descriptive statistics presented in this article, are openly available in the Zenodo repository (
                <xref ref-type="bibr" rid="ref37">Mokgwathi, 2025</xref>) at 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.17379412">https://doi.org/10.5281/zenodo.17379412</ext-link>, under the 
                <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/deed.en">Creative Commons Attribution 4.0 International</ext-link> (CC BY 4.0) licence. These files include anonymised values underlying the means, standard deviations, and effect sizes, as well as detailed variable descriptions and methodological notes supporting replication of the comparative analyses between South Africa (Grade 5) and Singapore (Grade 4). No ethical approval or participant consent was required, as all data originate from secondary sources that are publicly available through the IEA TIMSS 2023 repository.</p>
            <sec id="sec36">
                <title>Extended Data</title>
                <p>Supplementary materials supporting this article are available in the Zenodo repository at 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.17379412">https://doi.org/10.5281/zenodo.17379412</ext-link> (
                    <xref ref-type="bibr" rid="ref37">Mokgwathi, 2025</xref>), under the 
                    <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/deed.en">CC BY 4.0 license</ext-link>. These include:</p>
                <p>
TIMSS2023_SGvZA_Derived_Dataset.xlsx (domain-level statistics and effect sizes),</p>
                <p>
TIMSS2023_Variable_Descriptions.xlsx (variable definitions and sources), and</p>
                <p>
TIMSS2023_Supplementary_Notes.pdf (methodological documentation).</p>
            </sec>
        </sec>
        <ack>
            <title>Acknowledgements</title>
            <p>The author acknowledges the International Association for the Evaluation of Educational Achievement (IEA) for making the Trends in International Mathematics and Science Study (TIMSS) 2023 data publicly available and appreciates its continued commitment to advancing educational research globally.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref3">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bhagwonparsadh</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pule</surname>
                            <given-names>K</given-names>
                        </name>
</person-group>:
                    <article-title>The Effects of Educators&#x2019; Mathematics Pedagogical Content Knowledge on The Mentoring of Grade 12 Students Using Problem-Based Learning.</article-title>
                    <source>

                        <italic toggle="yes">Journal of Culture and Values in Education.</italic>
</source>
                    <year>2024</year>;<volume>7</volume>(<issue>1</issue>):<fpage>99</fpage>&#x2013;<lpage>117</lpage>.
                    <pub-id pub-id-type="doi">10.46303/jcve.2024.7</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Choy</surname>
                            <given-names>BH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dindyal</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <chapter-title>The Singapore mathematics curriculum: Influences and confluences.</chapter-title>
                    <source>

                        <italic toggle="yes">Research Handbook on Curriculum and Education.</italic>
</source>
                    <publisher-name>Edward Elgar Publishing</publisher-name>;<year>2024</year>; pp.<fpage>534</fpage>&#x2013;<lpage>549</lpage>.
                    <pub-id pub-id-type="doi">10.4337/9781802208542.00048</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref41">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="editor">

                        <name name-style="western">
                            <surname>Deng</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gopinathan</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lee</surname>
                            <given-names>CK-E</given-names>
                        </name>
</person-group>, editors:
                    <source>

                        <italic toggle="yes">Globalization and the Singapore curriculum: From policy to classroom.</italic>
</source>
                    <publisher-name>Springer</publisher-name>;<year>2013</year>.
                    <pub-id pub-id-type="doi">10.1007/978-981-4451-57-4</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <mixed-citation publication-type="book">
                    <collab>Department of Basic Education</collab>:
                    <source>

                        <italic toggle="yes">South Africa 2023 Trends in International Mathematics and Science Study (TIMSS) Highlights Report.</italic>
</source>
                    <publisher-loc>Pretoria</publisher-loc>:
                    <publisher-name>Department of Basic Education</publisher-name>;<year>2024</year>.</mixed-citation>
            </ref>
            <ref id="ref10">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Grundy</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Beyond guaranteed outcomes: Creating a discourse for educational praxis.</article-title>
                    <source>

                        <italic toggle="yes">Australian Journal of Education.</italic>
</source>
                    <year>1992</year>;<volume>36</volume>(<issue>2</issue>):<fpage>157</fpage>&#x2013;<lpage>169</lpage>.
                    <pub-id pub-id-type="doi">10.1177/000494419203600204</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref43">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="editor">

                        <name name-style="western">
                            <surname>Findell</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Swafford</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kilpatrick</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Adding it up: Helping children learn mathematics.</italic>
</source>
                    <publisher-name>National Academies Press</publisher-name>;<year>2001</year>.</mixed-citation>
            </ref>
            <ref id="ref13">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Leong</surname>
                            <given-names>YH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ho</surname>
                            <given-names>WK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>LP</given-names>
                        </name>
</person-group>:
                    <article-title>Concrete-Pictorial-Abstract: Surveying its origins and charting its future.</article-title>
                    <source>

                        <italic toggle="yes">The Mathematics Educator.</italic>
</source>
                    <year>2015</year>;<volume>16</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>18</lpage>.
                    <ext-link ext-link-type="uri" xlink:href="http://math.nie.edu.sg/ame/matheduc/tme/tmeV16_1/TME16_1.pdf">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref14">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Low</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wong</surname>
                            <given-names>LF</given-names>
                        </name>
</person-group>:
                    <chapter-title>Using connecting mathematical tasks for coherence, connections and continuity.</chapter-title>In
                    <person-group person-group-type="editor">

                        <name name-style="western">
                            <surname>Choy</surname>
                            <given-names>BH</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Toh</surname>
                            <given-names>TL</given-names>
                        </name>
</person-group>, editors.
                    <source>Mathematics&#x2014;connection and beyond: Yearbook 2020 Association of Mathematics Educators.</source>
                    <publisher-name>World Scientific</publisher-name>;<year>2021</year>; pp.<fpage>95</fpage>&#x2013;<lpage>120</lpage>.
                    <pub-id pub-id-type="doi">10.1142/9789811236983_0006</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lutfi</surname>
                            <given-names>JS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Dasari</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Research trends on learning mathematics with the CPA (Concrete-Pictorial-Abstract) approach.</article-title>
                    <source>

                        <italic toggle="yes">Prisma.</italic>
</source>
                    <year>2024</year>;<volume>13</volume>(<issue>1</issue>):<fpage>103</fpage>&#x2013;<lpage>112</lpage>.
                    <pub-id pub-id-type="doi">10.35194/jp.v13i1.3947</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref16">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mabena</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mokgosi</surname>
                            <given-names>PN</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ramapela</surname>
                            <given-names>SS</given-names>
                        </name>
</person-group>:
                    <article-title>Factors contributing to poor learner performance in mathematics: A case of selected schools in Mpumalanga province, South Africa.</article-title>
                    <source>

                        <italic toggle="yes">Problems of Education in the 21st Century.</italic>
</source>
                    <year>2021</year>;<volume>79</volume>(<issue>3</issue>):<fpage>451</fpage>&#x2013;<lpage>466</lpage>.
                    <pub-id pub-id-type="doi">10.33225/pec/21.79.451</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref17">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Maqoqa</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>:
                    <article-title>An exploration of learners&#x2019; understanding of Euclidean geometric concepts: A case study of secondary schools in the OR Tambo Inland District of the Eastern Cape.</article-title>
                    <source>

                        <italic toggle="yes">E-Journal of Humanities, Arts and Social Sciences.</italic>
</source>
                    <year>2024</year>;<volume>5</volume>(<issue>5</issue>):<fpage>658</fpage>&#x2013;<lpage>675</lpage>.
                    <pub-id pub-id-type="doi">10.38159/ehass.2024557</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref18">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Meier</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>West</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>Overcrowded classrooms&#x2013;the Achilles heel of South African education?</article-title>
                    <source>

                        <italic toggle="yes">South African Journal of Childhood Education.</italic>
</source>
                    <year>2020</year>;<volume>10</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>10</lpage>.
                    <pub-id pub-id-type="doi">10.4102/sajce.v10i1.617</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref19">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mensah</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Baidoo-Anu</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Towards quality and equitable education in South Africa: Unpacking the relationship between teacher factors, students&#x2019; socioeconomic background and mathematics achievements.</article-title>
                    <source>

                        <italic toggle="yes">Research in Mathematics.</italic>
</source>
                    <year>2022</year>;<volume>9</volume>(<issue>1</issue>):<fpage>2088645</fpage>.
                    <pub-id pub-id-type="doi">10.1080/27684830.2022.2088645</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref20">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Milne</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mhlolo</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>Lessons for South Africa from Singapore&#x2019;s gifted education&#x2013;A comparative study.</article-title>
                    <source>

                        <italic toggle="yes">South African Journal of Education.</italic>
</source>
                    <year>2021</year>;<volume>41</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="doi">10.15700/saje.v41n1a1839</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref37">
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mokgwathi</surname>
                            <given-names>MS</given-names>
                        </name>
</person-group>:
                    <data-title>Derived Data and Extended Tables: TIMSS 2023 South Africa&#x2013;Singapore Mathematics Comparison (1.0).</data-title>[Data set].
                    <source>

                        <italic toggle="yes">Zenodo.</italic>
</source>
                    <year>2025</year>.
                    <pub-id pub-id-type="doi">10.5281/zenodo.17379412</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref21">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Morony</surname>
                            <given-names>W</given-names>
                        </name>
</person-group>:
                    <chapter-title>Coherence in a Range of Mathematics Curriculum Reforms.</chapter-title>In:
                    <person-group person-group-type="editor">

                        <name name-style="western">
                            <surname>Shimizu</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vithal</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>, editors.
                    <source>

                        <italic toggle="yes">Mathematics Curriculum Reforms Around the World.</italic>
</source>
                    <publisher-loc>Cham</publisher-loc>:
                    <publisher-name>New ICMI Study Series. Springer</publisher-name>;<year>2023</year>.
                    <pub-id pub-id-type="doi">10.1007/978-3-031-13548-4_10</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref22">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mullis</surname>
                            <given-names>IVS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Martin</surname>
                            <given-names>MO</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Foy</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <source>

                        <italic toggle="yes">TIMSS 2019 International Results in Mathematics and Science.</italic>
</source>
                    <publisher-name>TIMSS &amp; PIRLS International Study Center</publisher-name>;<year>2020</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://timssandpirls.bc.edu/timss2019/international-results/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref38">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="editor">

                        <name name-style="western">
                            <surname>Mullis</surname>
                            <given-names>IVS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Martin</surname>
                            <given-names>MO</given-names>
                        </name>
</person-group>, editors:
                    <source>

                        <italic toggle="yes">TIMSS 2019 Assessment Frameworks.</italic>
</source>Retrieved from Boston College, TIMSS &amp; PIRLS International Study Center website;<year>2017</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://timssandpirls.bc.edu/timss2019/frameworks/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref39">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ng</surname>
                            <given-names>PT</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">Learning from Singapore: The power of paradoxes.</italic>
</source>
                    <publisher-name>Routledge</publisher-name>;<year>2017</year>.
                    <pub-id pub-id-type="doi">10.4324/9781315682914</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref40">
                <mixed-citation publication-type="book">
                    <collab>OECD</collab>:
                    <source>

                        <italic toggle="yes">PISA 2018 Assessment and Analytical Framework.</italic>
</source>
                    <publisher-loc>Paris</publisher-loc>:
                    <publisher-name>PISA, OECD Publishing</publisher-name>;<year>2019</year>.
                    <pub-id pub-id-type="doi">10.1787/b25efab8-en</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref23">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Peduk</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ates</surname>
                            <given-names>O</given-names>
                        </name>
</person-group>:
                    <article-title>Analysis of the science course curriculum objectives and high school entrance examination questions according to TIMSS framework.</article-title>
                    <source>

                        <italic toggle="yes">International Journal of Educational Methodology.</italic>
</source>
                    <year>2019</year>;<volume>5</volume>(<issue>3</issue>):<fpage>433</fpage>&#x2013;<lpage>449</lpage>.
                    <pub-id pub-id-type="doi">https://doi.org/10.12973/ijem.5.3.433</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref24">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Pertiwi</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wahidin</surname>
                            <given-names>W</given-names>
                        </name>
</person-group>:
                    <article-title>Are the mathematics textbooks for eighth-grade meet the Trends in International Mathematics and Science Study (TIMSS) 2019 mathematics framework?</article-title>
                    <source>

                        <italic toggle="yes">Edumatika: Jurnal Riset Pendidikan Matematika.</italic>
</source>
                    <year>2020</year>;<volume>3</volume>(<issue>2</issue>):<fpage>129</fpage>&#x2013;<lpage>135</lpage>.
                    <pub-id pub-id-type="doi">10.32939/ejrpm.v3i2.623</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref26">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Porter</surname>
                            <given-names>AC</given-names>
                        </name>
</person-group>:
                    <article-title>Measuring the content of instruction: Uses in research and practice.</article-title>
                    <source>

                        <italic toggle="yes">Educational Researcher.</italic>
</source>
                    <year>2002</year>;<volume>31</volume>(<issue>7</issue>):<fpage>3</fpage>&#x2013;<lpage>14</lpage>.
                    <pub-id pub-id-type="doi">10.3102/0013189X031007003</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref27">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Reddy</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hannan</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">TIMSS in South Africa: Making global research locally meaningful.</italic>
</source>
                    <publisher-loc>Pretoria</publisher-loc>:
                    <publisher-name>Human Sciences Research Council</publisher-name>;<year>2019</year>.</mixed-citation>
            </ref>
            <ref id="ref28">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Reynolds</surname>
                            <given-names>KA</given-names>
                        </name>
</person-group>:
                    <chapter-title>Updating the TIMSS instruments for describing contexts of student learning. </chapter-title>
                    <person-group person-group-type="editor">

                        <name name-style="western">
                            <surname>Davier</surname>
                            <given-names>M</given-names>
                            <prefix>von</prefix>
                        </name>

                        <name name-style="western">
                            <surname>Fishbein</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kennedy</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>, editors.
                    <source>

                        <italic toggle="yes">TIMSS 2023 Technical Report (Methods and Procedures).</italic>
</source>
                    <publisher-name>Boston College, TIMSS &amp; PIRLS International Study Center</publisher-name>;<year>2024</year>; pp.<fpage>2.1</fpage>&#x2013;<lpage>2.8</lpage>.
                    <ext-link ext-link-type="uri" xlink:href="https://timss2023.org/methods/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref42">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rutkowski</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Delandshere</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <article-title>Causal inferences with large-scale assessment data: Using a validity framework.</article-title>
                    <source>

                        <italic toggle="yes">Large-Scale Assessments in Education</italic>
</source>
                    <year>2016</year>;<volume>4</volume>(<issue>6</issue>).
                    <pub-id pub-id-type="doi">10.1186/s40536-016-0019-1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref29">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Siegel</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Foy</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <chapter-title>TIMSS sample design. </chapter-title>
                    <person-group person-group-type="editor">

                        <name name-style="western">
                            <surname>Davier</surname>
                            <given-names>M</given-names>
                            <prefix>von</prefix>
                        </name>

                        <name name-style="western">
                            <surname>Fishbein</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kennedy</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>, editors.
                    <source>

                        <italic toggle="yes">TIMSS 2023 Technical Report (Methods and Procedures).</italic>
</source>
                    <publisher-name>Boston College, TIMSS &amp; PIRLS International Study Center</publisher-name>;<year>2024</year>; pp.<fpage>3.1</fpage>&#x2013;<lpage>3.30</lpage>.
                    <pub-id pub-id-type="doi">10.6017/lse.tpisc.timss.rs7952</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref46">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Spaull</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kotze</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>Starting behind and staying behind in South Africa: The case of insurmountable learning deficits in mathematics.</article-title>
                    <source>

                        <italic toggle="yes">International Journal of Educational Development.</italic>
</source>
                    <year>2015</year>;<volume>41</volume>:<fpage>13</fpage>&#x2013;<lpage>24</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.ijedudev.2015.01.002</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref30">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tachie</surname>
                            <given-names>SA</given-names>
                        </name>
</person-group>:
                    <article-title>The challenges of South African teachers in teaching Euclidean geometry.</article-title>
                    <source>

                        <italic toggle="yes">International Journal of Learning, Teaching and Educational Research.</italic>
</source>
                    <year>2020</year>;<volume>19</volume>(<issue>8</issue>):<fpage>297</fpage>&#x2013;<lpage>312</lpage>.
                    <pub-id pub-id-type="doi">https://doi.org/10.26803/ijlter.19.8.16</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref44">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tan</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>Parental responses to education reform in Singapore, Shanghai and Hong Kong.</article-title>
                    <source>

                        <italic toggle="yes">Asia Pacific Education Review.</italic>
</source>
                    <year>2019</year>;<volume>20</volume>:<fpage>91</fpage>&#x2013;<lpage>99</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s12564-018-9571-4</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref32">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Taylor</surname>
                            <given-names>N</given-names>
                        </name>
</person-group>:
                    <article-title>The dream of Sisyphus: Mathematics education in South Africa.</article-title>
                    <source>

                        <italic toggle="yes">South African Journal of Childhood Education.</italic>
</source>
                    <year>2021</year>;<volume>11</volume>(<issue>1</issue>):<fpage>a911</fpage>.
                    <pub-id pub-id-type="doi">10.4102/sajce.v11i1.911</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref45">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Davier</surname>
                            <given-names>M</given-names>
                            <prefix>von</prefix>
                        </name>
</person-group>:
                    <chapter-title>TIMSS 2019 scaling methodology: Item Response Theory, population models, and linking across modes.</chapter-title>In
                    <person-group person-group-type="editor">

                        <name name-style="western">
                            <surname>Martin</surname>
                            <given-names>MO</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Davier</surname>
                            <given-names>M</given-names>
                            <prefix>von</prefix>
                        </name>

                        <name name-style="western">
                            <surname>Mullis</surname>
                            <given-names>IVS</given-names>
                        </name>
</person-group>, editors.
                    <source>

                        <italic toggle="yes">Methods and procedures: TIMSS 2019 Technical Report.</italic>
</source>
                    <publisher-name>TIMSS &amp; PIRLS International Study Center</publisher-name>;<year>2020</year>; pp.<fpage>11.1</fpage>&#x2013;<lpage>11.25</lpage>.
                    <ext-link ext-link-type="uri" xlink:href="https://timssandpirls.bc.edu/timss2019/methods/chapter-11.html">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref34">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Davier</surname>
                            <given-names>M</given-names>
                            <prefix>von</prefix>
                        </name>

                        <name name-style="western">
                            <surname>Kennedy</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Reynolds</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>TIMSS 2023 International Results in Mathematics and Science.</article-title>
                    <source>

                        <italic toggle="yes">Boston College, TIMSS &amp; PIRLS International Study Center.</italic>
</source>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.6017/lse.tpisc.timss.rs6460</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref35">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Y&#x0131;lmaz</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ay</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ayd&#x0131;n</surname>
                            <given-names>&#x015e;</given-names>
                        </name>
</person-group>:
                    <article-title>An Investigation of tasks in the mathematics textbooks and objectives in mathematics curriculum from 4th to 8th grade related with data content domain according to TIMSS 2019 cognitive domains.</article-title>
                    <source>

                        <italic toggle="yes">Cukurova University Faculty of Education Journal.</italic>
</source>
                    <year>2021</year>;<volume>50</volume>(<issue>2</issue>):<fpage>1397</fpage>&#x2013;<lpage>1436</lpage>.
                    <pub-id pub-id-type="doi">10.14812/cuefd.745164</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref36">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zuze</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Reddy</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Visser</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <source>

                        <italic toggle="yes">TIMSS 2015 Grade 9 national report: Understanding mathematics and achievement amongst Grade 9 learners in South Africa.</italic>
</source>
                    <publisher-name>Human Sciences Research Council</publisher-name>;<year>2018</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://www.iea.nl/sites/default/files/2019-04/TIMSS%202015%20Grade%209%20National%20Report%20South%20Africa.pdf">Reference Source</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report472940">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.196841.r472940</article-id>
            <title-group>
                <article-title>Reviewer response for version 3</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Ukobizaba</surname>
                        <given-names>Fidele</given-names>
                    </name>
                    <xref ref-type="aff" rid="r472940a1">1</xref>
                    <xref ref-type="aff" rid="r472940a2">2</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-1502-2395</uri>
                </contrib>
                <aff id="r472940a1">
                    <label>1</label>University of Rwanda College of Education, Rwamagana, Rwanda</aff>
                <aff id="r472940a2">
                    <label>2</label>African Centre of Excellence for Innovative Teaching and Learning Mathematics and Science (ACEITLMS), University of Rwanda College of Education, Kayonza, Rwanda</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>13</day>
                <month>4</month>
                <year>2026</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 Ukobizaba F</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport472940" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.172015.3"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The&#x00a0;authors have made notable improvements to the manuscript compared to the previous version. Most of the earlier concerns have been addressed, and the manuscript is now generally readable. However, some issues still require revision: 
                <list list-type="order">
                    <list-item>
                        <p>The third research objective is overly lengthy and lacks sufficient focus. Furthermore, it is unclear how this objective is operationalized in the results section, as no corresponding findings are presented. Considering this and given the absence of aligned results, it is recommended that this objective be removed.</p>
                    </list-item>
                    <list-item>
                        <p>Following the revision of Comment 1, the statement in the first paragraph of results section: &#x201c;Research Question 3, concerning curriculum and pedagogical implications, is addressed in the Discussion section&#x201d; should also be deleted.</p>
                    </list-item>
                    <list-item>
                        <p>The discussion section should explicitly interpret and synthesize the study findings in relation to relevant literature. Accordingly, Sections 4.6 and 4.7 should be removed, as they do not directly engage with or interpret the study results.</p>
                    </list-item>
                    <list-item>
                        <p>A strong conclusion should sound clear, concise, and synthesis-oriented. It should not introduce new ideas, nor discuss the results, and should not include citations. Instead, it should bring the study to a logical close by highlighting meaning, contribution, and implications.</p>
                    </list-item>
                    <list-item>
                        <p>The limitations section appears excessively extensive, which may weaken the perceived robustness of the study. It is recommended that the authors consolidate this section into two concise paragraphs.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>No</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>No</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>Mathematics education.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report453643">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.194776.r453643</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Ukobizaba</surname>
                        <given-names>Fidele</given-names>
                    </name>
                    <xref ref-type="aff" rid="r453643a1">1</xref>
                    <xref ref-type="aff" rid="r453643a2">2</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-1502-2395</uri>
                </contrib>
                <aff id="r453643a1">
                    <label>1</label>University of Rwanda College of Education, Rwamagana, Rwanda</aff>
                <aff id="r453643a2">
                    <label>2</label>African Centre of Excellence for Innovative Teaching and Learning Mathematics and Science (ACEITLMS), University of Rwanda College of Education, Kayonza, Rwanda</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>17</day>
                <month>2</month>
                <year>2026</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 Ukobizaba F</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport453643" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.172015.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The study addresses an important and timely issue, &#x201c;Closing the gap in early mathematics&#x201d;, which is commendable. However, the manuscript has several substantial concerns that should be addressed to strengthen its quality and clarity.</p>
            <p> </p>
            <p> The introduction does not clearly identify or elaborate on the specific gaps in early mathematics among South African students. As a result, the purpose and direction of the study are not sufficiently clear to the reader. The focus and motivation of the study should be made explicit earlier in the manuscript. In this regard, think about the first line of the conclusion (which may be the aim of the study), where it is written: &#x201c;This study examined South African Grade 5 learners&#x2019; mathematics achievement in TIMSS 2023, disaggregated by content and cognitive domains and benchmarked against Singapore&#x2019;s Grade 4 performance.&#x201d;</p>
            <p> </p>
            <p> The research questions are currently too broad and appear to target multiple aspects simultaneously, resulting in a lack of focus. For greater coherence and rigor, the research questions should be more specific and precisely formulated.</p>
            <p> </p>
            <p> The manuscript draws on three theoretical frameworks. This may dilute the conceptual focus of the study. The authors are encouraged to consider using one framework, or at most two that clearly complement each other, to provide a more coherent theoretical grounding.</p>
            <p> </p>
            <p> Research design raises concerns. The rationale for comparing Grade 5 South African students with Grade 4 Singaporean students is not adequately justified, as these groups are not at the same educational level. Comparisons should ideally be made between equivalent grade levels. Moreover, comparing two groups&#x2019; results alone is insufficient to identify learning gaps in mathematics. The authors should consider an alternative design or provide a stronger justification for their approach.</p>
            <p> </p>
            <p> The results section primarily reports differences in performance between the two groups (effect size) but does not clearly identify or analyze &#x201c;gaps&#x201d; in learning. Given the title, 
                <italic>&#x201c;Closing the gap in early mathematics&#x2026;,&#x201d;</italic> the manuscript should explicitly identify where the gaps exist, describe their nature, explore possible reasons for them, and suggest ways these gaps could be addressed.</p>
            <p> </p>
            <p> In the discussion section, it is written: &#x201c;This section addresses Research Question 3 by interpreting domain-specific and cognitive performance patterns in relation to curriculum alignment, pedagogy, and teacher professional development and outlining implications for improving mathematics achievement in South Africa.&#x201d; However, this section should be discussing the results in line with the research questions. Additionally, this section should more explicitly connect the findings to existing literature and prior research to situate the study within the broader scholarly conversation.</p>
            <p> </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>No</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>No</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>Mathematics education.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment15497-453643">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Mokgwathi</surname>
                            <given-names>Mathelela Steyn</given-names>
                        </name>
                        <aff>Early Childhood Education, University of South Africa, Pretoria, Gauteng, South Africa</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>19</day>
                    <month>2</month>
                    <year>2026</year>
                </pub-date>
            </front-stub>
            <body>
                <p>
                    <bold>Reviewer Comments</bold>
                </p>
                <p> 
                    <bold>Response to Comments</bold>
                </p>
                <p> </p>
                <p> The introduction does not clearly identify or elaborate on the specific gaps in early mathematics among South African students. The purpose and direction of the study are not sufficiently clear.</p>
                <p> The introduction has been revised to explicitly state the purpose of the study earlier and to clearly identify the specific achievement gaps motivating the analysis. The revised text now highlights that the largest content-domain gap occurs in Measurement and Geometry and the largest cognitive-domain gap occurs in Knowing. The study aim is explicitly articulated in the opening section, clarifying the diagnostic intent of disaggregating achievement across content and cognitive domains.</p>
                <p> </p>
                <p> The research questions are too broad and lack focus.</p>
                <p> The research questions were refined for precision and coherence. Research Questions 1 and 2 now focus specifically on content-domain and cognitive-domain profiling, while Research Question 3 is limited to curriculum and pedagogical implications derived from the empirical findings. This ensures a clear alignment between Results (RQ1 and RQ2) and Discussion (RQ3).</p>
                <p> </p>
                <p> The manuscript draws on three theoretical frameworks, which may dilute conceptual focus.</p>
                <p> The conceptual framework has been streamlined to retain two complementary frameworks: the TIMSS domain framework and Curriculum Alignment Theory (Porter, 2002). Additional theoretical layering was removed to improve conceptual clarity. Figure 1 was redesigned to reflect this streamlined structure.</p>
                <p> </p>
                <p> The rationale for comparing Grade 5 South African learners with Grade 4 Singaporean learners is not adequately justified. Comparisons should ideally be between equivalent grade levels.</p>
                <p> The methodology section now clearly explains that both cohorts were assessed using the same TIMSS Grade 4 mathematics framework and reporting scale. The comparison is instrument-equivalent rather than curriculum-equivalent. The manuscript explicitly clarifies that the design is descriptive and comparative, not causal, and is intended to benchmark performance patterns rather than infer grade-level learning differences.</p>
                <p> </p>
                <p> Comparing two groups alone is insufficient to identify learning gaps.</p>
                <p> The Results section now operationalises &#x201c;achievement gap&#x201d; explicitly as (a) deviation from the TIMSS international centre point and (b) benchmark scale-score differences with associated effect sizes. The analysis identifies the location, magnitude, and concentration of domain-specific gaps rather than merely reporting mean differences.</p>
                <p> </p>
                <p> The Results section reports effect sizes but does not clearly analyse gaps in learning.</p>
                <p> The Results section was restructured to explicitly identify where gaps exist, describe their magnitude, and distinguish between uniform underperformance and concentrated domain-specific weaknesses. Item-level illustrations were included to demonstrate how gaps manifest in foundational fluency and spatial reasoning tasks.</p>
                <p> </p>
                <p> The Discussion should more explicitly connect findings to the research questions and prior literature.</p>
                <p> The Discussion was reorganised to mirror the empirical findings and to explicitly anchor each subsection in the identified content and cognitive gaps. Connections to prior South African and international research were strengthened, and interpretations are framed within the retained theoretical structure.</p>
                <p> </p>
                <p> Is the study design appropriate and technically sound? (Reviewer indicated &#x201c;No&#x201d;)</p>
                <p> The Method section was strengthened by explicitly detailing the use of plausible values, Rubin&#x2019;s rules for combining estimates, replicate weights for variance estimation, complex sampling adjustments, and effect size interpretation. The manuscript now clearly states that the analysis is descriptive and non-causal.</p>
                <p> </p>
                <p> Are sufficient details provided to allow replication? (Reviewer indicated &#x201c;No&#x201d;)</p>
                <p> Additional methodological transparency was incorporated. The manuscript now specifies estimation procedures, variance computation methods, and reporting of confidence intervals. Derived datasets and methodological documentation have been deposited in an open-access repository to support replication.</p>
                <p> </p>
                <p> Are the conclusions adequately supported by the results? (Reviewer indicated &#x201c;No&#x201d;)</p>
                <p> The Conclusion was revised to align strictly with the empirical findings. Causal overreach was removed, reform priorities are directly grounded in identified domain-specific gaps, and claims are framed proportionately to the descriptive comparative design.</p>
            </body>
        </sub-article>
        <sub-article article-type="response" id="comment15960-453643">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Mokgwathi</surname>
                            <given-names>Mathelela Steyn</given-names>
                        </name>
                        <aff>Early Childhood Education, University of South Africa, Pretoria, Gauteng, South Africa</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>15</day>
                    <month>4</month>
                    <year>2026</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Dear Dr Ukobizaba,</p>
                <p> </p>
                <p> I would like to thank you for your detailed and constructive feedback, which has significantly strengthened this manuscript. In this revision, I have clarified the research gap in the Introduction by explicitly identifying the limited focus on domain-specific mathematics achievement at the primary level and by stating the purpose of the study more clearly at an earlier stage. The research questions have been refined and reduced to ensure greater focus and alignment with the study design. In addition, the conceptual framework has been streamlined by adopting curriculum alignment theory as the primary analytical lens, with the TIMSS framework providing the structure for domain-level analysis.</p>
                <p> </p>
                <p> I have also strengthened the justification for comparing South African Grade 5 learners with Singaporean Grade 4 learners by explaining the use of the TIMSS Grade 4 assessment framework and ensuring comparability on a common measurement scale. The Results section has been revised to explicitly identify and analyse achievement gaps across content and cognitive domains, rather than focusing only on differences in performance.</p>
                <p> </p>
                <p> Furthermore, the Discussion section has been restructured to align directly with the research questions, remove references to the previously included third research question, and strengthen the integration of findings with existing literature. The Methods section has been expanded to provide clearer detail on analytical procedures, including the handling of plausible values, sampling weights, and variance estimation, to enhance transparency and reproducibility. Finally, the conclusion has been rewritten to ensure that it is fully supported by the findings and provides a clear and concise synthesis of the study&#x2019;s contribution.</p>
                <p> </p>
                <p> I trust that these revisions address the concerns raised and improve the overall clarity, methodological rigour, and contribution of the study.</p>
                <p> </p>
                <p> Kind regards,</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report435305">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.189692.r435305</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>O'CONNOR</surname>
                        <given-names>MARGUERITE KHAKASA MIHESO</given-names>
                    </name>
                    <xref ref-type="aff" rid="r435305a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r435305a1">
                    <label>1</label>Kenyatta University, Nairobi, Nairobi County, Kenya</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>30</day>
                <month>12</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 O'CONNOR MKM</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport435305" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.172015.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>
                <bold>&#x00a0;</bold>Review&#x00a0; report</p>
            <p> Introduction</p>
            <p> The article addresses an&#x00a0;important and timely question&#x00a0;in mathematics education: domain-specific patterns of achievement in TIMSS 2023, comparing South African Grade 5 learners with Singaporean Grade 4 learners. The topic is relevant and well&#x00a0;grounded&#x00a0;providing an opportunity&#x00a0;for policy and practice impact&#x00a0;in South Africa and the wider Sub-Saharan region.</p>
            <p> The research purpose and alignment with TIMSS conceptual frameworks are quite clear.&#x00a0;Descriptive statistical analysis and accurate identification of domain-specific weaknesses&#x00a0;is evident.&#x00a0;&#x00a0;There is observed an effective use of TIMSS as an international benchmark to highlight systemic performance gaps.&#x00a0;The integration of literature from both high-performing (Singapore) and lower-performing (South Africa) systems&#x00a0;is provided in a coherent narrative that links content and cognitive deficits to curriculum and instructional factors.</p>
            <p> However several conceptual and methodological limitations&#x00a0;&#x00a0;which currently reduce the manuscript&#x2019;s rigor. have been identified. Revisions are required, particularly in theoretical framing, TIMSS-&#x00a0;specific methodology, and the depth of interpretation and contextualization.&#x00a0;Some minor corrections have also been identified for authors attention for proof reading</p>
            <p> </p>
            <p> The&#x00a0; following observations have been made and related. action points&#x00a0; suggested for consideration</p>
            <p> A): Presentation</p>
            <p> The&#x00a0;article &#x00a0;is generally well structured, articulate, and contextually grounded.&#x00a0;The introduction to the article provides a&#x00a0;clear rationale&#x00a0;for focusing on South African primary mathematics achievement using TIMSS 2023 and situates the work within persistent national under&#x00a0;performance.&#x00a0;The performance gap (- 250 points) between South African Grade 5 and Singaporean Grade 4 learners is clearly stated and framed as a&#x00a0;systemic issue rather than a purely learner-level deficit. The need to move beyond overall scores to content and cognitive domain analysis&#x00a0;is well argued and aligned with TIMSS reporting structures. In addition, the contribution of the study, especially the focus on Grade 5 as an under-researched phase&#x00a0;relative to Grade 9, is well articulated</p>
            <p> The following issues were noted in the presentation</p>
            <p> The rationale&#x00a0; of&#x00a0;benchmarking South African Grade 5 learners against Singaporean Grade 4 learners though&#x00a0; methodologically common in TIMSS is not well explained. The clarification of&#x00a0;why Singapore Grade 4 is used as a comparator is necessary . Secondly ,the article introduces and sustains a misaligned reference to Early Childhood Education (ECE). This&#x00a0;introduction at one point frames the work as focusing on
                <italic> Early Childhood Education,</italic> which is conceptually inaccurate for TIMSS Grade 5.&#x00a0;Grade 5 sits in the primary/intermediate phase, not ECE.. In addition, the conceptual rationale for the &#x00a0;theoretical framing of content and cognitive domains did not recognize frameworks such as:&#x00a0;TIMSS assessment framework, Kilpatrick et al.&#x2019;s strands of mathematical proficiency and &#x00a0;theories of cognitive demand and learning progression,. Finally, there is need to clarify &#x00a0;the research gap.&#x00a0; It is stated that Grade 5 and domain-specific analyses are under-researched, but this gap is not supported by specific citations and a brief synthesis of existing work..&#x00a0;</p>
            <p> As a result, the author is encouraged&#x00a0; to Action as follows 
                <list list-type="bullet">
                    <list-item>
                        <p>Clearly justify&#x00a0;the benchmarking of South Africa Grade 5 against Singapore Grade 4.</p>
                    </list-item>
                    <list-item>
                        <p>Remove or revise all references to ECE; and&#x00a0; use more accurate terms&#x00a0;&#x00a0;</p>
                    </list-item>
                    <list-item>
                        <p>Strengthen theoretical framing&#x00a0;by linking content and cognitive domains to recognized frameworks (e.g., mathematical proficiency, cognitive demand, learning trajectories).</p>
                    </list-item>
                    <list-item>
                        <p>Clarify the research gap&#x00a0;with citations that show the relative neglect of Grade 5 and domain-specific analyses.</p>
                    </list-item>
                    <list-item>
                        <p>Streamline the introduction&#x00a0;by reducing redundant statements and improving paragraph transitions</p>
                    </list-item>
                </list> &#x00a0;B: Study Design and Data Sources.</p>
            <p> The use of quantitative secondary analysis&#x00a0;of TIMSS 2023 data is appropriate and well aligned with the research questions. The sample sizes&#x00a0;and the&#x00a0;national representativeness&#x00a0;of the TIMSS samples justifies the generalizability of the findings. The&#x00a0;recognition of &#x00a0;TIMSS as a well-established international assessment&#x00a0;enables&#x00a0;valid cross-national comparisons. The description of the test design is broadly accurate and shows familiarity with large-scale assessment methodology.</p>
            <p> &#x00a0;The following issues were observed</p>
            <p> The &#x00a0;description of TIMSS technical procedures&#x00a0;in the article is incomplete.&#x00a0; Although TIMSS design is mentioned, the article&#x00a0;does not explicitly state how plausible values (PVs) were handled&#x00a0;in the analysis. These details will help to&#x00a0;confirm that TIMSS technical standards were followed.</p>
            <p> 
                <italic>The</italic> analysis does not explain whether or how contextual variables collected during the study (SES, language of instruction, teacher qualifications, opportunity-to-learn indicators) were used. Given TIMSS&#x2019;s strong emphasis on context, this is a missed opportunity for explanatory depth.</p>
            <p> For the psychometric and reliability considerations, there is need for&#x00a0;reference to internal consistency, While TIMSS provides validated items, secondary analyses typically reference psychometric adequacy or refer explicitly to IEA technical documentation. Inclusion of some justification of Grade-Level Comparison&#x00a0;is missing
                <italic>
                    <italic>. &#x00a0;</italic>
                </italic>A rationale for South Africa testing at Grade 5 (rather than Grade 4) for the primary cycle should be explicitly noted to strengthen the conceptual legitimacy of the comparison with Singapore Grade 4.</p>
            <p> The following action points are suggested 
                <list list-type="bullet">
                    <list-item>
                        <p>Ex
                            <italic>plicitly describe</italic>&#x00a0;the use of plausible values (number of PVs, which domains, how handled in analysis).</p>
                    </list-item>
                    <list-item>
                        <p>
                            <italic>
                                <italic>Detail the weighting strategy</italic>
                            </italic>, including sampling weights and replicate weights, and explain how the two-stage sampling structure was accounted for.</p>
                    </list-item>
                    <list-item>
                        <p>
                            <italic>
                                <italic>Indicate whether contextual variables</italic>
                            </italic>&#x00a0;(SES, language, school resources, teacher characteristics) were included in any extended analysis; if not, explain this limitation.</p>
                    </list-item>
                    <list-item>
                        <p>
                            <italic>
                                <italic>Reference psychometric evidence</italic>
                            </italic>&#x00a0;(e.g., IEA technical reports) to reassure readers about reliability and validity of domain scores.</p>
                    </list-item>
                    <list-item>
                        <p>
                            <italic>
                                <italic>Clarify the grade-level choice</italic>
                            </italic>&#x00a0;for South Africa and how it affects comparability with Singapore Grade 4.</p>
                    </list-item>
                </list> &#x00a0;C: Methodology and Statistical Analysis&#x00a0;The categorization of content domains&#x00a0;(Number, Measurement &amp; Geometry, Data) and cognitive domains&#x00a0;(Knowing, Applying, Reasoning) is appropriate and consistent with TIMSS.&#x00a0;The use of weighted estimates, mean differences, and effect sizes (Cohen&#x2019;s d)&#x00a0;is methodologically&#x00a0; suitable for characterizing performance gaps..&#x00a0;</p>
            <p> The following issues were observed:</p>
            <p> To avoid biased estimates, domain-level comparisons in TIMSS require appropriate PV handling and replication methods. It is not clear&#x00a0;whether the analysis uses domain-specific plausible values correctly&#x00a0;</p>
            <p> For the variance estimation and significance testing, it is does not specify whether standard errors, confidence intervals, or p-values&#x00a0;were computed.&#x00a0;Descriptive statistics without uncertainty estimates weaken inferential robustness.&#x00a0;The author had limited use of advanced modelling. The analysis is largely descriptive (means and effect sizes)..Given the richness of TIMSS data, multivariate methods&#x00a0;(e.g., regression, multilevel models, mediation with contextual variables, DIF analyses) could substantially deepen the insights. While Cohen&#x2019;s D &#x00a0;is reported, the interpretation of effect sizes&#x00a0;in context is limited. There is little discussion of what constitutes a small/medium/large effect in educational terms.,&#x00a0;</p>
            <p> Suggested action points 
                <list list-type="bullet">
                    <list-item>
                        <p>Specify analytical tools and procedures&#x00a0;used to handle PVs and weights (software, settings, treatment of domains).</p>
                    </list-item>
                    <list-item>
                        <p>Report standard errors and/or confidence intervals&#x00a0;for key estimates and clarify whether hypothesis tests were performed.</p>
                    </list-item>
                    <list-item>
                        <p>Where feasible, incorporate multivariate or multilevel analyses&#x00a0;to examine how contextual variables relate to domain-specific performance.</p>
                    </list-item>
                    <list-item>
                        <p>Interpret effect sizes&#x00a0;in educationally meaningful terms, not just numerically.</p>
                    </list-item>
                </list> D: Interpretation of Findings, Discussion, and Conclusion</p>
            <p> The discussion is comprehensive and analytically rich. Content and cognitive domain findings and linking them to curriculum, pedagogy, and systemic factors is integrated well.&#x00a0;Content-level weaknesses&#x00a0; and cognitive weaknesses are clearly interpreted and consistent with previous research .Policy and practice implications&#x00a0;including pillars of reform around foundational knowledge, geometry/spatial reasoning, and leveraging application skills to cultivate reasoning is well structured. The conclusion offers a strong synthesis, links to broader goals such as SDG 4, and emphasizes curriculum-pedagogy-assessment coherence through Curriculum Alignment Theory</p>
            <p> Issues observed included&#x00a0; the need for greater engagement with learning progression theory, to deepen the explanatory power.</p>
            <p> Some &#x00a0;claims require evidence or qualification statements such as &#x201c;procedural accuracy over conceptual exploration&#x201d; or &#x201c;curriculum spreads content thinly&#x201d; need either citation or more cautious phrasing.</p>
            <p> The claim that learners are &#x201c;stronger in applying&#x201d; requires quantification and statistical validation .</p>
            <p> ECE References&#x00a0; in this section should also be expunged and corrected .</p>
            <p> &#x00a0;To maintain academic balance, the discussion on using&#x00a0; Singapore as a model should acknowledge potential constraints such as high academic pressure, streaming, selective teacher recruitment, and cultural expectations.</p>
            <p> Some weaknesses (e.g., geometry and knowing) are repeated in multiple sections affecting&#x00a0; structure&#x00a0; &#x00a0;In Pillar(3) three&#x00a0; repetition&#x00a0; is noted, The discussion is not using empirical evidence and relies of vague quantifiers. The structure of the discussion could be sharpened with clearer subheadings and more concise phrasing.</p>
            <p> Some 
                <italic>recommendations</italic> (e.g., reducing class sizes) may be politically or financially challenging; indicating alternative or incremental strategies&#x00a0;would strengthen the practical relevance.</p>
            <p> The conclusion would benefit from a brief statement of study limitations&#x00a0;(e.g., reliance on secondary data, limited use of contextual variables, cross-sectional design) and directions for future research.</p>
            <p> Action points suggested 
                <list list-type="bullet">
                    <list-item>
                        <p>Deepen theoretical engagement&#x00a0;by explicitly drawing on mathematical proficiency and learning progression frameworks.</p>
                    </list-item>
                    <list-item>
                        <p>Support key claims with evidence&#x00a0;(citations or data) and quantify statements about &#x201c;relative strength&#x201d; in applying.</p>
                    </list-item>
                    <list-item>
                        <p>Remove or reframe ECE references, using appropriate phase terminology.</p>
                    </list-item>
                    <list-item>
                        <p>Provide a more nuanced view of Singapore, noting contextual constraints and non-transferable features.</p>
                    </list-item>
                    <list-item>
                        <p>Consolidate repeated material&#x00a0;on geometry/knowing weaknesses and improve subheadings and paragraph transitions.</p>
                    </list-item>
                    <list-item>
                        <p>Comment on feasibility&#x00a0;of policy recommendations and suggest realistic pathways or phases of implementation.</p>
                    </list-item>
                    <list-item>
                        <p>Add a short section on study limitations and future research directions</p>
                    </list-item>
                    <list-item>
                        <p>&#x00a0;replace vague quantifiers with measurable empirical evidence</p>
                    </list-item>
                </list> &#x00a0;Summary of Key Action Points for Revision</p>
            <p> 
                <italic>Conceptual and Theoretical</italic>
            </p>
            <p> Clarify the benchmarking rationale&#x00a0;(Singapore Grade 4 vs South Africa Grade 5).</p>
            <p> Correct ECE misalignment&#x00a0;and use appropriate phase terminology.</p>
            <p> Strengthen theoretical framing&#x00a0;using TIMSS, mathematical proficiency, and learning progression frameworks.</p>
            <p> Provide a more balanced portrayal of Singapore, acknowledging contextual constraints.</p>
            <p> 
                <italic>Methodological and Statistical</italic>
            </p>
            <p> Explicitly describe handling of plausible values, sampling weights, and replicate weights.</p>
            <p> Report standard errors, confidence intervals, and significance tests&#x00a0;for key comparisons.</p>
            <p> Consider multivariate or multilevel analyses&#x00a0;using contextual variables</p>
            <p> Interpret effect sizes&#x00a0;substantively, not just numerically.</p>
            <p> 
                <italic>Context and Interpretation</italic>
            </p>
            <p> Integrate contextual questionnaire data&#x00a0;(SES, language, resources, teacher factors) where possible.</p>
            <p> Link performance patterns explicitly to systemic factors&#x00a0;(teacher knowledge, curriculum pacing, language of learning and teaching)</p>
            <p> Ensure claims are evidence-based&#x00a0;and appropriately referenced.</p>
            <p> 
                <italic>Structure and Presentation&#x00a0; &gt;&gt;minor corrections</italic>
            </p>
            <p> Streamline the introduction and discussion, removing redundancy and improving coherence.</p>
            <p> Strengthen section headings and transitions&#x00a0;for easier navigation.</p>
            <p> Include tables or figures&#x00a0;with domain-specific means, SEs, and effect sizes</p>
            <p> Add a concise section on limitations and future research.</p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> </p>
            <p> &#x200b;&#x200b;&#x200b;&#x200b;&#x200b;&#x200b;</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Mathematics Education and Teacher Education</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment15167-435305">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Mokgwathi</surname>
                            <given-names>Mathelela Steyn</given-names>
                        </name>
                        <aff>Early Childhood Education, University of South Africa, Pretoria, Gauteng, South Africa</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>None</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>1</day>
                    <month>1</month>
                    <year>2026</year>
                </pub-date>
            </front-stub>
            <body>
                <p>I would like to thank Reviewer 2 for the detailed and constructive feedback concerning theoretical framing, TIMSS-specific methodology, interpretation of domain-level findings, and the balance and structure of the Discussion and Conclusion. These concerns have been addressed through strengthened integration of mathematical proficiency and learning progression frameworks, expanded methodological transparency regarding plausible values and variance estimation, refined interpretation of effect sizes, removal of misaligned ECE references, and a more nuanced treatment of Singapore as a benchmark. We believe the revisions comprehensively address the issues identified.</p>
            </body>
        </sub-article>
        <sub-article article-type="response" id="comment15958-435305">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Mokgwathi</surname>
                            <given-names>Mathelela Steyn</given-names>
                        </name>
                        <aff>Early Childhood Education, University of South Africa, Pretoria, Gauteng, South Africa</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>15</day>
                    <month>4</month>
                    <year>2026</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Dear Dr O&#x2019;Connor,</p>
                <p> </p>
                <p> I would like to thank you for your detailed and constructive feedback, which has contributed significantly to strengthening this manuscript. In this revision, I have clarified the benchmarking rationale by providing a detailed explanation of the use of the TIMSS Grade 4 framework and ensuring comparability between South African Grade 5 and Singaporean Grade 4 learners. I have also strengthened the articulation of the research gap, removed misaligned references to Early Childhood Education, and refined the theoretical framing to improve conceptual clarity.</p>
                <p> </p>
                <p> In addition, I have expanded the methodological section to provide clearer explanations of TIMSS procedures, including the handling of plausible values, sampling weights, and variance estimation, and clarified the descriptive and diagnostic focus of the study in relation to the use of contextual variables and modelling approaches. The interpretation of effect sizes has been strengthened, and claims have been revised to ensure they are appropriately supported and cautiously framed.</p>
                <p> </p>
                <p> The discussion has been revised to enhance analytical depth, reduce prescriptive elements, improve structural coherence, and better integrate the findings with existing literature, while also providing a more balanced interpretation of the Singapore comparison. A concise limitations section has also been included to reflect key design and contextual constraints.</p>
                <p> </p>
                <p> I trust that these revisions address the concerns raised and improve the overall clarity, rigour, and contribution of the study.</p>
                <p> </p>
                <p> Kind regards,</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report435300">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.189692.r435300</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Khazanchi</surname>
                        <given-names>Rashmi</given-names>
                    </name>
                    <xref ref-type="aff" rid="r435300a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0001-8601-4144</uri>
                </contrib>
                <aff id="r435300a1">
                    <label>1</label>Open University of the Netherlands, Heerlen, The Netherlands</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>30</day>
                <month>12</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Khazanchi R</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport435300" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.172015.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>reject</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The manuscript titled 
                <italic>&#x201c;Closing the Gap in Early Mathematics: Domain and Cognitive Insights from TIMSS 2023 in South Africa and Singapore&#x201d;</italic> addresses an important topic by examining gaps in elementary-level mathematics learning through a comparison of TIMSS results from the two countries. While the study&#x2019;s focus and findings are potentially valuable, the manuscript contains several major methodological and conceptual issues that need to be addressed.</p>
            <p> The author compares TIMSS results of Grade 4 learners from Singapore with Grade 5 learners from South Africa; however, TIMSS assesses learners at Grades 4 and 8 only. The manuscript therefore needs to clarify how Grade 5 results were derived from TIMSS data and justify the rationale for benchmarking Grade 4 and Grade 5 learners. The research Question 1 may need to be revised, as TIMSS does not directly assess Grade 5 learners.</p>
            <p> Several in-text citations are missing throughout the manuscript as statement should be supported with appropriate citations. On page 3, the title of the literature review (&#x201c;Comparative analysis of South African and Singaporean Grade 5 mathematics achievement&#x201d;) requires correction and should be revised consistently throughout the paper, as it implies that the comparison is limited solely to Grade 5 mathematics achievement.</p>
            <p> In the conceptual framework section, the author introduces an integrative framework combining a comparative lens, the TIMSS curriculum model, and curriculum theory dimensions; however, the sentence presenting this framework is a paragraph long and reduces clarity. To enhance clarity, the author needs to explain the role of each framework component, articulate the directional relationships among key constructs, and strengthen the narrative link to Figure 1.</p>
            <p> In the results section, findings should be discussed in direct relation to the research questions rather than beginning with tables or figures. The discussion section would benefit from further elaboration and stronger integration with the literature reviewed earlier, as well as more explicit connections to the research questions to enhance clarity and readability.</p>
            <p> </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>No</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Artificial Intelligence in Education, AI - based tutoring systems, Mathematics Achievement, Student Engagement.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to state that I do not consider it to be of an acceptable scientific standard, for reasons outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment15145-435300">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Mokgwathi</surname>
                            <given-names>Mathelela Steyn</given-names>
                        </name>
                        <aff>Early Childhood Education, University of South Africa, Pretoria, Gauteng, South Africa</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interest</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>30</day>
                    <month>12</month>
                    <year>2025</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Dear Reviewer,</p>
                <p> Thank you for your careful reading of the manuscript and for the detailed and constructive feedback provided. I appreciate the time and scholarly attention devoted to the review, and I am grateful for the suggestions, which have been invaluable in strengthening the clarity, methodological transparency, and overall quality of the paper. I respond to each of the major points raised below.</p>
                <p> With regard to the comment concerning the comparison between Grade 4 learners in Singapore and Grade 5 learners in South Africa, I appreciate the opportunity to clarify this important methodological issue. While TIMSS is internationally benchmarked at Grades 4 and 8, it is also well documented that several participating countries, including South Africa, assess learners at Grade 5 using the Grade 4 TIMSS assessment instruments. This approach is adopted where curriculum alignment indicates that the Grade 4 framework more accurately reflects the instructional exposure of learners, a practice that parallels the assessment of Grade 9 learners using the Grade 8 TIMSS instruments in some contexts. In the revised manuscript, I will make this rationale explicit by clarifying that the South African Grade 5 learners were assessed using the standard TIMSS Grade 4 mathematics framework and instruments, thereby ensuring construct equivalence and international comparability. In response to the reviewer&#x2019;s suggestion, Research Question 1 will be refined to explicitly reflect this assessment design and to avoid any implication that TIMSS directly assesses a distinct Grade 5 framework.</p>
                <p> Concerning the observation that several in-text citations are missing, I acknowledge this limitation. A thorough review of the manuscript is currently underway to ensure that all empirical claims and theoretical statements are appropriately supported by relevant and up-to-date scholarly sources. These revisions will improve academic rigour and align with best practices in scholarly writing.</p>
                <p> Regarding the comment on the literature review title on page 3, I appreciate the reviewer&#x2019;s attention to precision in terminology. The study intentionally focuses on Grade 5 learners in South Africa, assessed using the Grade 4 TIMSS instrument, and compares their performance with Grade 4 learners in Singapore, assessed using the same framework. To avoid ambiguity, the literature review title and related headings will be revised for consistency and clarity, ensuring that they accurately reflect the comparative focus and assessment design of the study, rather than implying a narrow or exclusive emphasis on Grade 5 achievement alone.</p>
                <p> I am also grateful for the insightful feedback on the conceptual framework section. I acknowledge that the current presentation of the integrative framework is overly dense. In the revised manuscript, this section will be restructured to clearly explain the role of each component, namely the comparative education lens, the TIMSS curriculum model, and the curriculum theory dimensions. The directional relationships among the key constructs will be explicitly articulated, and the narrative will be more tightly aligned with Figure 1 to enhance conceptual coherence and reader accessibility.</p>
                <p> Finally, I appreciate the guidance regarding the results and discussion sections. In response, the results will be reorganised so that the findings are presented explicitly in relation to the research questions before introducing supporting tables and figures. The discussion section will be expanded to provide deeper engagement with the literature reviewed earlier in the manuscript and to draw clearer, more explicit links between the findings, the research questions, and the broader scholarly debates on early mathematics achievement and international benchmarking.</p>
                <p> Once again, I sincerely thank the reviewer for these constructive comments. I am confident that addressing these points will substantially strengthen the manuscript and enhance its contribution to the literature on comparative mathematics education.</p>
            </body>
        </sub-article>
        <sub-article article-type="response" id="comment15166-435300">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Mokgwathi</surname>
                            <given-names>Mathelela Steyn</given-names>
                        </name>
                        <aff>Early Childhood Education, University of South Africa, Pretoria, Gauteng, South Africa</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>None</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>1</day>
                    <month>1</month>
                    <year>2026</year>
                </pub-date>
            </front-stub>
            <body>
                <p>I would like to thank Reviewer 1 for highlighting critical concerns regarding grade-level comparability, clarity of the benchmarking rationale, citation completeness, and alignment between the research questions, results, and interpretation. These issues have been carefully addressed through clarifications of TIMSS grade participation, revisions to the conceptual framing and research questions, strengthened citation support, and restructuring of the Results and Discussion sections. We believe the revised manuscript now fully addresses the raised concerns.</p>
            </body>
        </sub-article>
        <sub-article article-type="response" id="comment15957-435300">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Mokgwathi</surname>
                            <given-names>Mathelela Steyn</given-names>
                        </name>
                        <aff>Early Childhood Education, University of South Africa, Pretoria, Gauteng, South Africa</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>15</day>
                    <month>4</month>
                    <year>2026</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Dear Dr Khazanchi,</p>
                <p> </p>
                <p> I would like to thank you for your constructive and insightful feedback, which has significantly strengthened this manuscript. In this revision, I have clarified the rationale for benchmarking South African Grade 5 learners with Singaporean Grade 4 learners by explicitly explaining the use of the TIMSS Grade 4 framework and ensuring measurement equivalence. I have also refined the research questions to align with the TIMSS design and streamlined the conceptual framework to improve clarity and coherence.</p>
                <p> </p>
                <p> In addition, I have strengthened the literature support throughout the manuscript, revised section headings for consistency, and restructured the Results and Discussion sections to ensure clear alignment with the research questions and stronger integration with existing literature. The conclusion has been rewritten to provide a concise synthesis fully grounded in the findings.</p>
                <p> I trust that these revisions address the concerns raised and improve the overall clarity, rigour, and contribution to the study.</p>
                <p> </p>
                <p> Kind regards,</p>
            </body>
        </sub-article>
    </sub-article>
</article>
