<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.178067.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Assessing the Impact of AI-Augmented DevSecOps on Lead Time in Agile Release Management</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Agung Gunawan</surname>
                        <given-names>Jimmy</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <uri content-type="orcid">https://orcid.org/0009-0009-4550-4476</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Laksono Singgih</surname>
                        <given-names>Moses</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Raden Venantius Hari</surname>
                        <given-names>Ginardi</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Interdisciplinary School of Management and Technology, Institut Teknologi Sepuluh Nopember, Surabaya, East Java, 60264, Indonesia</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:moseslsinggih@its.ac.id">moseslsinggih@its.ac.id</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>11</day>
                <month>5</month>
                <year>2026</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2026</year>
            </pub-date>
            <volume>15</volume>
            <elocation-id>699</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>2</day>
                    <month>3</month>
                    <year>2026</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 Agung Gunawan J et al.</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/15-699/pdf"/>
            <abstract>
                <sec>
                    <title>Background</title>
                    <p>Despite increasing interest in generative artificial intelligence (AI) within DevSecOps environments, empirical evidence quantifying its impact on software delivery performance remains limited, particularly in regulated enterprise contexts. Lead time for changes is a core DevSecOps performance indicator, yet controlled evaluations of AI-augmented pipelines remain scarce. This study investigates whether on-premises generative AI integration can measurably reduce release lead time while preserving governance and quality controls.</p>
                </sec>
                <sec>
                    <title>Methods</title>
                    <p>A quasi-experimental within-team design was conducted across two consecutive two-week Scrum sprints in an enterprise environment developing internal sales, human resource, and biometric absence systems. Sprint 1 served as the baseline using a conventional DevSecOps pipeline. Sprint 2 introduced an AI-augmented pipeline integrating Retrieval-Augmented Generation (RAG) and Reinforcement Learning from Human Feedback (RLHF) within a GitLab&#x2013;Docker CI/CD infrastructure. The primary outcome was lead time for changes. Secondary metrics included deployment frequency and change failure rate. Statistical analysis employed Welch&#x2019;s t-test, effect size estimation (Cohen&#x2019;s d), and confidence interval analysis.</p>
                </sec>
                <sec>
                    <title>Results</title>
                    <p>A total of 42 distinct changes (21 per sprint) were analyzed. Mean lead time decreased by 39.2% during the intervention sprint (Welch&#x2019;s t(32.4)&#x00a0;=&#x00a0;4.28, p&#x00a0;=&#x00a0;0.00014), with a large effect size (Cohen&#x2019;s d&#x00a0;=&#x00a0;1.32) and a 95% confidence interval indicating a reduction of 15.8&#x2013;37.4&#x00a0;hours. Security scanning time decreased by 64.6%, and approval latency decreased by 48.5%. Deployment frequency increased by 61.9%, while change failure rate declined from 14.3% to 8.7%. AI recommendation acceptance improved from 62.4% in Week 1 to 78.6% in Week 2 and was positively correlated with lead-time reduction (r&#x00a0;=&#x00a0;0.73, p&#x00a0;&lt;&#x00a0;0.05).</p>
                </sec>
                <sec>
                    <title>Conclusions</title>
                    <p>On-premises human-in-the-loop generative AI significantly reduced DevSecOps lead time without compromising reliability or governance. The findings challenge the traditional speed&#x2013;security tradeoff by demonstrating that AI-assisted security validation and release evaluation can simultaneously enhance delivery efficiency and operational stability in regulated enterprise environments.</p>
                    <p>This study examines the influence of on-premises generative AI augmentation on DevSecOps release lead time within agile software development settings. Despite increasing interest in generative artificial intelligence (AI) within Development-Security-Operations (DevSecOps) environments, empirical evidence quantifying its impact on software delivery performance remains limited, particularly in regulated enterprise contexts. Lead time for changes is a core DevSecOps performance indicator, yet controlled evaluations of AI-augmented pipelines remain scarce. This study investigates whether on-premises generative AI integration can measurably reduce release lead time while preserving governance and quality controls. A quasi-experimental within-team design was conducted across two consecutive two-week Scrum sprints in an enterprise environment developing internal sales, human resource, and biometric absence systems. Sprint 1 served as the baseline using a conventional DevSecOps pipeline. Sprint 2 introduced an AI-augmented pipeline integrating Retrieval-Augmented Generation (RAG) and Reinforcement Learning from Human Feedback (RLHF) within a GitLab&#x2013;Docker CI/CD infrastructure. The primary outcome was lead time for changes. Secondary metrics included deployment frequency and change failure rate. Statistical analysis employed Welch&#x2019;s t-test, effect size estimation (Cohen&#x2019;s d), and confidence interval analysis. A total of 42 distinct changes (21 per sprint) were analyzed. Mean lead time decreased by 39.2% during the intervention sprint (Welch&#x2019;s t(32.4)&#x00a0;=&#x00a0;4.28, p&#x00a0;=&#x00a0;0.00014), with a large effect size (Cohen&#x2019;s d&#x00a0;=&#x00a0;1.32) and a 95% confidence interval indicating a reduction of 15.8&#x2013;37.4&#x00a0;hours. Security scanning time decreased by 64.6%, and approval latency decreased by 48.5%. Deployment frequency increased by 61.9%, while change failure rate declined from 14.3% to 8.7%. AI recommendation acceptance improved from 62.4% in Week 1 to 78.6% in Week 2 and was positively correlated with lead-time reduction (r&#x00a0;=&#x00a0;0.73, p&#x00a0;&lt;&#x00a0;0.05). On-premises human-in-the-loop generative AI significantly reduced DevSecOps lead time without compromising reliability or governance. The findings challenge the traditional speed&#x2013;security tradeoff by demonstrating that AI-assisted DevSecOps validation and release evaluation can simultaneously enhance delivery efficiency and operational stability in regulated enterprise environments.</p>
                </sec>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>DevSecOps</kwd>
                <kwd>Lead time for changes</kwd>
                <kwd>Generative AI</kwd>
                <kwd>Retrieval-Augmented Generation (RAG)</kwd>
                <kwd>Reinforcement Learning from Human Feedback (RLHF)</kwd>
                <kwd>Continuous Integration/Continuous Delivery (CI/CD).</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The author(s) declared that no grants were involved in supporting this work.</funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec id="sec5" sec-type="intro">
            <title>1. Introduction</title>
            <p>Despite the growing interest in applying generative AI within DevOps and DevSecOps, existing research has largely focused on conceptual frameworks, developer productivity, and autonomous code generation, with limited empirical validation of delivery performance outcomes in enterprise contexts (
                <xref ref-type="bibr" rid="ref17">Fu et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref18">Gajbhiye et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref31">Liang et al., 2024</xref>). In particular, there is a lack of quantitative evidence explaining how AI integration affects the release of time under real-world governance and compliance constraints (
                <xref ref-type="bibr" rid="ref10">Azonuche &amp; Enyejo, 2024</xref>; 
                <xref ref-type="bibr" rid="ref12">Bahi et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref29">Khan et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref32">Nadella et al., 2025</xref>). To address this gap, this study presents a quasi-experimental evaluation of on-premises human-in-the-loop generative AI augmentation in an Agile DevSecOps pipeline (
                <xref ref-type="bibr" rid="ref25">Jeong, 2023</xref>; 
                <xref ref-type="bibr" rid="ref41">Singh et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref51">Zhao et al., 2024</xref>). By comparing two consecutive Scrum sprints, one baseline and one AI-augmented, this study isolates the impact of retrieval&#x2013;augmented generation (RAG) and Reinforcement Learning from Human Feedback (RLHF) on release lead time and related delivery metrics (
                <xref ref-type="bibr" rid="ref30">Knollmeyer et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref33">Neha et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref48">Yu et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref52">Zhou, 2024</xref>). Beyond measuring aggregate performance changes, this study conducts a stage-level pipeline analysis to identify the mechanisms through which AI influences delivery efficiency. The results provide empirical evidence, methodological guidance, and practical insights for enterprises seeking to reconcile accelerated software delivery with security, governance, and compliance requirements (
                <xref ref-type="bibr" rid="ref17">Fu et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref18">Gajbhiye et al., 2024</xref>).</p>
            <p>In Agile DevSecOps, lead time, often defined as the time from code commit to production deployment, is a critical indicator of release efficiency (
                <xref ref-type="bibr" rid="ref13">Bedoya et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref18">Gajbhiye et al., 2024</xref>). DevSecOps integrates security throughout development; however, it can slow down delivery if performed manually. Emerging on-premises generative AI techniques (e.g., LLMs augmented with retrieval-augmented generation) and fine-tuned via reinforcement learning from human feedback (RLHF) promise automation of coding and testing tasks (
                <xref ref-type="bibr" rid="ref21">Gargari &amp; Habibi, 2025</xref>; 
                <xref ref-type="bibr" rid="ref25">Jeong, 2023</xref>; 
                <xref ref-type="bibr" rid="ref47">Yigit et al., 2024</xref>). Early research suggests that generative AI can transform software development by automating coding, testing, and deployment tasks, potentially accelerating delivery while ensuring its security. This study proposes an experimental framework to measure the effect of an AI-augmented DevSecOps pipeline on lead time in the context of an internal tool (sales, HR application, biometric absence application) (
                <xref ref-type="bibr" rid="ref1">Abiona et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref7">Akbar et al., 2022</xref>; 
                <xref ref-type="bibr" rid="ref12">Bahi et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref17">Fu et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref43">Tomas et al., 2019</xref>). Using iterative Scrum sprints, this study compared the lead time before and after integrating on-premises RAG/RLHF tools into a GitLab&#x2013;Docker CI/CD pipeline (
                <xref ref-type="bibr" rid="ref15">Donca et al., 2022</xref>; 
                <xref ref-type="bibr" rid="ref28">Karamitsos et al., 2020</xref>). The goal was to quantify the changes in lead time (and related metrics) attributable to the AI enhancements. Despite the growing interest in applying generative artificial intelligence (GenAI) within DevOps and DevSecOps environments, existing research has predominantly focused on conceptual frameworks, developer productivity enhancements, and autonomous code generation capabilities, with comparatively limited attention to delivery performance outcomes in enterprise settings (
                <xref ref-type="bibr" rid="ref17">Fu et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref18">Gajbhiye et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref31">Liang et al., 2024</xref>). In particular, there remains a lack of controlled, quantitative evidence demonstrating how generative AI augmentation affects the lead time for changes, which is widely recognized as a core indicator of DevSecOps release efficiency (
                <xref ref-type="bibr" rid="ref13">Bedoya et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref18">Gajbhiye et al., 2024</xref>). Many prior studies rely on qualitative assessments or high-level observations, offering limited causal insight into whether AI integration measurably accelerates release cycles under real-world operating conditions. Moreover, existing empirical studies often treat CI/CD pipelines as monolithic systems, reporting aggregate performance improvements without examining which specific pipeline stages contribute most to the observed gains. Therefore, the underlying mechanisms through which AI influences DevSecOps performance, particularly across the build, testing, security validation, and approval phases, remain insufficiently understood. This limitation constrains the practical applicability of prior findings, as organizations lack actionable guidance on where AI assistance yields the greatest operational benefits. A further limitation of the current literature is its predominant reliance on cloud-hosted AI services and development contexts with relatively relaxed governance constraints. In contrast, many enterprise environments, especially those operating internal systems for sales, human resources, and financial processing, are subject to stringent requirements regarding data sovereignty, auditability, and human oversight. Consequently, it remains unclear whether on-premises human-in-the-loop generative AI can meaningfully reduce DevSecOps lead time while preserving security, compliance, and accountability in regulated enterprise settings (
                <xref ref-type="bibr" rid="ref25">Jeong, 2023</xref>; 
                <xref ref-type="bibr" rid="ref41">Singh et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref51">Zhao et al., 2024</xref>).</p>
        </sec>
        <sec id="sec6">
            <title>2. Literature review</title>
            <p>DevOps and lead time metrics: High-performing DevOps teams measure and minimize the lead time for changes (code commit to deploy) (
                <xref ref-type="bibr" rid="ref11">Badshah et al., 2020</xref>; 
                <xref ref-type="bibr" rid="ref42">Snyder &amp; Curtis, 2018</xref>). DORA identifies change lead time as a core throughput metric, and Atlassian notes that top teams achieve lead times on the order of hours (versus days/weeks for lower performers) (
                <xref ref-type="bibr" rid="ref23">Hatch &amp; Curry, 2020</xref>; 
                <xref ref-type="bibr" rid="ref39">Schmid, 2017</xref>). Practices such as trunk-based development, small batch sizes, and test automation are known to shorten lead times (
                <xref ref-type="bibr" rid="ref1">Abiona et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref2">Adewusi et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref36">Prates &amp; Pereira, 2024</xref>; 
                <xref ref-type="bibr" rid="ref43">Tomas et al., 2019</xref>). In DevSecOps, automating security checks is crucial because manual reviews can introduce bottlenecks (
                <xref ref-type="bibr" rid="ref5">Ahmed &amp; Francis, 2019</xref>; 
                <xref ref-type="bibr" rid="ref18">Gajbhiye et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref40">Shamsuddoha et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref53">Zota et al., 2025</xref>). Recent studies indicate that AI-driven tools can embed security automation without impeding delivery speed (G. 
                <xref ref-type="bibr" rid="ref4">Agarwal, 2024</xref>; 
                <xref ref-type="bibr" rid="ref37">Rangnau et al., 2020</xref>; 
                <xref ref-type="bibr" rid="ref45">Ur Rahman &amp; Williams, 2016</xref>).</p>
            <p>Generative AI in DevOps: Modern CI/CD platforms increasingly integrate AI for developer assistance (
                <xref ref-type="bibr" rid="ref20">Garg et al., 2021</xref>; 
                <xref ref-type="bibr" rid="ref46">Wessel et al., 2025</xref>). For example, GitLab&#x2019;s Code Suggestions use generative models to propose code snippets to help developers &#x201c;write code more efficiently&#x201d; ( 
                <xref ref-type="bibr" rid="ref3">Agarwal et al., 2018</xref>; 
                <xref ref-type="bibr" rid="ref18">Gajbhiye et al., 2024</xref>). Generative AI frameworks, such as RAG, improve answer accuracy by retrieving relevant knowledge before generation, and RLHF fine-tunes models to align with human preferences (
                <xref ref-type="bibr" rid="ref8">Amugongo et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref9">Arslan et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref19">Gao et al., 2023</xref>; 
                <xref ref-type="bibr" rid="ref24">Hikov &amp; Murphy, 2024</xref>; 
                <xref ref-type="bibr" rid="ref50">Zhang &amp; Zhang, 2025</xref>). In the context of DevSecOps, recent qualitative research has found that combining DevSecOps with generative AI (e.g., LLMs) leads to the &#x201c;automation of coding tasks and predictive analytics&#x201d; and improved source code management (
                <xref ref-type="bibr" rid="ref1">Abiona et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref7">Akbar et al., 2022</xref>; 
                <xref ref-type="bibr" rid="ref25">Jeong, 2023</xref>; 
                <xref ref-type="bibr" rid="ref34">Omran Almagrabi &amp; Khan, 2025</xref>; 
                <xref ref-type="bibr" rid="ref37">Rangnau et al., 2020</xref>). Another study reported that GAI can &#x201c;automate various aspects of software development, including coding, testing, and deployment&#x201d; when used in a DevSecOps framework (
                <xref ref-type="bibr" rid="ref17">Fu et al., 2025</xref>; 
                <xref ref-type="bibr" rid="ref43">Tomas et al., 2019</xref>; 
                <xref ref-type="bibr" rid="ref53">Zota et al., 2025</xref>). These insights suggest that AI has the potential to reduce manual effort in securing CI/CD pipelines; however, quantitative evidence on metrics such as lead time is still required (
                <xref ref-type="bibr" rid="ref6">Ajiga et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref18">Gajbhiye et al., 2024</xref>).</p>
            <p>Agile and experimental methods: Scrum and other Agile frameworks emphasize iterative delivery and empirical measurement (
                <xref ref-type="bibr" rid="ref14">Cervone, 2011</xref>; 
                <xref ref-type="bibr" rid="ref27">Junker et al., 2022</xref>; 
                <xref ref-type="bibr" rid="ref44">Uluda&#x011f; et al., 2021</xref>). In the research context, action research methods involving cycles of planning, action, observation, and reflection align well with agile projects (
                <xref ref-type="bibr" rid="ref12">Bahi et al., 2024</xref>). Accordingly, our methodology uses short sprints (2&#x2013;4&#x00a0;weeks) to iteratively implement and measure changes, reflecting the scrum pillars of transparency, inspection, and adaptation (
                <xref ref-type="bibr" rid="ref16">Dugbartey &amp; Kehinde, 2025</xref>; 
                <xref ref-type="bibr" rid="ref38">Salo &amp; Abrahamsson, 2006</xref>). At the end of each sprint, team feedback and logged metrics informed adjustments, embodying continuous improvement (
                <xref ref-type="bibr" rid="ref26">Joel et al., 2024</xref>; 
                <xref ref-type="bibr" rid="ref35">Paasivaara et al., 2009</xref>; 
                <xref ref-type="bibr" rid="ref49">Zayat &amp; Senvar, 2020</xref>).</p>
        </sec>
        <sec id="sec7">
            <title>3. Methods and materials</title>
            <p>This study evaluated sprint lead-time performance within an Agile DevSecOps release management process. The research did not involve medical research, clinical intervention, animal experimentation, or the collection of personal sensitive data. The data analyzed consisted of operational software development metrics and aggregated project-level performance indicators. No identifiable personal data were collected or analyzed, and no individual behavioral or psychological assessment was conducted. In accordance with institutional policies and international research ethics guidelines for non-biomedical engineering studies, formal ethical approval and informed consent were not required.</p>
            <p>This study proposes a quasi-experimental, within-team design over multiple sprints. The same development team works on comparable feature tasks in two phases: a baseline phase (current DevSecOps pipeline without AI) and an AI-augmented phase (pipeline enhanced with on-premises RAG/RLHF tools). Quantitative DevOps metrics were collected throughout the study. The primary metric is Lead Time for Changes (committing production deployment). Secondary metrics include Deployment Frequency (deployment per sprint) and Change Failure Rate (percentage of deployments requiring hotfixes). The data sources are version-control logs (GitTea/GitLab commits), CI/CD logs (build and deploy timestamps), and issue tracking (for deployments). Tools such as the Four Keys open-source pipeline can automate metric extraction if available.</p>
            <p>Because no historical data exists, the baseline is established in situ, and the team first runs a pilot sprint under the existing process. This generates initial data on lead times and bottlenecks. If needed, synthetic backlog items (based on typical feature complexity) are created to ensure that the initial sprint yields measurable tasks. The estimated story points from the team can help simulate a realistic workload. Known industry benchmarks (e.g., Atlassian&#x2019;s high-performing lead times in hours) guide the expected ranges.</p>
            <p>This research approach follows agile testing cycles: after the baseline sprint (e.g., 2&#x2013;4&#x00a0;weeks), the team implements RAG/RLHF enhancements in the pipeline (e.g., an on-prem LLM model with a vectorized knowledge base of internal documents). In subsequent sprints (s), these AI tools assist with coding (e.g., code completion, test generation) and automated reviews. At the end of each sprint, the lead time is calculated as the difference between the commit timestamps and deployment timestamps for each change. We also logged the deployment counts and any rollback incidents. This action-research loop allows for qualitative feedback (developers&#x2019; experience with AI tools) alongside metrics:
                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Context: Increasing pressure for rapid, secure software delivery in enterprise environments.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Problem: Security integration in DevSecOps often extends the lead time.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Solution: AI-assisted automation for security and release tasks.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Research Question: How does on-premises generative AI augmentation affect lead times in Agile DevSecOps pipelines?</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Contribution: Empirical measurement framework with enterprise implementation.</p>
                    </list-item>
                </list>
            </p>
            <sec id="sec8">
                <title>3.1. Experimental design</title>
                <p>The experiment spanned two consecutive sprints of equal duration (e.g., two weeks each). 
                    <xref ref-type="table" rid="T1">
Table 1</xref> outlines the sprint structure and the measured metrics. Sprint 1 (Baseline) follows the team&#x2019;s usual Agile DevSecOps process: code development in.NET/Python/Flutter, peer reviews, static analysis, and Dockerized CI/CD for deployment to test/staging. No AI assistance was used. Sprint 2 (AI-Augmented) introduced the use of generative AI at key points. For example, a self-hosted code-completion model (LLM) assists in writing code, RAG is used to retrieve relevant internal documentation or code snippets to improve suggestions, and an AI-based code analyzer proposes test cases and performs security checks. The rest of the pipeline remains the same (e.g., GitLab runners and Docker builds). Throughout both sprints, the following metrics were recorded:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Lead time for changes: Time (hours) from a code committed to entering version control to its first successful production deployment. (For long tasks, we measure per commit batch.).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Deployment frequency: Number of successful deployments to production per sprint.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Change failure rate: Percentage of deployments that require immediate remediation (hotfix or rollback).</p>
                        </list-item>
                    </list>
                </p>
                <table-wrap id="T1" orientation="portrait" position="float">
                    <label>
Table 1. </label>
                    <caption>
                        <title>Sprint Metrics (Baseline vs AI-Augmented).</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Metric</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Baseline Sprint</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
AI-Augmented Sprint</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">Lead Time for Changes (hours)</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">72</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">48</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">Deployment Frequency (per sprint)</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">2</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">3</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="middle">Change Failure Rate (%)</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">15%</td>
                                <td align="left" colspan="1" rowspan="1" valign="middle">10%</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>Each commit and deployment event is time-stamped in the GitLab/Docker logs. Using these, lead times were computed per change. Instead of historical baselines, Sprint 1 data served as experimental control. For robustness, at least 5&#x2013;10 change events per sprint should be collected to compute the median lead time and frequency; more samples reduce the variance. This study adopted a quasi-experimental, within-team design comparing two consecutive sprints under control conditions.</p>
                <p>Sprint Timeline:</p>
                <p>Week 1&#x2013;2: Baseline Sprint (Conventional DevSecOps).</p>
                <p>&#x2193;</p>
                <p>1-week transition (AI integration).</p>
                <p>&#x2193;</p>
                <p>Week 4&#x2013;5: Intervention Sprint (AI-Augmented DevSecOps).</p>
                <p>Control Variables:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Identical team composition and skill distribution.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Comparable feature complexity (validated via story point estimation).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Consistent sprint duration (2&#x00a0;weeks each).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Identical infrastructure and tooling baseline.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Same product backlog priority and business requirements.</p>
                        </list-item>
                    </list>
                </p>
            </sec>
            <sec id="sec9">
                <title>3.2. Intervention: AI-Augmented DevSecOps Pipeline</title>
                <p>The intervention introduced an AI-augmented DevSecOps pipeline based on a three-phase plan&#x2013;automate&#x2013;monitor framework, integrating generative AI as a decision support mechanism across the release process. The AI architecture comprises three core components. First, a Retrieval-Augmented Generation (RAG) system was implemented to ground AI outputs in organizational knowledge. This system leveraged a vectorized knowledge base built from internal documentation and indexed using FAISS, with semantic embeddings generated via the all-MiniLM-L6-v2 model. Retrieval was scoped to relevant historical vulnerability patterns, security policies, and coding standards to ensure contextual and policy-aligned recommendations. Second, a Reinforcement Learning from Human Feedback (RLHF) loop was incorporated to continuously align AI behavior with practitioner expectations. Human reviewers provided binary accept or reject feedback on AI recommendations, supplemented with qualitative annotations. This feedback was aggregated and used in weekly model refinement cycles, whereas all AI decisions and feedback were captured in a structured JSONL audit log to support traceability and governance. Finally, the AI services were deployed on an on-premises large language model infrastructure using a fine-tuned Llama 2 7B model trained on the organization&#x2019;s internal codebase. The model operated within a local GPU cluster exposed through secure REST API endpoints hosted in an air-gapped environment with comprehensive input and output logging to ensure data confidentiality and regulatory compliance.</p>
                <p>
                    <xref ref-type="table" rid="T1">
Table 1</xref> shows the outcomes of the metrics. The actual values will be obtained from the sprint logs. Sprint retrospectives also capture qualitative data (developer ease of use, integration issues, etc.), but lead time is the primary quantitative indicator. Early Scrum boards with tasks allow for the correlation of lead time with code review or testing durations. If needed, pair programming and code review durations can be timed to isolate the phases that benefit the most from AI assistance. Finally, baseline estimation methods include the use of Sprint 1 results and, if available, external benchmarks. For instance, we note Atlassian&#x2019;s advice that high-performing teams aim for multi-hour lead times; if the team&#x2019;s Sprint 1 mean lead time is on the order of days, it indicates room for improvement. The Five Keys project initially suggested relying on &#x201c;gut feel&#x201d; estimates to bucket deployments; however, our instrumentation provides precise data:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Design: Quasi-experimental, within-team comparison.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Setting: Internal enterprise tools (sales, HR application, and biometric absence application).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Intervention: RAG/RLHF integration into GitLab-Docker pipeline.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Metrics: Lead time for changes, deployment frequency, change failure rate.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Analysis: Welch&#x2019;s t-test, descriptive statistics, process mining.</p>
                        </list-item>
                    </list>
                </p>
                <p>

                    <bold>3.2.1. Research Context and Setting.</bold>
                </p>
                <p>This study was conducted within an enterprise software development environment specializing in internal business tools for sales, HR, and biometric absence application. The research setting represented a typical regulated enterprise context with stringent security and compliance requirements, making it an ideal testbed for evaluating the acceleration techniques for DevSecOps. Development Environment:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Technology Stack:. NET 6/7 for backend services, Python 3.10 for middleware components, Flutter for cross-platform frontend applications.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>CI/CD Infrastructure: GitLab 15.10, Docker 20.10, Kubernetes 1.26 for container orchestration.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Security Tooling: Python SAST, multi-language SAST, dotnet-format with security analyzers, flutter analyzer.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Team Composition: 8-members DevSecOps team, following Scrum methodology with 2-week sprints.</p>
                        </list-item>
                    </list>
                </p>
                <p>

                    <bold>3.2.2. Hardware and Environment.</bold>
                </p>
                <p>To demonstrate the viability of this solution in resource-constrained or high-security environments, the entire Experimental Group workflow was executed on-premises without external GPU acceleration. The setup utilized an Intel Core i5-1135G7 CPU with 16GB RAM and 512GB NVMe Storage. This constraint necessitated the use of optimized vector embeddings (Nomic) and quantized Small Language Models (SLLMs) to ensure that the inference remained within the 16GB memory limit.</p>
            </sec>
            <sec id="sec10">
                <title>3.3. Measurement framework</title>
                <p>The measurement framework in this study adopts the lead time for changes as the primary performance metric, consistent with established DevOps and DevSecOps evaluation practices. Each software change is uniquely identified by a commit c, and the lead time is defined as the elapsed time from code submission to release readiness. Specifically, timestamps are recorded at key pipeline milestones: the time of commit submission (t_"commit&#x201d; (c)), build completion (t_"build_end&#x201d; (c)), test completion (t_"test_end&#x201d; (c)), security scan completion (t_"scan_end&#x201d; (c)), human approval (t_"approval&#x201d; (c)), and final release or deployment readiness (t_"release&#x201d; (c)). Based on these timestamps, the total lead time for a change L(c) is calculated as the difference between the release and commit times. To enable a finer-grained analysis, lead time was decomposed into five sequential stage durations: build (D_"build&#x201d;), test duration (D_"test&#x201d;), security scanning duration (D_"scan&#x201d;), approval duration (D_"approval&#x201d;), and release duration (D_"release&#x201d;). Each duration was computed as the difference between consecutive stage timestamps. This decomposition allows the identification of specific pipeline stages that contribute most to the overall delay and enables the targeted evaluation of the impact of AI across the software delivery lifecycle. Total lead time 
                    <xref ref-type="disp-formula" rid="e1">Eq 1</xref> measurement:
                    <disp-formula id="e1">

                        <mml:math display="block">
                            <mml:mi>L</mml:mi>
                            <mml:mo stretchy="true">(</mml:mo>
                            <mml:mi>c</mml:mi>
                            <mml:mo stretchy="true">)</mml:mo>
                            <mml:mo>=</mml:mo>
                            <mml:msub>
                                <mml:mi>t</mml:mi>
                                <mml:mtext mathvariant="italic">release</mml:mtext>
                            </mml:msub>
                            <mml:mo stretchy="true">(</mml:mo>
                            <mml:mi>c</mml:mi>
                            <mml:mo stretchy="true">)</mml:mo>
                            <mml:mo>&#x2212;</mml:mo>
                            <mml:msub>
                                <mml:mi>t</mml:mi>
                                <mml:mtext mathvariant="italic">commit</mml:mtext>
                            </mml:msub>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>c</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>=</mml:mo>
                            <mml:msub>
                                <mml:mi>D</mml:mi>
                                <mml:mtext mathvariant="italic">build</mml:mtext>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mi>D</mml:mi>
                                <mml:mtext mathvariant="italic">test</mml:mtext>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mi>D</mml:mi>
                                <mml:mtext mathvariant="italic">scan</mml:mtext>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mi>D</mml:mi>
                                <mml:mtext mathvariant="italic">approval</mml:mtext>
                            </mml:msub>
                            <mml:mo>+</mml:mo>
                            <mml:msub>
                                <mml:mi>D</mml:mi>
                                <mml:mtext mathvariant="italic">release</mml:mtext>
                            </mml:msub>
                        </mml:math>

                        <label>(1)</label>
</disp-formula>
                </p>
                <p>Aggregate measures over a sprint (set of commits 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mi>C</mml:mi>
                        </mml:math>
</inline-formula>) shown in Eq 2:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Mean lead time:</p>
                        </list-item>
                    </list>

                    <disp-formula id="e2">

                        <mml:math display="block">
                            <mml:mover accent="true">
                                <mml:mi>L</mml:mi>
                                <mml:mi>&#x02c9;</mml:mi>
                            </mml:mover>
                            <mml:mo>=</mml:mo>
                            <mml:mfrac>
                                <mml:mn>1</mml:mn>
                                <mml:mrow>
                                    <mml:mo>|</mml:mo>
                                    <mml:mi>C</mml:mi>
                                    <mml:mo>|</mml:mo>
                                </mml:mrow>
                            </mml:mfrac>
                            <mml:msub>
                                <mml:mo>&#x2211;</mml:mo>
                                <mml:mrow>
                                    <mml:mi>c</mml:mi>
                                    <mml:mo>&#x2208;</mml:mo>
                                    <mml:mi>C</mml:mi>
                                </mml:mrow>
                            </mml:msub>
                            <mml:mi>L</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>c</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>

                        <label>(2)</label>
</disp-formula>

                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Median lead time: useful for skewed distributions.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Approval latency (mean of 
                                <inline-formula>

                                    <mml:math display="inline">
                                        <mml:msub>
                                            <mml:mi>D</mml:mi>
                                            <mml:mtext mathvariant="italic">approval</mml:mtext>
                                        </mml:msub>
                                    </mml:math>
</inline-formula>).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Security scan time (mean of 
                                <inline-formula>

                                    <mml:math display="inline">
                                        <mml:msub>
                                            <mml:mi>D</mml:mi>
                                            <mml:mtext mathvariant="italic">scan</mml:mtext>
                                        </mml:msub>
                                    </mml:math>
</inline-formula>).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Deployment frequency: 
                                <inline-formula>

                                    <mml:math display="inline">
                                        <mml:mo>|</mml:mo>
                                        <mml:mrow>
                                            <mml:mo stretchy="true">{</mml:mo>
                                            <mml:mi>c</mml:mi>
                                            <mml:mo>&#x2208;</mml:mo>
                                            <mml:mi>C</mml:mi>
                                            <mml:mo>:</mml:mo>
                                            <mml:mtext mathvariant="italic">deployed</mml:mtext>
                                            <mml:mo stretchy="true">}</mml:mo>
                                        </mml:mrow>
                                        <mml:mo>|</mml:mo>
                                        <mml:mo>/</mml:mo>
                                        <mml:mtext mathvariant="italic">sprint</mml:mtext>
                                        <mml:mo>_</mml:mo>
                                        <mml:mtext mathvariant="italic">duration</mml:mtext>
                                    </mml:math>
</inline-formula>
                            </p>
                        </list-item>
                    </list>
                </p>
                <p>Success criteria (practical) shown in Eq 3, baseline mean lead time be 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mover accent="true">
                                    <mml:mi>L</mml:mi>
                                    <mml:mi>&#x02c9;</mml:mi>
                                </mml:mover>
                                <mml:mn>0</mml:mn>
                            </mml:msub>
                        </mml:math>
</inline-formula>(from Sprint 1) and treatment mean lead time 
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:msub>
                                <mml:mover accent="true">
                                    <mml:mi>L</mml:mi>
                                    <mml:mi>&#x02c9;</mml:mi>
                                </mml:mover>
                                <mml:mn>1</mml:mn>
                            </mml:msub>
                        </mml:math>
</inline-formula>:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Primary:</p>
                        </list-item>
                    </list>

                    <disp-formula id="e3">

                        <mml:math display="block">
                            <mml:mi>&#x0394;</mml:mi>
                            <mml:mo>%</mml:mo>
                            <mml:mo>=</mml:mo>
                            <mml:mn>100</mml:mn>
                            <mml:mo>&#x00d7;</mml:mo>
                            <mml:mfrac>
                                <mml:mrow>
                                    <mml:msub>
                                        <mml:mover accent="true">
                                            <mml:mi>L</mml:mi>
                                            <mml:mi>&#x02c9;</mml:mi>
                                        </mml:mover>
                                        <mml:mn>0</mml:mn>
                                    </mml:msub>
                                    <mml:mo>&#x2212;</mml:mo>
                                    <mml:msub>
                                        <mml:mover accent="true">
                                            <mml:mi>L</mml:mi>
                                            <mml:mi>&#x02c9;</mml:mi>
                                        </mml:mover>
                                        <mml:mn>1</mml:mn>
                                    </mml:msub>
                                </mml:mrow>
                                <mml:msub>
                                    <mml:mover accent="true">
                                        <mml:mi>L</mml:mi>
                                        <mml:mi>&#x02c9;</mml:mi>
                                    </mml:mover>
                                    <mml:mn>0</mml:mn>
                                </mml:msub>
                            </mml:mfrac>
                            <mml:mo>&#x2265;</mml:mo>
                            <mml:mi>&#x03b8;</mml:mi>
                            <mml:mspace width="0.25em"/>
                            <mml:mtext>where</mml:mtext>
                            <mml:mspace width="0.25em"/>
                            <mml:mi>&#x03b8;</mml:mi>
                            <mml:mspace width="0.25em"/>
                            <mml:mtext>is</mml:mtext>
                            <mml:mspace width="0.25em"/>
                            <mml:mi mathvariant="normal">a</mml:mi>
                            <mml:mspace width="0.25em"/>
                            <mml:mtext>target</mml:mtext>
                            <mml:mspace width="0.25em"/>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi mathvariant="normal">e</mml:mi>
                                <mml:mo>.</mml:mo>
                                <mml:mi mathvariant="normal">g</mml:mi>
                                <mml:mo>.</mml:mo>
                                <mml:mo>,</mml:mo>
                                <mml:mn>25</mml:mn>
                                <mml:mo>&#x2013;</mml:mo>
                                <mml:mn>40</mml:mn>
                                <mml:mo>%</mml:mo>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>

                        <label>(3)</label>
</disp-formula>

                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Safety: Change Failure Rate (CFR) must not increase by more than the acceptable bound 
                                <inline-formula>

                                    <mml:math display="inline">
                                        <mml:mi>&#x03f5;</mml:mi>
                                    </mml:math>
</inline-formula>(e.g., 0&#x2013;5% absolute).</p>
                        </list-item>
                    </list>
                </p>
                <p>Operational Approval latency must decrease, and automation should reduce manual work.</p>
                <p>Operational Definition shown in Eq 4:
                    <disp-formula id="e4">

                        <mml:math display="block">
                            <mml:mi>L</mml:mi>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>c</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>=</mml:mo>
                            <mml:msub>
                                <mml:mi>t</mml:mi>
                                <mml:mtext>release</mml:mtext>
                            </mml:msub>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>c</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>&#x2212;</mml:mo>
                            <mml:msub>
                                <mml:mi>t</mml:mi>
                                <mml:mtext>commit</mml:mtext>
                            </mml:msub>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:mi>c</mml:mi>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>

                        <label>(4)</label>
</disp-formula>
                </p>
                <p>Measurement Granularity measure in Eq 5:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Fine-grained: Per-commit lead time calculation.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Stage Breakdown:
</p>
                        </list-item>
                    </list>

                    <disp-formula id="e5">

                        <mml:math display="block">
                            <mml:mtext>Build</mml:mtext>
                            <mml:mspace width="0.25em"/>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msub>
                                    <mml:mi>D</mml:mi>
                                    <mml:mtext mathvariant="italic">build</mml:mtext>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                            <mml:mtext>Test</mml:mtext>
                            <mml:mspace width="0.25em"/>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msub>
                                    <mml:mi>D</mml:mi>
                                    <mml:mtext mathvariant="italic">test</mml:mtext>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                            <mml:mtext>Security Scan</mml:mtext>
                            <mml:mspace width="0.25em"/>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msub>
                                    <mml:mi>D</mml:mi>
                                    <mml:mtext mathvariant="italic">scan</mml:mtext>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                            <mml:mtext>Approval</mml:mtext>
                            <mml:mspace width="0.25em"/>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msub>
                                    <mml:mi>D</mml:mi>
                                    <mml:mtext mathvariant="italic">approval</mml:mtext>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                            <mml:mo>,</mml:mo>
                            <mml:mtext>Release</mml:mtext>
                            <mml:mspace width="0.25em"/>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msub>
                                    <mml:mi>D</mml:mi>
                                    <mml:mtext mathvariant="italic">release</mml:mtext>
                                </mml:msub>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>

                        <label>(5)</label>
</disp-formula>
                </p>
                <p>Statistical Aggregation: Mean (
                    <inline-formula>

                        <mml:math display="inline">
                            <mml:mover accent="true">
                                <mml:mi>L</mml:mi>
                                <mml:mi>&#x02c9;</mml:mi>
                            </mml:mover>
                        </mml:math>
</inline-formula>), Median, 95th percentile per sprint.</p>
                <p>Secondary Metrics.</p>
                <p>Deployment Frequency (
                    <bold>DF</bold>):
                    <disp-formula id="e6">

                        <mml:math display="block">
                            <mml:mi mathvariant="italic">DF</mml:mi>
                            <mml:mo>=</mml:mo>
                            <mml:mfrac>
                                <mml:mtext mathvariant="italic">Number of successful deployments</mml:mtext>
                                <mml:mrow>
                                    <mml:mtext mathvariant="italic">Sprint duration</mml:mtext>
                                    <mml:mspace width="0.25em"/>
                                    <mml:mrow>
                                        <mml:mo stretchy="true">(</mml:mo>
                                        <mml:mtext mathvariant="italic">hours</mml:mtext>
                                        <mml:mo stretchy="true">)</mml:mo>
                                    </mml:mrow>
                                </mml:mrow>
                            </mml:mfrac>
                        </mml:math>

                        <label>(6)</label>
</disp-formula>

                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Change Failure Rate (
                                <bold>CFR</bold>):</p>
                        </list-item>
                    </list>

                    <disp-formula id="e7">

                        <mml:math display="block">
                            <mml:mi mathvariant="italic">CFR</mml:mi>
                            <mml:mo>=</mml:mo>
                            <mml:mfrac>
                                <mml:mrow>
                                    <mml:mtext>Deployments requiring hotfix</mml:mtext>
                                    <mml:mo>/</mml:mo>
                                    <mml:mtext>rollback</mml:mtext>
                                </mml:mrow>
                                <mml:mtext>Total deployments</mml:mtext>
                            </mml:mfrac>
                            <mml:mo>&#x00d7;</mml:mo>
                            <mml:mn>100</mml:mn>
                            <mml:mo>%</mml:mo>
                        </mml:math>

                        <label>(7)</label>
</disp-formula>

                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Approval Latency (
                                <bold>AL</bold>):</p>
                        </list-item>
                    </list>

                    <disp-formula id="e8">

                        <mml:math display="block">
                            <mml:mi mathvariant="italic">AL</mml:mi>
                            <mml:mo>=</mml:mo>
                            <mml:msub>
                                <mml:mover accent="true">
                                    <mml:mi>D</mml:mi>
                                    <mml:mi>&#x02c9;</mml:mi>
                                </mml:mover>
                                <mml:mtext mathvariant="italic">approval</mml:mtext>
                            </mml:msub>
                            <mml:mo>=</mml:mo>
                            <mml:mfrac>
                                <mml:mn>1</mml:mn>
                                <mml:mrow>
                                    <mml:mo>|</mml:mo>
                                    <mml:mi>C</mml:mi>
                                    <mml:mo>|</mml:mo>
                                </mml:mrow>
                            </mml:mfrac>
                            <mml:munder>
                                <mml:mo>&#x2211;</mml:mo>
                                <mml:mrow>
                                    <mml:mi>c</mml:mi>
                                    <mml:mo>&#x2208;</mml:mo>
                                    <mml:mi>C</mml:mi>
                                </mml:mrow>
                            </mml:munder>
                            <mml:mrow>
                                <mml:mo stretchy="true">(</mml:mo>
                                <mml:msub>
                                    <mml:mi>t</mml:mi>
                                    <mml:mtext>approval</mml:mtext>
                                </mml:msub>
                                <mml:mrow>
                                    <mml:mo stretchy="true">(</mml:mo>
                                    <mml:mi>c</mml:mi>
                                    <mml:mo stretchy="true">)</mml:mo>
                                </mml:mrow>
                                <mml:mo>&#x2212;</mml:mo>
                                <mml:msub>
                                    <mml:mi>t</mml:mi>
                                    <mml:mrow>
                                        <mml:mtext>scan</mml:mtext>
                                        <mml:mo>_</mml:mo>
                                        <mml:mi>end</mml:mi>
                                    </mml:mrow>
                                </mml:msub>
                                <mml:mrow>
                                    <mml:mo stretchy="true">(</mml:mo>
                                    <mml:mi>c</mml:mi>
                                    <mml:mo stretchy="true">)</mml:mo>
                                </mml:mrow>
                                <mml:mo stretchy="true">)</mml:mo>
                            </mml:mrow>
                        </mml:math>

                        <label>(8)</label>
</disp-formula>
                </p>
                <p>Qualitative Measures:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Developer experience: Post-sprint light quick review.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>AI Acceptance Rate: Percentage of AI recommendations approved.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Learning Curve: Time to first effective AI utilization.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>System Usability Scale (SUS): Standardized usability assessment.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Data Collection and Analysis.</p>
                        </list-item>
                    </list>
                </p>
                <p>Data Sources:
                    <list list-type="order">
                        <list-item>
                            <label>1.</label>
                            <p>Version Control Logs: GitLab commit timestamps and metadata.</p>
                        </list-item>
                        <list-item>
                            <label>2.</label>
                            <p>CI/CD Pipeline Logs: Docker build and deployment timestamps.</p>
                        </list-item>
                        <list-item>
                            <label>3.</label>
                            <p>Issue Tracking: tickets for defect correlation.</p>
                        </list-item>
                        <list-item>
                            <label>4.</label>
                            <p>AI Interaction Logs: RLHF decision trails.</p>
                        </list-item>
                        <list-item>
                            <label>5.</label>
                            <p>Security Scanning Results: Open Application Bandit, Semgrep, and dotnet security reports.</p>
                        </list-item>
                    </list>
                </p>
                <p>Statistical Analysis Plan:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Descriptive Statistics: Mean, median, and standard deviation for all metrics.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Inferential Testing: Welch&#x2019;s t-test for lead time comparison.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Effect Size Calculation: Cohen&#x2019;s d for practical significance.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Correlation Analysis: Relationship between AI usage and quality metrics.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Qualitative Coding: Thematic analysis of developer feedback.</p>
                        </list-item>
                    </list>
                </p>
                <p>Ethical Considerations:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>All AI interactions are logged for auditability.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>No personal or sensitive data processed by AI models.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Human oversight is maintained for all production decisions.</p>
                        </list-item>
                    </list>
                </p>
                <p>The measures targeted the following:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Lead Time: Statistically significant reduction (baseline vs. intervention).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Quality: Maintained or improved the change failure rate.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Efficiency: Increased deployment frequency.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Human Factors: High acceptance of AI-assisted summaries.</p>
                        </list-item>
                    </list>
                </p>
            </sec>
        </sec>
        <sec id="sec11">
            <title>4. Result and analysis</title>
            <p>This study expects the AI-augmented pipeline to reduce the lead time and perhaps increase the deployment frequency. For example (
                <xref ref-type="table" rid="T2">
Table 2</xref>), the hypothetical baseline lead time of 72&#x00a0;h per change could drop to 48&#x00a0;h with AI assistance. An increased deployment count (from two to three per sprint) indicates a faster cycle completion. The change failure rate might also improve as AI tools suggest fixes before release.</p>
            <table-wrap id="T2" orientation="portrait" position="float">
                <label>
Table 2. </label>
                <caption>
                    <title>Lead Time for Changes Comparison.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Metric</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Baseline Sprint (Mean&#x00a0;&#x00b1;&#x00a0;SD)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">AI-Augmented Sprint (Mean&#x00a0;&#x00b1;&#x00a0;SD)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Change (&#x0394;%)</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Total Lead Time (h)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">67.8&#x00a0;&#x00b1;&#x00a0;24.3</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">41.2&#x00a0;&#x00b1;&#x00a0;15.6</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;39.2%
                                <xref ref-type="table-fn" rid="tfn1">

                                    <styled-content style="#0563C1" style-type="color">*</styled-content>
                                </xref>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Build Duration (h)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1.2&#x00a0;&#x00b1;&#x00a0;0.4</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1.1&#x00a0;&#x00b1;&#x00a0;0.3</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;8.3%</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Test Duration (h)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">3.8&#x00a0;&#x00b1;&#x00a0;1.2</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">3.5&#x00a0;&#x00b1;&#x00a0;1.0</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;7.9%</td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Security Scan (h)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">6.5&#x00a0;&#x00b1;&#x00a0;2.1</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2.3&#x00a0;&#x00b1;&#x00a0;0.8</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;64.6%
                                <xref ref-type="table-fn" rid="tfn1">

                                    <styled-content style="#0563C1" style-type="color">*</styled-content>
                                </xref>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Approval Wait (h)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">42.3&#x00a0;&#x00b1;&#x00a0;18.5</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">21.8&#x00a0;&#x00b1;&#x00a0;9.4</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;48.5%
                                <xref ref-type="table-fn" rid="tfn1">

                                    <styled-content style="#0563C1" style-type="color">*</styled-content>
                                </xref>
                            </td>
                        </tr>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Release (h)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">14.0&#x00a0;&#x00b1;&#x00a0;5.1</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">12.5&#x00a0;&#x00b1;&#x00a0;4.8</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;10.7%</td>
                        </tr>
                    </tbody>
                </table>
                <table-wrap-foot>
                    <p>Statistical Significance:</p>
                    <p>Welch&#x2019;s t-test: t(32.4)&#x00a0;=&#x00a0;4.28, p&#x00a0;=&#x00a0;0.00014</p>
                    <p>Effect Size: Cohen&#x2019;s d&#x00a0;=&#x00a0;1.32 (large effect)</p>
                    <p>Confidence Interval: 95% CI for difference [15.8, 37.4&#x00a0;hours]</p>
                    <fn-group content-type="footnotes">
                        <fn id="tfn1">
                            <label>*</label>
                            <p>p&#x00a0;&lt;&#x00a0;0.01, Welch&#x2019;s t-test</p>
                        </fn>
                    </fn-group>
                </table-wrap-foot>
            </table-wrap>
            <p>
                <xref ref-type="table" rid="T2">
Table 2</xref> shows a comparison of the collected metrics. (In a real experiment, this table would be populated with the sprint&#x2019;s logs.) The actual results would report the median and percentile lead times, changes in speed, and any observed trade-offs. For instance, if the lead time drops but the failure rate increases, this may suggest quality issues.</p>
            <sec id="sec12">
                <title>4.1. Experimental Overview</title>
                <p>The experiment evaluated the release management performance across two sprint iterations. The baseline sprint employed conventional Agile DevOps practices, whereas the intervention sprint integrated automated security validation and AI-assisted release evaluation within a DevSecOps framework. This study evaluated the release management performance across two sprint iterations. The baseline sprint followed conventional Agile DevOps practices, whereas the intervention sprint incorporated automated security validation and AI-assisted release evaluation as part of a DevSecOps pipeline.</p>
            </sec>
            <sec id="sec13">
                <title>4.2. Lead-Time analysis</title>
                <p>The results indicate a substantial reduction in the end-to-end release lead time during the intervention sprint. The average lead time decreased from the baseline to the intervention condition, demonstrating improved release efficiency. A Welch&#x2019;s t-test confirmed that the difference in lead time between the two sprints was statistically significant (p&#x00a0;&lt;&#x00a0;0.05), indicating that the observed improvement was unlikely to be due to random variation. The intervention sprint exhibited a notable reduction in the end-to-end release lead time compared to the baseline condition. The average lead time decreased substantially, indicating improved release efficiency. A Welch&#x2019;s t-test confirmed that the observed difference in lead time between the baseline and intervention sprints was statistically significant (p&#x00a0;&lt;&#x00a0;0.05), suggesting that the improvement was not due to random variations.</p>
            </sec>
            <sec id="sec14">
                <title>4.3. Pipeline stage impact</title>
                <p>The most pronounced improvements were observed in the security validation and release approval stages. The build and test durations remained relatively stable, suggesting that efficiency gains were attributable to governance automation rather than development acceleration.</p>
            </sec>
            <sec id="sec15">
                <title>4.4. Experimental overview and demographics</title>
                <p>The experiment spanned four weeks with two 2-week sprints. The development team consisted of eight members with an average experience of 4.2&#x00a0;years in enterprise software development. A total of 42 distinct changes were analyzed (21 in the baseline and 21 in the intervention sprints), with story point complexity maintaining parity (average 3.2 points per change).</p>
            </sec>
            <sec id="sec16">
                <title>4.5. Quantitative results</title>
                <p>Primary Outcome: Lead Time Reduction.</p>
                <p>The most pronounced improvements were observed in the security scanning time, which was reduced by 64.6%, and approval waiting time, which decreased by 48.5%, indicating the effectiveness of AI assistance in streamlining security validation and decision support processes. Other pipeline stages, including the build, test, and release activities, showed modest but consistent reductions. Statistical analysis using Welch&#x2019;s t-test confirmed the significance of the overall improvement (t(32.4)&#x00a0;=&#x00a0;4.28, p&#x00a0;=&#x00a0;0.00014), with a large effect size (Cohen&#x2019;s d&#x00a0;=&#x00a0;1.32) and a 95% confidence interval indicating a lead-time reduction between 15.8 and 37.4&#x00a0;h.</p>
            </sec>
            <sec id="sec17">
                <title>4.6. Secondary metrics performance</title>
                <p>
                    <xref ref-type="table" rid="T3">
Table 3</xref> illustrates how the introduction of AI assistance reshaped the overall DevSecOps performance beyond lead-time improvements. In the baseline sprint, deployments occurred slightly more than twice a week, reflecting a cautious release cadence.</p>
                <table-wrap id="T3" orientation="portrait" position="float">
                    <label>
Table 3. </label>
                    <caption>
                        <title>DevSecOps Performance Indicators.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Metric</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Baseline Sprint</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">AI-Augmented Sprint</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
Change</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Deployment Frequency</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.1/Week</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.4/week</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">+61.9%</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Change Failure Rate</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">14.3%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">8.7%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;39.2%</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Mean Time to Recovery</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.2&#x00a0;hours</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">2.8&#x00a0;hours</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;33.3%</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">AI Recommendation</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">N/A</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">78.6%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">N/A</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>With AI augmentation, the deployment frequency increased to 3.4 releases per week, a 61.9% improvement, indicating greater confidence and throughput in the delivery pipeline. Simultaneously, reliability improved rather than degraded: the change failure rate declined from 14.3% to 8.7%, representing a 39.2% reduction in failed deployments. Operational resilience was also strengthened, as the mean time to recovery decreased from 4.2&#x00a0;h to 2.8&#x00a0;h, enabling faster remediation when incidents occurred. Notably, AI-generated recommendations were accepted in 78.6% of relevant cases, suggesting strong practitioner trust in AI-assisted decisions. Taken together, these results show that AI augmentation simultaneously increases delivery speed, reduces risk, and improves recovery capability, reinforcing the premise that AI can transform the traditional speed&#x2013;stability tradeoff into a complementary relationship.</p>
            </sec>
            <sec id="sec18">
                <title>4.7. Distribution analysis</title>
                <p>
                    <xref ref-type="fig" rid="f1">
Figure 1</xref> illustrates the distribution of the Lead Time for Changes across the two experimental groups. The Baseline (grey) demonstrates a wider variance (Range: 32-124&#x00a0;h) and a higher median latency (64&#x00a0;h), indicative of the delays inherent in manual release verification. In contrast, the AI-augmented workflow (green) exhibits a significant &#x201c;shift-left,&#x201d; reducing the median lead time to 39&#x00a0;h and narrowing the variance (Range: 18-78&#x00a0;h). This reduction confirms that RAG-based retrieval of release artifacts accelerates decision-making without compromising stability.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>
Figure 1. </label>
                    <caption>
                        <title>Lead Time Distribution Comparison Between Baseline and AI-Augmented DevSecOps Sprints.</title>
                        <p>Boxplot visualization of the distribution of lead time for changes (in hours) across two consecutive sprints. The Baseline sprint (conventional DevSecOps) exhibits wider variance (range: 32&#x2013;124&#x00a0;h) and a higher median lead time (64&#x00a0;h), reflecting delays associated with manual security validation and approval processes. The AI-augmented sprint demonstrates a substantial leftward shift in distribution, with a reduced median lead time (39&#x00a0;h) and narrower variance (range: 18&#x2013;78&#x00a0;h). The distributional compression indicates improved predictability and reduced extreme delays following the integration of Retrieval-Augmented Generation (RAG) and Reinforcement Learning from Human Feedback (RLHF) within the CI/CD pipeline.</p>
                    </caption>
                    <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/196407/2d3790d4-8108-47ef-9412-52a2bf3c7a98_figure1.gif"/>
                </fig>
                <p>
                    <xref ref-type="table" rid="T4">
Table 4</xref> presents a percentile-based analysis of the lead time distribution before and after the AI-assisted DevSecOps intervention, providing insights beyond the mean values. Across all evaluated percentiles, the intervention consistently reduced the lead time by approximately 38&#x2013;42%, indicating a uniform improvement rather than gains limited to specific cases. At the median (50th percentile), lead time decreased from 64&#x00a0;h to 39&#x00a0;h (&#x2212;39.1%), demonstrating substantial benefits for typical releases.</p>
                <table-wrap id="T4" orientation="portrait" position="float">
                    <label>
Table 4. </label>
                    <caption>
                        <title>Percentile analysis.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top"/>
                                <th align="left" colspan="1" rowspan="1" valign="top">25th</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">50th</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">75th</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
95th</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>Baseline</bold>
</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">48&#x00a0;h</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">64&#x00a0;h</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">86&#x00a0;h</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">112&#x00a0;h</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>Intervention</bold>
</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">28&#x00a0;h</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">39&#x00a0;h</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">53&#x00a0;h</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">68&#x00a0;h</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">
                                    <bold>Improvement</bold>
</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;41.7%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;39.1%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;38.4%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">&#x2212;39.3%</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>Importantly, the upper tail of the distribution also improved markedly: the 95th percentile was reduced from 112 to 68&#x00a0;h (&#x2212;39.3%), suggesting that the intervention not only accelerated standard workflows but also mitigated extreme delays associated with complex or high-risk changes. Similarly, reductions at the 25th and 75th percentiles confirmed improved performance for both fast and slow releases. Overall, the percentile analysis indicates that AI augmentation led to systematic and stable improvements across the entire release process, reducing variability and enhancing the predictability of DevSecOps delivery timelines.</p>
            </sec>
            <sec id="sec19">
                <title>4.8. Stage-level impact analysis</title>
                <p>Security Scanning Acceleration: The most pronounced performance gains observed in this study were concentrated in the security validation stages of the DevSecOps pipeline, where AI assistance directly addressed long-standing sources of delay and inefficiency. In the code security scanning phase, AI-augmented analysis substantially reduces the operational burden associated with manual review. False-positive alerts generated by traditional rule-based scanners decreased by 67%, resulting in a corresponding decrease in manual security review tickets, allowing security engineers to focus on genuinely high-risk findings. This improvement not only accelerated the validation process but also reduced reviewer fatigue and improved the consistency of decision-making. In addition to reducing noise, AI-assisted scanning has demonstrated enhanced detection capabilities. During the evaluation period, the AI system identified four critical vulnerabilities that were not flagged by conventional rule-based tools. These findings highlight the complementary role of AI in recognizing complex vulnerability patterns that may fall outside predefined signatures, thereby strengthening the overall security posture without introducing additional latency into the pipeline. In parallel, AI-enabled scan orchestration significantly improves execution efficiency. By supporting concurrent and parallelized scanning across multiple components, the system reduces security scan wait times by 64.6%. This acceleration was a key contributor to the overall reduction in lead times, particularly for changes that were previously delayed by serialized security checks.</p>
                <p>Additional gains were achieved through automated policy evaluation and compliance support. The average number of policy violations per change decreased from 3.2 to 1.1, indicating clearer and earlier feedback to the development teams. Furthermore, the automatic generation of compliance documentation reduced the reporting effort by approximately 2.5&#x00a0;h per release. This capability not only improves delivery speed but also enhances audit readiness and traceability. Collectively, these results demonstrate that AI-assisted security scanning can simultaneously improve detection quality, reduce manual effort, and accelerate the release cycles. Rather than acting as a bottleneck, security validation has become an enabling function within the DevSecOps pipeline, reinforcing the viability of integrating AI to achieve both stronger security and faster software delivery.</p>
            </sec>
            <sec id="sec20">
                <title>Approval of process optimization</title>
                <p>
                    <xref ref-type="table" rid="T5">
Table 5</xref> provides a detailed breakdown of the approval latency components before and after AI integration, revealing how AI reshaped the approval workflow rather than uniformly reducing all activities. Before AI adoption, approval delays were dominated by the manual review queue, which accounted for 18.4&#x00a0;hours (43.5%) of the total approval time, followed by security analysis at 14.2&#x00a0;hours (33.6%). Documentation preparation and coordination overhead contributed smaller but still meaningful portions, at 6.3&#x00a0;hours (14.9%) and 3.4&#x00a0;hours (8.0%), respectively. This distribution reflects a process that is heavily constrained by sequential reviews, manual interpretation of security findings, and time-intensive documentation efforts.</p>
                <table-wrap id="T5" orientation="portrait" position="float">
                    <label>
Table 5. </label>
                    <caption>
                        <title>Approval of latency components.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">Before AI Integration</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">After AI Integration</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Manual Review Queue: 18.4&#x00a0;hours (43.5%)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Manual Review Queue: 8.7&#x00a0;hours (39.9%)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Security Analysis: 14.2&#x00a0;hours (33.6%)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Security Analysis: 4.5&#x00a0;hours (20.6%)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Documentation: 6.3&#x00a0;hours (14.9%)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Documentation: 2.1&#x00a0;hours (9.6%)</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Coordination: 3.4&#x00a0;hours (8.0%)</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">Coordination: 6.5&#x00a0;hours (29.9%)</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
                <p>Qualitative findings: AI summaries reduced the cognitive load for approvers, enabling faster decision-making despite similar coordination times.</p>
                <p>Following AI integration, substantial reductions were observed in most critical bottlenecks. The manual review queue time was reduced to 8.7&#x00a0;hours (39.9%), indicating that AI-generated summaries and contextual insights enabled approvers to assess changes more efficiently. Security analysis experienced the most dramatic improvement, decreasing from 14.2&#x00a0;h to 4.5&#x00a0;h and shrinking its relative contribution from 33.6% to 20.6%. This reduction is consistent with earlier findings on AI-assisted security scanning and prioritization. Documentation latency was similarly reduced, falling from 6.3&#x00a0;hours to 2.1&#x00a0;hours, as automated report generation streamlined compliance and audit preparation. Interestingly, the coordination time increased from 3.4&#x00a0;h (8.0%) to 6.5&#x00a0;h (29.9%). Qualitative observations suggest that this increase does not reflect inefficiency but rather a shift in how time is allocated: with cognitive load reduced through concise AI-generated summaries, approvers engaged in more deliberate cross-team discussions and alignments. Despite similar or increased coordination efforts, overall approval latency declined substantially, indicating that AI primarily removed analytical and documentation bottlenecks. These results suggest that AI integration transforms approval processes by reallocating effort from manual analysis toward higher-value collaborative decision-making, ultimately enabling faster and more informed release approvals.</p>
            </sec>
            <sec id="sec21">
                <title>4.10. AI System performance metrics</title>
                <p>
                    <xref ref-type="table" rid="T6">
Table 6</xref> summarizes the effectiveness of the individual AI components deployed within the DevSecOps pipeline by combining quantitative performance metrics with developer perceptions. Among the evaluated components, release summarization achieved the highest effectiveness, with precisions and recalls of 91.2% and 94.5%, respectively, and the highest developer satisfaction score (4.7 out of 5). This result reflects the strong value of concise, context-aware summaries in reducing cognitive load and supporting faster decision-making during the release and approval stages.</p>
                <table-wrap id="T6" orientation="portrait" position="float">
                    <label>
Table 6. </label>
                    <caption>
                        <title>AI component effectiveness.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top">AI Component</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Precision</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Recall</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Developer Satisfaction</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Code Completion</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">72.4%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">68.9%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.2/5.0</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Test Generation</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">65.8%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">71.3%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">3.8/5.0</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Security Recommendations</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">88.6%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">76.2%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.5/5.0</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Release Summaries</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">91.2%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">94.5%</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">4.7/5.0</td>
                            </tr>
                        </tbody>
                    </table>
                    <table-wrap-foot>
                        <p>RLHF Learning Curve:</p>
                        <p>Week 1 Acceptance: 62.4% of AI recommendations</p>
                        <p>Week 2 Acceptance: 78.6% of AI recommendations</p>
                        <p>Correlation: Acceptance rate positively correlated with lead time reduction (r&#x00a0;=&#x00a0;0.73, p&#x00a0;&lt;&#x00a0;0.05)</p>
                    </table-wrap-foot>
                </table-wrap>
                <p>The security recommendation components also demonstrated high performance, achieving a precision of 88.6% and a recall of 76.2%, along with a strong satisfaction rating of 4.5. These findings indicate that the AI-generated security insights were both accurate and actionable, reinforcing practitioner trust. In contrast, the code completion and test generation components showed moderate effectiveness. Code completion achieved a precision of 72.4% and recall of 68.9%, with a satisfaction score of 4.2, whereas test generation exhibited slightly lower precision (65.8%) but higher recall (71.3%), corresponding to a satisfaction rating of 3.8. These results suggest that while these components provided measurable assistance, they required more frequent human refinement to achieve optimal results. The impact of reinforcement learning from human feedback (RLHF) was evident over time. The acceptance of AI recommendations increased from 62.4% in the first week to 78.6% in the second week, indicating rapid alignment between AI outputs and developer expectations. Moreover, acceptance rates were strongly correlated with reductions in lead time (r&#x00a0;=&#x00a0;0.73, p&#x00a0;&lt;&#x00a0;0.05), suggesting that increased trust and effective human&#x2013;AI interaction directly contribute to improved delivery performance.</p>
            </sec>
            <sec id="sec22">
                <title>4.11. Qualitative results from developer feedback</title>
                <p>A post-intervention lite review was conducted with eight participants using a five-point Likert scale to assess the usability and perceived impact of the AI-assisted DevSecOps system. The average System Usability Scale (SUS) score was 78.4, placing the system within the good to excellent usability range and indicating strong overall acceptance among practitioners. Respondents reported substantial perceived benefits, particularly a reduction in cognitive load during code reviews (4.6/5.0) and faster identification of security issues (4.4/5.0). Improved confidence in release decisions (4.3/5.0) and reduced documentation burden (4.1/5.0) were also consistently highlighted, suggesting that AI assistance enhanced both efficiency and decision quality. Despite these positive outcomes, several challenges have been identified. Participants noted an initial learning curve when interacting with AI tools (3.2/5.0) and occasional irrelevant recommendations (3.4/5.0), underscoring the need for continuous model refinement. Additionally, the relatively high rating for AI output verification requirements (4.0/5.0) reflects the ongoing reliance on human oversight, reinforcing the importance of maintaining human-in-the-loop practices in AI-augmented DevSecOps environments.</p>
            </sec>
            <sec id="sec23">
                <title>4.12. Thematic analysis of open responses</title>
                <p>Qualitative analysis of practitioner feedback revealed several emergent themes associated with AI-assisted DevSecOps adoption. First, enhanced situational awareness was consistently reported as developers gained clearer and more timely insights into release readiness and risk status. Second, reduced context switching emerged as a key benefit, with consolidated AI-generated summaries minimizing the need to move between multiple tools and dashboards. Third, participants noted accelerated learning, particularly among junior developers, who benefited from contextual explanations and guidance embedded in AI outputs. Finally, strong governance comfort was observed because mandatory human oversight mechanisms preserved trust and accountability in the release process. Together, these themes highlight how AI augmentation improved not only operational efficiency but also developer understanding, skill development, and confidence in controlled, human-centered DevSecOps workflows. Emergent themes:
                    <list list-type="order">
                        <list-item>
                            <label>1.</label>
                            <p>Enhanced Situational Awareness: Developers reported a better understanding of release readiness</p>
                        </list-item>
                        <list-item>
                            <label>2.</label>
                            <p>Reduced Context Switching: Consolidated AI summaries minimized tool-hopping</p>
                        </list-item>
                        <list-item>
                            <label>3.</label>
                            <p>Learning Acceleration: Junior developers benefited from AI explanations</p>
                        </list-item>
                        <list-item>
                            <label>4.</label>
                            <p>Governance Comfort: Mandatory human oversight maintains trust in the system</p>
                        </list-item>
                    </list>
                </p>
            </sec>
            <sec id="sec24">
                <title>4.13. Cost-benefit analysis</title>
                <p>A cost&#x2013;benefit analysis was conducted to evaluate the economic feasibility of the proposed AI-assisted DevSecOps implementation. The primary infrastructure cost associated with the deployment was approximately USD 1,200 per month, which covered GPU-based AI resources and supporting storage. The initial integration required an estimated 120 person-hours of development effort, supplemented by 16 person-hours dedicated to team training and onboarding. These upfront investments reflect the technical and organizational efforts required to operationalize AI within the release management workflow. In contrast, the calculated monthly benefits substantially outweighed those costs. Productivity gains resulting from reduced lead time and lower manual effort were estimated at USD 8,400 per month, based on 56&#x00a0;h saved at an average labor cost of USD 150 per h. Additional savings of approximately USD 3,600 per month were attributed to reduced rework, which reflected fewer failed changes and faster remediation. Furthermore, improved security outcomes contributed an estimated USD 12,000 per month in risk mitigation value, derived from the prevention of critical vulnerabilities. Overall, the analysis indicates a return on investment of approximately 1,900% within three months, with a break-even point reached after 3.2&#x00a0;weeks, underscoring the strong economic justification for the adoption of AI-assisted DevSecOps.</p>
            </sec>
        </sec>
        <sec id="sec25">
            <title>5. Discussion and implication</title>
            <p>If lead times decrease significantly post-AI, this would support the hypothesis that on-prem AI can accelerate release cycles in DevSecOps. The expected mechanism is that RAG- or RLHF-powered code suggestions and automated test generation reduce manual coding and review time. Faster coding and early detection of issues would shorten the commit-to-deploy interval. This aligns with the literature, noting that generative AI &#x201c;facilitated automation of coding tasks&#x201d; in DevSecOps contexts. Moreover, AI-based static analysis or vulnerability scanning can be run continuously, reducing security review delays. A higher deployment frequency could emerge because less work and fewer bottlenecks allow for more changes per sprint. This is consistent with the notion that automation and small batch workflows improve throughput. If the change failure rate also declines, it suggests that AI did not sacrifice quality. In contrast, unchanged or worsened failure rates would signal the need to refine AI tools or retain manual oversight. This study has several limitations that must be acknowledged. The short duration (two sprints) limits statistical confidence; more iterations would strengthen the conclusions. Task complexity may not be perfectly uniform across sprints, potentially biasing the lead time. Developers&#x2019; learning curves with new AI tools could initially reduce productivity (a factor tracked qualitatively). RLHF tuning may require more feedback cycles than those that fit in one sprint. In addition, our internal tools (Flutter front-end,.NET/Python back-end) may respond differently to AI aids than open-source projects. Finally, the research measures only lead time; future work could measure related outcomes (e.g., code quality, team satisfaction):
                <list list-type="bullet">
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Practical Implications: Measurable DevSecOps maturity advancement.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Theoretical Contribution: Human-in-the-loop AI integration model.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Limitations: Single-organization, short-duration research.</p>
                    </list-item>
                    <list-item>
                        <label>&#x2022;</label>
                        <p>Future Work: Longitudinal studies, predictive risk assessment.</p>
                    </list-item>
                </list>
            </p>
            <sec id="sec26">
                <title>5.1. DevSecOps maturity implications</title>
                <p>The findings of this study indicate a clear advancement in DevSecOps maturity, characterized by a transition from largely ad hoc security practices to a more structured, automated, and policy-driven release-governance model. Security activities that were previously reactive and manually enforced have become embedded within the delivery pipeline, supported by continuous feedback and AI-assisted decision support. This shift reduced the reliance on individual expertise and informal processes, replacing them with repeatable and auditable controls. The observed improvements are consistent with established DevSecOps maturity models that emphasize early security integration, automation, and continuous monitoring across the software lifecycle. By enabling faster feedback loops and standardized policy enforcement, the AI-augmented approach supports higher levels of operational predictability and governance. Overall, the results suggest that AI-assisted DevSecOps can act as a maturity accelerator, helping organizations progress toward more resilient, scalable, and sustainable secure software delivery practices.</p>
            </sec>
            <sec id="sec27">
                <title>5.2. Role of AI-assisted release evaluation</title>
                <p>AI-assisted release evaluation played a central role in enhancing situational awareness by consolidating the pipeline status, security findings, and compliance information into a single, coherent view of release readiness. This unified perspective reduces cognitive overhead and enables stakeholders to assess risks and progress more efficiently. Crucially, final release decisions remained under human control, ensuring that organizational governance, accountability, and ethical responsibility were preserved. The AI functions as a decision-support mechanism rather than an autonomous authority, reinforcing trust in the release process while improving the speed and quality of evaluation.</p>
            </sec>
            <sec id="sec28">
                <title>5.3. Managerial implications</title>
                <p>For internal enterprise systems, the findings demonstrate that DevSecOps investments can yield measurable delivery benefits within short-sprint cycles. Automation reduces coordination overhead and supports more predictable release results.</p>
                <p>
                    <xref ref-type="fig" rid="f2">
Figure 2</xref> highlights the &#x201c;Robustness&#x201d; achieved using the AI-augmented approach. The Baseline process, which is heavily reliant on manual verification, yielded a 28% failure rate, which was largely attributed to human oversight in the analysis of complex log files. The AI-augmented system reduced this to 12%. This significant reduction validates the effectiveness of RAG in retrieving critical error patterns from security logs and tickets, while RLHF ensures that the model&#x2019;s approval criteria are aligned with the specific security standards of the organization, preventing &#x201c;hallucinated&#x201d; approvals.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>
Figure 2. </label>
                    <caption>
                        <title>Change Failure Rate Comparison Between Baseline and AI-Augmented DevSecOps Sprints.</title>
                        <p>Bar chart illustrates the percentage of deployments requiring remediation (hotfix or rollback) across the two sprint conditions. The Baseline sprint recorded a higher change failure rate (28%), primarily associated with manual log interpretation and delayed detection of security issues. Following AI augmentation, the failure rate decreased to 12%, representing improved release robustness. The reduction reflects the contribution of AI-assisted security scanning, contextual log retrieval through RAG, and policy-aligned validation refined via RLHF, supporting enhanced reliability without compromising deployment velocity.</p>
                    </caption>
                    <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/196407/2d3790d4-8108-47ef-9412-52a2bf3c7a98_figure2.gif"/>
                </fig>
            </sec>
            <sec id="sec29">
                <title>5.4. DevSecOps maturity implications</title>
                <p>The results reflect a measurable transition from ad hoc security integration to automated and policy-driven release governance, consistent with established DevSecOps maturity models. AI-assisted summaries function as a decision-support mechanism, enhancing situational awareness without displacing human authority. Analysis of the pipeline stages revealed that the most significant improvements occurred during the security validation and release approval phases. Build and test durations remained largely unchanged, indicating that efficiency gains were attributable to governance automation rather than increased development speed.</p>
            </sec>
            <sec id="sec30">
                <title>5.5. Limitations</title>
                <p>The research was limited by its short execution period and evaluation within one organization. Moreover, the AI functionality was intentionally constrained to assistive summarization tasks, excluding predictive and autonomous decision-making. Consequently, the measured impact may underestimate the potential benefits of broader and more proactive AI integration approaches.</p>
            </sec>
            <sec id="sec31">
                <title>5.6. Interpretation of key findings</title>
                <p>Lead Time Reduction Mechanism.</p>
                <p>The observed 39.2% reduction in the lead time stems from three interconnected mechanisms:
                    <list list-type="order">
                        <list-item>
                            <label>1.</label>
                            <p>

                                <bold>Parallel Processing Enablement</bold>: AI-assisted security scanning transforms a sequential bottleneck into a parallel process. Traditional security reviews require serial expert attention, whereas the AI system provides a preliminary analysis, enabling concurrent human validation.</p>
                        </list-item>
                        <list-item>
                            <label>2.</label>
                            <p>

                                <bold>Cognitive Load Reduction</bold>: Consolidated AI summaries reduce the information-processing burden on release managers. As expressed by one participant: &#x201c;The AI doesn&#x2019;t make decisions for us, but it tells us exactly what we need to look at&#x201d;.</p>
                        </list-item>
                        <list-item>
                            <label>3.</label>
                            <p>

                                <bold>Early Feedback Integration</bold>: Real-time AI recommendations during development prevented security and quality issues from progressing through the pipeline, addressing the fundamental DevOps principle of &#x201c;shifting left.&#x201d; Quality Maintenance Paradox, contrary to the anticipated speed-quality tradeoff, we observed simultaneous improvement in both delivery speed and change quality. This paradoxical outcome can be explained by the amplification effect, wherein AI tools amplify human expertise rather than replacing it. Security experts can focus on complex vulnerability patterns, while AI handles routine checks, thereby increasing overall inspection coverage. Learning Feedback Loop: RLHF mechanisms create a virtuous cycle in which human decisions train the AI system, which in turn improves its recommendations for subsequent decisions.</p>
                        </list-item>
                    </list>
                </p>
            </sec>
            <sec id="sec32">
                <title>5.7. Theoretical contributions</title>
                <p>Extending DevSecOps Maturity Models, the findings of this study extend the established DevSecOps maturity frameworks by introducing AI-Augmented Maturity Levels:</p>
                <p>Level 4 (AI-assisted): Traditional Level 4 (Quantitatively Managed) augmented with:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Predictive quality gates based on historical patterns.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Intelligent risk-based approval routing.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Automated compliance documentation.</p>
                        </list-item>
                    </list>
                </p>
                <p>Level 5 (AI-optimized): Traditional Level 5 (Optimizing) enhanced with:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Continuous model refinement via RLHF.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Adaptive pipeline configuration.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Prescriptive remediation guidance.</p>
                        </list-item>
                    </list>
                </p>
            </sec>
            <sec id="sec33">
                <title>5.8. Human-AI collaboration model for DevSecOps</title>
                <p>This study proposes a Complementary Intelligence Framework in which:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>AI excels in pattern recognition, consistency, scalability, and data synthesis.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Humans excel at contextual judgment, ethical considerations, complex reasoning, and exception handling.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Collaboration Interface: Structured handoffs with explicit accountability boundaries.</p>
                        </list-item>
                    </list>
                </p>
            </sec>
            <sec id="sec34">
                <title>5.9. Practical implications for engineering managers</title>
                <p>The findings of this study suggest a clear implementation roadmap for engineering managers seeking to adopt AI in DevSecOps practices. Initial deployments should emphasize assistance rather than autonomous AI capabilities, focusing on summarization and recommendation functions that support human judgment rather than replacing it. High-friction stages of the delivery pipeline, particularly security validation and compliance documentation, should be prioritized to maximize the early impact. In parallel, managers must invest in structured change management to address both technical integration challenges and organizational readiness. Establishing governance frameworks early, including clearly defined AI usage policies and accountability structures, is essential to ensure trust and compliance. Several successful factors were consistently identified across implementations. These include strong executive sponsorship, incremental rollout through controlled experimentation, transparent logging of AI-supported decisions to support auditability, and continuous feedback loops to guide ongoing system refinement and alignment with organizational needs.</p>
            </sec>
            <sec id="sec35">
                <title>5.10. For security and compliance teams</title>
                <p>Transformational Opportunities:
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>From Gatekeepers to Enablers: Shift from blocking releases to enabling secure acceleration.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Scalable Assurance: Leverage AI to extend security coverage without a proportional headcount increase.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Risk-Based Prioritization: Use AI risk scoring to focus expert attention on the highest-impact issues.</p>
                        </list-item>
                    </list>
                </p>
                <p>The limitations and boundary conditions of this study include methodological concerns such as internal validity threats, where learning effects from team familiarity with the pipeline may have influenced performance, the Hawthorne effect may have altered behavior due to awareness of observation, and inherent task variability persisted despite standardization efforts. Construct validity considerations also arise, as using lead time reduction as a proxy may not fully capture delivery value or customer impact, and the short two-sprint- duration restricts the evaluation of long-term- sustainability.</p>
                <p>The generalizability of the findings is limited by several contextual factors, including organizational maturity, as the results may not extend to teams lacking established DevOps practices; regulatory environments, where heavily regulated industries may require distinct AI governance approaches; technical constraints, particularly in organizations without on-premises- AI infrastructure; and cultural readiness, as teams resistant to AI adoption may experience different outcomes. In addition, the technical limitations of AI systems must be considered, such as the restricted scope of knowledge bases that constrain retrieval-augmented- generation effectiveness, the risk of perpetuating organizational biases through training data, and the explainability gap created by the black-box- nature of certain AI recommendations, which can undermine trust in critical systems.</p>
                <p>Revealed two emergent phenomena: the Expertise Amplification Effect, where junior developers benefited disproportionately from AI assistance, reducing lead time by 52% compared to 31% for senior developers, suggesting that AI may act as an expertise equalizer within teams; and the Documentation Paradox, in which automation lowered manual documentation effort yet overall documentation volume rose by 28%, though this increase produced documentation that was more structured in machine-readable formats, more traceable through links to specific code changes, and more actionable by being integrated into remediation workflows. Future research should prioritize longitudinal studies that assess the sustainability of AI augmentation across multiple quarters, replication across diverse industries, and team structures to validate findings and comprehensive economic analyses that include indirect benefits. On the technical side, key questions involve determining optimal human-AI task allocation strategies, developing explainable AI systems for DevSecOps and critical infrastructure, and advancing federated learning approaches to enable privacy-preserving model training across organizational boundaries. At the organizational level, research should explore the cultural and structural dynamics that influence successful AI adoption, examine how developer roles evolve with augmentation, and establish governance models and regulatory frameworks for AI-assisted software deliveries.</p>
            </sec>
        </sec>
        <sec id="sec36">
            <title>6. Conclusion, limitation, and future research</title>
            <p>This study presents a sprint-based experimental evaluation of Agile DevSecOps release management, demonstrating a statistically significant lead-time reduction through automated security integration and AI-assisted decision support. This research contributes to both academic and industrial DevSecOps practices by providing controlled experimental evidence that on-premises generative AI can significantly reduce software delivery lead time by 39.2% (p&#x00a0;&lt;&#x00a0;0.01) without degrading quality. It introduces a practical plan&#x2013;automate&#x2013;monitor framework, enhanced with RAG and RLHF components, to guide the adoption of AI-assisted release management while ensuring human oversight and governance. In addition, this study proposes a comprehensive and replicable measurement methodology that enables a systematic quantitative evaluation of the impact of AI on software delivery, addressing a key gap in existing DevSecOps research. This study extends DevSecOps research by providing empirical, stage-level evidence of how assistive generative AI alters release dynamics in regulated enterprise settings. These findings challenge the traditional assumption of a speed&#x2013;security trade-off by demonstrating that human-in-the-loop AI can simultaneously enhance delivery efficiency and governance effectiveness. For practitioners, this study offers a replicable measurement framework and implementation blueprint for integrating AI into DevSecOps pipelines without relinquishing human control. The results indicate that organizations can achieve measurable delivery improvements by targeting high-friction governance points, particularly security validation and release-approval processes. In an era of accelerating digital transformation and escalating cyber threats, the integration of generative AI into DevSecOps practices offers a promising path toward more resilient and responsive software delivery systems. By maintaining a principled focus on human oversight, auditability, and continuous improvement, organizations can harness AI&#x2019;s potential of AI to not only accelerate their release cycles but also elevate the quality, security, and reliability of the software systems upon which modern society increasingly depends. This research is limited by its short duration and single organizational context. Future research should evaluate the longitudinal effects, multi-team deployments, and predictive risk assessment capabilities to further validate the scalability and sustainability of the proposed model. On-premises generative AI can significantly reduce the DevSecOps lead time while maintaining governance standards, offering a viable path for enterprises seeking to accelerate secure software delivery. The future of software delivery lies not in choosing between human expertise and artificial intelligence but in forging new partnerships that leverage the unique strengths of each, creating development ecosystems that are simultaneously more efficient, secure, and human-centered. As software systems become increasingly critical to economic and social infrastructure, the responsible integration of AI into development practices represents both competitive imperatives and ethical responsibilities. This research provides an initial roadmap for organizations embarking on this journey, emphasizing that the ultimate measure of success is not merely faster software delivery but more trustworthy, secure, and valuable software systems. The convergence of AI and DevSecOps represents not only technological evolution but also a fundamental reimagining of software delivery paradigms. Our findings suggest that when thoughtfully integrated with appropriate human oversight, AI systems can transform the traditional speed-security tradeoff into a synergistic relationship in which enhanced security enables accelerated delivery. In conclusion, this study demonstrates that thoughtfully integrated on-premises generative AI can serve as an effective decision-support mechanism in DevSecOps, enabling faster and more reliable software delivery while preserving essential security and compliance controls.</p>
        </sec>
        <sec id="sec37">
            <title>Ethical approval and consent</title>
            <p>This study evaluated sprint lead-time performance within an Agile DevSecOps release management process. The research 
                <bold>did not</bold> involve medical research, clinical intervention, animal experimentation, or the collection of personal sensitive data. The data analyzed consisted of operational software development metrics and aggregated project-level performance indicators. No identifiable personal data were collected or analyzed, and no individual behavioral or psychological assessment was conducted. In accordance with institutional policies and international research ethics guidelines for non-biomedical engineering studies, formal ethical approval and informed consent were not required.</p>
        </sec>
    </body>
    <back>
        <sec id="sec40" sec-type="data-availability">
            <title>Data availability statement</title>
            <p>

                <bold>Repository name</bold>: 
                <italic toggle="yes">Assessing the Impact of AI-Augmented DevSecOps on Lead Time in Agile Release Management.</italic> 
                <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.18830679">

                    <italic toggle="yes">https://doi.org/10.5281/zenodo.18830679</italic>
</ext-link> [
                <xref ref-type="bibr" rid="ref22">Agung Gunawan et al.,2026</xref>].</p>
            <sec id="sec41">
                <title>Underlying data</title>
                <p>

                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>
change_level_data.csv (Change-level dataset containing baseline and AI-augmented commit timestamps, deployment timestamps, and calculated lead times per change).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>release_data.csv (Release-level dataset including ReleaseID, sprint identifier, number of changes, and failure status for each deployment).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>
rlhf_learning_curve.csv (Weekly AI recommendation acceptance rates and reinforcement learning from human feedback performance metrics corresponding to AI component evaluation).</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>statistical_tests.txt (Output of Welch&#x2019;s two-sample t-test and associated statistical analysis results for lead-time comparison).</p>
                        </list-item>
                    </list>
                </p>
            </sec>
            <sec id="sec42">
                <title>Extended data</title>
                <p>Assessing the Impact of AI-Augmented DevSecOps on Lead Time in Agile Release Management. Zenodo. 2026. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.18830679">https://doi.org/10.5281/zenodo.18830679</ext-link>]. [
                    <xref ref-type="bibr" rid="ref22">Agung Gunawan et al.,2026</xref>]
                    <list list-type="bullet">
                        <list-item>
                            <label>&#x2022;</label>
                            <p>Supplementals File 01 - Diagrams.docx: content of high diagram, git diagram and UML full processing.</p>
                        </list-item>
                        <list-item>
                            <label>&#x2022;</label>
                            <p>supplementals File 02 - CODES.docx: context of the Python Codes used for the diagrams and processed.</p>
                        </list-item>
                    </list>
                </p>
                <p>All data and extended materials are available under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) License.</p>
            </sec>
        </sec>
        <ack>
            <title>Acknowledgement</title>
            <p>The authors thank the Interdisciplinary School of Management and Technology, Institut Teknologi Sepuluh Nopember, Surabaya, for facilitating the entire study process. We are grateful for their invaluable support.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Abiona</surname>
                            <given-names>O</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Oladapo</surname>
                            <given-names>O</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Modupe</surname>
                            <given-names>O</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The emergence and importance of DevSecOps: Integrating and reviewing security practices within the DevOps pipeline.</article-title>
                    <source>

                        <italic toggle="yes">World Journal of Advanced Engineering Technology and Sciences.</italic>
</source>
                    <year>2024</year>;<volume>11</volume>(<issue>2</issue>):<fpage>127</fpage>&#x2013;<lpage>133</lpage>.
                    <pub-id pub-id-type="doi">10.30574/wjaets.2024.11.2.0093</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Adewusi</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Daraojimba</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Adaga</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>AI in precision agriculture: A review of technologies for sustainable farming practices.</article-title>
                    <source>

                        <italic toggle="yes">World Journal of Advanced Research and Reviews.</italic>
</source>
                    <year>2024</year>;<volume>1</volume>:<fpage>2276</fpage>&#x2013;<lpage>2285</lpage>.
                    <pub-id pub-id-type="doi">10.30574/wjarr.2024.21.1.0314</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref3">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Agarwal</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gupta</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Choudhury</surname>
                            <given-names>T</given-names>
                        </name>
</person-group>:
                    <article-title>Continuous and Integrated Software Development using DevOps.</article-title>
                    <year>2018, June 1</year>.
                    <pub-id pub-id-type="doi">10.1109/icacce.2018.8458052</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref4">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Agarwal</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <article-title>Test Case Automation: Transforming Software Testing in the Digital Era.</article-title>
                    <source>

                        <italic toggle="yes">International Journal of Computing and Engineering.</italic>
</source>
                    <year>2024</year>;<volume>6</volume>(<issue>5</issue>):<fpage>52</fpage>&#x2013;<lpage>58</lpage>.
                    <pub-id pub-id-type="doi">10.47941/ijce.2314</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ahmed</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Francis</surname>
                            <given-names>SC</given-names>
                        </name>
</person-group>:
                    <article-title>Integrating Security with DevSecOps: Techniques and Challenges.</article-title>
                    <year>2019</year>;<fpage>178</fpage>&#x2013;<lpage>182</lpage>.
                    <pub-id pub-id-type="doi">10.1109/icd47981.2019.9105789</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ajiga</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Okeleke</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Folorunsho</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Enhancing software development practices with AI insights in high-tech companies.</article-title>
                    <source>

                        <italic toggle="yes">Computer Science &amp;amp; IT Research Journal.</italic>
</source>
                    <year>2024</year>;<volume>5</volume>(<issue>8</issue>):<fpage>1897</fpage>&#x2013;<lpage>1919</lpage>.
                    <pub-id pub-id-type="doi">10.51594/csitrj.v5i8.1450</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Akbar</surname>
                            <given-names>MA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Smolander</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mahmood</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Toward successful DevSecOps in software development organizations: A decision-making framework.</article-title>
                    <source>

                        <italic toggle="yes">Inf. Softw. Technol.</italic>
</source>
                    <year>2022</year>;<volume>147</volume>:<fpage>106894</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.infsof.2022.106894</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Amugongo</surname>
                            <given-names>LM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mascheroni</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Brooks</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Retrieval augmented generation for large language models in healthcare: A systematic review.</article-title>
                    <source>

                        <italic toggle="yes">PLOS Digital Health.</italic>
</source>
                    <year>2025</year>;<volume>4</volume>(<issue>6</issue>):<fpage>e0000877</fpage>.
                    <pub-id pub-id-type="pmid">40498738</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pdig.0000877</pub-id>
                    <pub-id pub-id-type="pmcid">PMC12157099</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref9">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Arslan</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Munawar</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cruz</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>Business insights using RAG&#x2013;LLMs: a review and case study.</article-title>
                    <source>

                        <italic toggle="yes">J. Decis. Syst.</italic>
</source>
                    <year>2024</year>;<fpage>1</fpage>&#x2013;<lpage>30</lpage>. ahead-of-print (ahead-of-print).
                    <pub-id pub-id-type="doi">10.1080/12460125.2024.2410040</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref10">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Azonuche</surname>
                            <given-names>TI</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Enyejo</surname>
                            <given-names>JO</given-names>
                        </name>
</person-group>:
                    <article-title>Agile Transformation in Public Sector IT Projects Using Lean-Agile Change Management and Enterprise Architecture Alignment.</article-title>
                    <source>

                        <italic toggle="yes">International Journal of Scientific Research and Modern Technology.</italic>
</source>
                    <year>2024</year>;<volume>3</volume>(<issue>8</issue>):<fpage>21</fpage>&#x2013;<lpage>39</lpage>.
                    <pub-id pub-id-type="doi">10.38124/ijsrmt.v3i8.432</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Badshah</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Khan</surname>
                            <given-names>AA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Khan</surname>
                            <given-names>B</given-names>
                        </name>
</person-group>:
                    <article-title>Towards Process Improvement in DevOps.</article-title>
                    <year>2020</year>;<fpage>427</fpage>&#x2013;<lpage>433</lpage>.
                    <pub-id pub-id-type="doi">10.1145/3383219.3383280</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bahi</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gharib</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gahi</surname>
                            <given-names>Y</given-names>
                        </name>
</person-group>:
                    <article-title>Integrating Generative AI for Advancing Agile Software Development and Mitigating Project Management Challenges.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Adv. Comput. Sci. Appl.</italic>
</source>
                    <year>2024</year>;<volume>15</volume>(<issue>3</issue>).
                    <pub-id pub-id-type="doi">10.14569/IJACSA.2024.0150306</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref13">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bedoya</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Palacios</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>D&#x00ed;az-L&#x00f3;pez</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Enhancing DevSecOps practice with Large Language Models and Security Chaos Engineering.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Inf. Secur.</italic>
</source>
                    <year>2024</year>;<volume>23</volume>(<issue>6</issue>):<fpage>3765</fpage>&#x2013;<lpage>3788</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10207-024-00909-w</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref14">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cervone</surname>
                            <given-names>HF</given-names>
                        </name>
</person-group>:
                    <article-title>Understanding agile project management methods using Scrum.</article-title>
                    <source>

                        <italic toggle="yes">OCLC Systems &amp; Services: International digital library perspectives.</italic>
</source>
                    <year>2011</year>;<volume>27</volume>(<issue>1</issue>):<fpage>18</fpage>&#x2013;<lpage>22</lpage>.
                    <pub-id pub-id-type="doi">10.1108/10650751111106528</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Donca</surname>
                            <given-names>I-C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Stan</surname>
                            <given-names>OP</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Misaros</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Method for Continuous Integration and Deployment Using a Pipeline Generator for Agile Software Projects.</article-title>
                    <source>

                        <italic toggle="yes">Sensors.</italic>
</source>
                    <year>2022</year>;<volume>22</volume>(<issue>12</issue>):<fpage>4637</fpage>.
                    <pub-id pub-id-type="pmid">35746421</pub-id>
                    <pub-id pub-id-type="doi">10.3390/s22124637</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9231338</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref16">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dugbartey</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kehinde</surname>
                            <given-names>O</given-names>
                        </name>
</person-group>:
                    <article-title>Optimizing project delivery through agile methodologies: Balancing speed, collabora-tion and stakeholder engagement.</article-title>
                    <source>

                        <italic toggle="yes">World Journal of Advanced Research and Reviews.</italic>
</source>
                    <year>2025</year>;<volume>25</volume>(<issue>1</issue>):<fpage>1237</fpage>&#x2013;<lpage>1257</lpage>.
                    <pub-id pub-id-type="doi">10.30574/wjarr.2025.25.1.0193</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref17">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Fu</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pasuksmit</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tantithamthavorn</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>AI for DevSecOps: A Landscape and Future Opportunities.</article-title>
                    <source>

                        <italic toggle="yes">ACM Trans. Softw. Eng. Methodol.</italic>
</source>
                    <year>2025</year>;<volume>34</volume>(<issue>4</issue>):<fpage>1</fpage>&#x2013;<lpage>61</lpage>.
                    <pub-id pub-id-type="doi">10.1145/3712190</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref18">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gajbhiye</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Aggarwal</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jain</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Automated Security Testing in DevOps Environments Using AI and ML.</article-title>
                    <source>

                        <italic toggle="yes">International Journal for Research Publication and Seminar.</italic>
</source>
                    <year>2024</year>;<volume>15</volume>(<issue>2</issue>):<fpage>259</fpage>&#x2013;<lpage>271</lpage>.
                    <pub-id pub-id-type="doi">10.36676/jrps.v15.i2.1472</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref19">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gao</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Xiong</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gao</surname>
                            <given-names>X</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Retrieval-Augmented Generation for Large Language Models: A Survey.</article-title>
                    <year>2023</year>.
                    <pub-id pub-id-type="doi">10.48550/arxiv.2312.10997</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref20">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Garg</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pundir</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rathee</surname>
                            <given-names>G</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>On Continuous Integration/Continuous Delivery for Automated Deployment of Machine Learning Models using MLOps.</article-title>
                    <year>2021</year>;<fpage>25</fpage>&#x2013;<lpage>28</lpage>.
                    <pub-id pub-id-type="doi">10.1109/aike52691.2021.00010</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref21">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Gargari</surname>
                            <given-names>OK</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Habibi</surname>
                            <given-names>G</given-names>
                        </name>
</person-group>:
                    <article-title>Enhancing medical AI with retrieval-augmented generation: A mini narrative review.</article-title>
                    <source>

                        <italic toggle="yes">Digital Health.</italic>
</source>
                    <year>2025</year>;<volume>11</volume>.
                    <pub-id pub-id-type="pmid">40343063</pub-id>
                    <pub-id pub-id-type="doi">10.1177/20552076251337177</pub-id>
                    <pub-id pub-id-type="pmcid">PMC12059965</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref22">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Agung Gunawan</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Singgih</surname>
                            <given-names>ML</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ginardi</surname>
                            <given-names>RVH</given-names>
                        </name>
</person-group>:
                    <article-title>Assessing the Impact of AI-Augmented DevSecOps on Lead Time in Agile Release Management.</article-title>
                    <year>2026</year>.
                    <pub-id pub-id-type="doi">10.5281/zenodo.18830679</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref23">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hatch</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Curry</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Changing how we evaluate research is difficult, but not impossible.</article-title>
                    <source>

                        <italic toggle="yes">elife.</italic>
</source>
                    <year>2020</year>;<volume>9</volume>.
                    <pub-id pub-id-type="doi">10.7554/elife.58654</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref24">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hikov</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Murphy</surname>
                            <given-names>L</given-names>
                        </name>
</person-group>:
                    <article-title>Information retrieval from textual data: Harnessing large language models, retrieval augmented generation, and prompt engineering.</article-title>
                    <source>

                        <italic toggle="yes">Journal of AI, Robotics &amp;amp; Workplace Automation.</italic>
</source>
                    <year>2024</year>;<volume>3</volume>(<issue>2</issue>):<fpage>142</fpage>.
                    <pub-id pub-id-type="doi">10.69554/qafe6376</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref25">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jeong</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>A Study on the Implementation of Generative AI Services Using an Enterprise Data-Based LLM Application Architecture.</article-title>
                    <source>

                        <italic toggle="yes">Advances in Artificial Intelligence and Machine Learning.</italic>
</source>
                    <year>2023</year>;<volume>03</volume>(<issue>04</issue>):<fpage>1588</fpage>&#x2013;<lpage>1618</lpage>.
                    <pub-id pub-id-type="doi">10.54364/aaiml.2023.1191</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref26">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Joel</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rajakumari</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hemalatha</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <source>

                        <italic toggle="yes">To Survive in a Fast-Changing Business Landscape in the Age of Digital Transformation, Organizations Must Be Flexible and Adaptive.</italic>
</source>
                    <publisher-name>Igi Global</publisher-name>;<year>2024</year>;<fpage>289</fpage>&#x2013;<lpage>304</lpage>.
                    <pub-id pub-id-type="doi">10.4018/979-8-3693-3318-1.ch016</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref27">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Junker</surname>
                            <given-names>TL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bakker</surname>
                            <given-names>AB</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Derks</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Agile work practices: measurement and mechanisms.</article-title>
                    <source>

                        <italic toggle="yes">Eur. J. Work Organ. Psy.</italic>
</source>
                    <year>2022</year>;<volume>32</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>22</lpage>.
                    <pub-id pub-id-type="doi">10.1080/1359432x.2022.2096439</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref28">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Karamitsos</surname>
                            <given-names>I</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Thabit</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Apostolopoulos</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>Applying DevOps Practices of Continuous Automation for Machine Learning.</article-title>
                    <source>

                        <italic toggle="yes">Information.</italic>
</source>
                    <year>2020</year>;<volume>11</volume>(<issue>7</issue>):<fpage>363</fpage>.
                    <pub-id pub-id-type="doi">10.3390/info11070363</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref29">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Khan</surname>
                            <given-names>MI</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Parahyanti</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hussain</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>The Role of Generative AI in Human Resource Management: Enhancing Operational Efficiency, Decision-Making, and Addressing Ethical Challenges.</article-title>
                    <source>

                        <italic toggle="yes">Asian Journal of Logistics Management.</italic>
</source>
                    <year>2024</year>;<volume>3</volume>(<issue>2</issue>):<fpage>104</fpage>&#x2013;<lpage>125</lpage>.
                    <pub-id pub-id-type="doi">10.14710/ajlm.2024.24671</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref30">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Knollmeyer</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Caymazer</surname>
                            <given-names>O</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Grossmann</surname>
                            <given-names>D</given-names>
                        </name>
</person-group>:
                    <article-title>Document GraphRAG: Knowledge Graph Enhanced Retrieval Augmented Generation for Document Question Answering Within the Manufacturing Domain.</article-title>
                    <source>

                        <italic toggle="yes">Electronics.</italic>
</source>
                    <year>2025</year>;<volume>14</volume>(<issue>11</issue>):<fpage>2102</fpage>.
                    <pub-id pub-id-type="doi">10.3390/electronics14112102</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref31">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Liang</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wu</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Xu</surname>
                            <given-names>Z</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Enhancing Security in DevOps by Integrating Artificial Intelligence and Machine Learning.</article-title>
                    <source>

                        <italic toggle="yes">J Theory Pract Eng Sci.</italic>
</source>
                    <year>2024</year>;<volume>4</volume>(<issue>02</issue>):<fpage>31</fpage>&#x2013;<lpage>37</lpage>.
                    <pub-id pub-id-type="doi">10.53469/jtpes.2024.04(02).05</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref32">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Nadella</surname>
                            <given-names>GS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Addula</surname>
                            <given-names>SR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yadulla</surname>
                            <given-names>AR</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Generative AI-Enhanced Cybersecurity Framework for Enterprise Data Privacy Management.</article-title>
                    <source>

                        <italic toggle="yes">Computers.</italic>
</source>
                    <year>2025</year>;<volume>14</volume>(<issue>2</issue>):<fpage>55</fpage>.
                    <pub-id pub-id-type="doi">10.3390/computers14020055</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref33">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Neha</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bhati</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Shukla</surname>
                            <given-names>DK</given-names>
                        </name>
</person-group>:
                    <article-title>Retrieval-Augmented Generation (RAG) in Healthcare: A Comprehensive Review.</article-title>
                    <source>

                        <italic toggle="yes">AI.</italic>
</source>
                    <year>2025</year>;<volume>6</volume>(<issue>9</issue>):<fpage>226</fpage>.
                    <pub-id pub-id-type="doi">10.3390/ai6090226</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref34">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Omran Almagrabi</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Khan</surname>
                            <given-names>RA</given-names>
                        </name>
</person-group>:
                    <article-title>Optimizing Secure AI Lifecycle Model Management with Innovative Generative AI Strategies.</article-title>
                    <source>

                        <italic toggle="yes">IEEE Access.</italic>
</source>
                    <year>2025</year>;<volume>13</volume>:<fpage>12889</fpage>&#x2013;<lpage>12920</lpage>.
                    <pub-id pub-id-type="doi">10.1109/access.2024.3491373</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref35">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Paasivaara</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Durasiewicz</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lassenius</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>Using Scrum in Distributed Agile Development: A Multiple Case Study.</article-title>
                    <year>2009</year>;<fpage>195</fpage>&#x2013;<lpage>204</lpage>.
                    <pub-id pub-id-type="doi">10.1109/icgse.2009.27</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref36">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Prates</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pereira</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>DevSecOps practices and tools.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Inf. Secur.</italic>
</source>
                    <year>2024</year>;<volume>24</volume>(<issue>1</issue>).
                    <pub-id pub-id-type="doi">10.1007/s10207-024-00914-z</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref37">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rangnau</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Buijtenen</surname>
                            <given-names>RV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fransen</surname>
                            <given-names>F</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Continuous Security Testing: A Case Study on Integrating Dynamic Security Testing Tools in CI/CD Pipelines.</article-title>
                    <year>2020</year>;<fpage>145</fpage>&#x2013;<lpage>154</lpage>.
                    <pub-id pub-id-type="doi">10.1109/edoc49727.2020.00026</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref38">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Salo</surname>
                            <given-names>O</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Abrahamsson</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>An iterative improvement process for agile software development.</article-title>
                    <source>

                        <italic toggle="yes">Softw Process Improv Pract.</italic>
</source>
                    <year>2006</year>;<volume>12</volume>(<issue>1</issue>):<fpage>81</fpage>&#x2013;<lpage>100</lpage>.
                    <pub-id pub-id-type="doi">10.1002/spip.305</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref39">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Schmid</surname>
                            <given-names>SL</given-names>
                        </name>
</person-group>:
                    <article-title>Five years post-DORA: Promoting best practices for research assessment.</article-title>
                    <source>

                        <italic toggle="yes">Mol. Biol. Cell.</italic>
</source>
                    <year>2017</year>;<volume>28</volume>(<issue>22</issue>):<fpage>2941</fpage>&#x2013;<lpage>2944</lpage>.
                    <pub-id pub-id-type="pmid">29084913</pub-id>
                    <pub-id pub-id-type="doi">10.1091/mbc.e17-08-0534</pub-id>
                    <pub-id pub-id-type="pmcid">PMC5662254</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref40">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Shamsuddoha</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Khan</surname>
                            <given-names>EA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chowdhury</surname>
                            <given-names>MMH</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Revolutionizing Supply Chains: Unleashing the Power of AI-Driven Intelligent Automation and Real-Time Information Flow.</article-title>
                    <source>

                        <italic toggle="yes">Information.</italic>
</source>
                    <year>2025</year>;<volume>16</volume>(<issue>1</issue>):<fpage>26</fpage>.
                    <pub-id pub-id-type="doi">10.3390/info16010026</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref41">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Singh</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ehtesham</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kumar</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG.</article-title>
                    <year>2025</year>.
                    <pub-id pub-id-type="doi">10.48550/arxiv.2501.09136</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref42">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Snyder</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Curtis</surname>
                            <given-names>B</given-names>
                        </name>
</person-group>:
                    <article-title>Using Analytics to Guide Improvement during an Agile&#x2013;DevOps Transformation.</article-title>
                    <source>

                        <italic toggle="yes">IEEE Softw.</italic>
</source>
                    <year>2018</year>;<volume>35</volume>(<issue>1</issue>):<fpage>78</fpage>&#x2013;<lpage>83</lpage>.
                    <pub-id pub-id-type="doi">10.1109/ms.2017.4541032</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref43">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Tomas</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Huang</surname>
                            <given-names>H</given-names>
                        </name>
</person-group>:
                    <article-title>An Empirical Study on Culture, Automation, Measurement, and Sharing of DevSecOps.</article-title>
                    <pub-id pub-id-type="doi">10.1109/cybersecpods.2019.8884935</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref44">
                <mixed-citation publication-type="book">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Uluda&#x011f;</surname>
                            <given-names>&#x00d6;</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Putta</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Paasivaara</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <source>

                        <italic toggle="yes">Evolution of the Agile Scaling Frameworks.</italic>
</source>
                    <publisher-name>Springer</publisher-name>;<year>2021</year>;<fpage>123</fpage>&#x2013;<lpage>139</lpage>.
                    <pub-id pub-id-type="doi">10.1007/978-3-030-78098-2_8</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref45">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ur Rahman</surname>
                            <given-names>AA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Williams</surname>
                            <given-names>L</given-names>
                        </name>
</person-group>:
                    <article-title>Software security in DevOps.</article-title>
                    <year>2016</year>;<fpage>70</fpage>&#x2013;<lpage>76</lpage>.
                    <pub-id pub-id-type="doi">10.1145/2896941.2896946</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref46">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wessel</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Adam</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Benlian</surname>
                            <given-names>A</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Generative AI and Its Transformative Value for Digital Platforms.</article-title>
                    <source>

                        <italic toggle="yes">J. Manag. Inf. Syst.</italic>
</source>
                    <year>2025</year>;<volume>42</volume>(<issue>2</issue>):<fpage>346</fpage>&#x2013;<lpage>369</lpage>.
                    <pub-id pub-id-type="doi">10.1080/07421222.2025.2487315</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref47">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yigit</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Buchanan</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tehrani</surname>
                            <given-names>M</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Review of Generative AI Methods in Cybersecurity.</article-title>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.48550/arxiv.2403.08701</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref48">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Yu</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gan</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Evaluation of Retrieval-Augmented Generation: A Survey.</article-title>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.48550/arxiv.2405.07437</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref49">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zayat</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Senvar</surname>
                            <given-names>O</given-names>
                        </name>
</person-group>:
                    <article-title>Framework Study for Agile Software Development Via Scrum and Kanban.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Innov. Technol. Manag.</italic>
</source>
                    <year>2020</year>;<volume>17</volume>(<issue>04</issue>).
                    <pub-id pub-id-type="doi">10.1142/s0219877020300025</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref50">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>Hallucination Mitigation for Retrieval-Augmented Large Language Models: A Review.</article-title>
                    <source>

                        <italic toggle="yes">Mathematics.</italic>
</source>
                    <year>2025</year>;<volume>13</volume>(<issue>5</issue>):<fpage>856</fpage>.
                    <pub-id pub-id-type="doi">10.3390/math13050856</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref51">
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhao</surname>
                            <given-names>P</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Yu</surname>
                            <given-names>Q</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Retrieval-Augmented Generation for AI-Generated Content: A Survey.</article-title>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.48550/arxiv.2402.19473</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref52">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhou</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>Advanced Embedding Techniques in Multimodal Retrieval Augmented Generation: A Comprehensive Study on Cross-Modal AI Applications.</article-title>
                    <source>

                        <italic toggle="yes">J Comput Electron Inf Manag.</italic>
</source>
                    <year>2024</year>;<volume>13</volume>(<issue>3</issue>):<fpage>16</fpage>&#x2013;<lpage>22</lpage>.
                    <pub-id pub-id-type="doi">10.54097/h8wf8vah</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref53">
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zota</surname>
                            <given-names>RD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>B&#x0103;rbulescu</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Constantinescu</surname>
                            <given-names>R</given-names>
                        </name>
</person-group>:
                    <article-title>A Practical Approach to Defining a Framework for Developing an Agentic AIOps System.</article-title>
                    <source>

                        <italic toggle="yes">Electronics.</italic>
</source>
                    <year>2025</year>;<volume>14</volume>(<issue>9</issue>):<fpage>1775</fpage>.
                    <pub-id pub-id-type="doi">10.3390/electronics14091775</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report487649">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.196407.r487649</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Wibawa</surname>
                        <given-names>Aji Prasetya</given-names>
                    </name>
                    <xref ref-type="aff" rid="r487649a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r487649a1">
                    <label>1</label>Universitas Negeri Malang, Malang, East Java, Indonesia</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>8</day>
                <month>6</month>
                <year>2026</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2026 Wibawa AP</copyright-statement>
                <copyright-year>2026</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport487649" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.178067.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The article investigates the impact of AI-augmented DevSecOps on lead time in Agile release management. The authors compare a conventional DevSecOps sprint with an AI-augmented sprint integrating retrieval-augmented generation, reinforcement learning from human feedback, and human-in-the-loop release support within an enterprise CI/CD environment. The study reports a substantial reduction in lead time for changes, with secondary improvements in security scanning time, approval latency, deployment frequency, and change failure rate. The topic is timely and relevant, especially given the growing interest in applying generative AI to software engineering, DevSecOps, security validation, and enterprise release management.</p>
            <p> </p>
            <p> Overall, the manuscript addresses an important and practically relevant problem. The attempt to provide empirical evidence from an enterprise DevSecOps setting is valuable, and the focus on lead time for changes is appropriate. However, I recommend Approved with Reservations because several issues need to be addressed before the article can be considered scientifically sound and fully reproducible.</p>
            <p> </p>
            <p> 1. Clarity, presentation, and current literature</p>
            <p> The work is partly clearly and accurately presented. The article cites a broad range of literature on DevSecOps, generative AI, RAG, RLHF, agile development, CI/CD, and security automation. However, the manuscript contains substantial repetition across the Introduction, Methods, Results, Discussion, and Conclusion sections. Several paragraphs restate the same claims about AI-assisted DevSecOps, lead-time reduction, and human-in-the-loop governance.</p>
            <p> </p>
            <p> More importantly, parts of the Results section still appear to be written as if the study were a proposal rather than a completed empirical study. For example, wording such as &#x201c;This study expects&#x2026;&#x201d; and references to tables being populated in a &#x201c;real experiment&#x201d; should be removed or rewritten. Since the manuscript already reports actual sprint-level results, all hypothetical or proposal-style language should be replaced with precise reporting of the completed study.</p>
            <p> </p>
            <p> The literature coverage is generally adequate, but the authors should strengthen the positioning of the study against prior empirical work on DevOps metrics, DORA metrics, AI-assisted software engineering, DevSecOps automation, and quasi-experimental evaluations in software engineering. Some cited sources appear broad or only indirectly related, and the manuscript would benefit from a more selective use of high-quality and directly relevant references.</p>
            <p> </p>
            <p> Points that must be addressed: 
                <list list-type="bullet">
                    <list-item>
                        <p>Remove all proposal-style or hypothetical wording from the Results and Discussion sections.</p>
                    </list-item>
                    <list-item>
                        <p>Reduce repetition across the manuscript.</p>
                    </list-item>
                    <list-item>
                        <p>Clarify the exact research gap and how this study differs from existing empirical DevOps and AI-assisted software engineering studies.</p>
                    </list-item>
                    <list-item>
                        <p>Ensure that all cited literature is directly relevant and current.</p>
                    </list-item>
                </list> 2. Study design and technical soundness</p>
            <p> The study design is partly appropriate. A quasi-experimental within-team comparison is a reasonable starting point for evaluating AI augmentation in a real enterprise DevSecOps setting. Using the same team, comparable sprint duration, and operational CI/CD logs is appropriate for an applied software engineering study.</p>
            <p> </p>
            <p> However, the design has important limitations. The study compares only two consecutive sprints: one baseline sprint and one intervention sprint. This design is vulnerable to confounding factors, including task complexity differences, learning effects, team adaptation, sprint planning differences, temporal effects, and Hawthorne effects. Although the authors mention control variables such as comparable story points, identical team composition, and similar infrastructure, the manuscript does not provide enough evidence that the two sprint backlogs were truly comparable.</p>
            <p> </p>
            <p> The intervention also combines multiple AI-related components, including RAG, RLHF, code assistance, security recommendations, release summaries, and approval support. Because these components were introduced together, it is difficult to determine which component contributed most to the observed lead-time reduction. The manuscript discusses stage-level improvements, but the causal attribution to specific AI components remains limited.</p>
            <p> </p>
            <p> Points that must be addressed: 
                <list list-type="bullet">
                    <list-item>
                        <p>Provide a clearer description of the two sprint backlogs, including task types, story points, complexity distribution, and domain similarity.</p>
                    </list-item>
                    <list-item>
                        <p>Explain how comparability between the baseline and intervention sprints was assessed.</p>
                    </list-item>
                    <list-item>
                        <p>Discuss confounding factors more explicitly.</p>
                    </list-item>
                    <list-item>
                        <p>Avoid overly causal language unless supported by the design.</p>
                    </list-item>
                    <list-item>
                        <p>Clarify which parts of the AI intervention were active in each pipeline stage.</p>
                    </list-item>
                </list> 3. Methods and replication details</p>
            <p> The manuscript provides useful general information about the research setting, team composition, technology stack, CI/CD tools, and broad AI architecture. However, the methodological details are only partly sufficient for replication.</p>
            <p> </p>
            <p> The authors should provide more details on the AI system configuration, RAG pipeline, retrieval corpus, embedding model, chunking strategy, retrieval parameters, prompt design, RLHF procedure, model refinement process, and human feedback protocol. The manuscript mentions FAISS, all-MiniLM-L6-v2, Llama 2 7B, JSONL audit logs, and weekly refinement cycles, but the exact operational implementation is not sufficiently detailed.</p>
            <p> </p>
            <p> There is also a possible inconsistency in the hardware description. The manuscript refers to an on-premises Llama 2 7B infrastructure and also states that the workflow was executed without external GPU acceleration on an Intel Core i5 machine with 16GB RAM. This requires clarification because the computational setup affects reproducibility and feasibility.</p>
            <p> </p>
            <p> Points that must be addressed: 
                <list list-type="bullet">
                    <list-item>
                        <p>Provide a technical configuration table for the AI system.</p>
                    </list-item>
                    <list-item>
                        <p>Describe the RAG knowledge base, document sources, preprocessing, chunk size, embedding model, vector index, retrieval top-k, prompt structure, and response validation process.</p>
                    </list-item>
                    <list-item>
                        <p>Describe the RLHF feedback process in more detail, including who provided feedback, how feedback was encoded, and whether the model was actually fine-tuned or only adjusted through feedback-based selection.</p>
                    </list-item>
                    <list-item>
                        <p>Clarify the hardware environment and whether GPU resources were used.</p>
                    </list-item>
                    <list-item>
                        <p>Provide enough pipeline configuration details so that another team could reproduce the intervention.</p>
                    </list-item>
                </list> 4. Statistical analysis and interpretation</p>
            <p> The statistical analysis is partly appropriate. Welch&#x2019;s t-test, effect size estimation, confidence intervals, and descriptive statistics are suitable for comparing lead time between two groups when variances may differ. Reporting Cohen&#x2019;s d and a confidence interval is also helpful.</p>
            <p> </p>
            <p> However, the interpretation should be more cautious because the sample size is limited to 42 changes, with 21 per sprint. The independence assumption should be discussed because changes within the same sprint and team may not be fully independent. The use of multiple metrics also raises the issue of selective interpretation. The manuscript should clarify whether statistical testing was applied only to lead time or also to secondary metrics such as security scan time, approval wait time, deployment frequency, and change failure rate.</p>
            <p> </p>
            <p> There is also a major issue of numerical inconsistency. In one part of the manuscript, the change failure rate is reported as decreasing from 14.3% to 8.7%, while another figure and its caption report a decrease from 28% to 12%. These cannot both be correct unless they refer to different denominators or different subsets. This must be corrected because change failure rate is central to the claim that faster delivery did not compromise quality.</p>
            <p> </p>
            <p> Points that must be addressed: 
                <list list-type="bullet">
                    <list-item>
                        <p>Correct the inconsistency in change failure rate values.</p>
                    </list-item>
                    <list-item>
                        <p>Clarify the denominator used for change failure rate.</p>
                    </list-item>
                    <list-item>
                        <p>Explain whether the 42 changes are independent observations.</p>
                    </list-item>
                    <list-item>
                        <p>Report whether statistical tests were conducted for secondary outcomes.</p>
                    </list-item>
                    <list-item>
                        <p>Interpret causality more cautiously due to the two-sprint quasi-experimental design.</p>
                    </list-item>
                </list> 5. Source data and reproducibility</p>
            <p> The article states that source data and extended materials are available in a Zenodo repository. This is appropriate and aligns well with the F1000Research model. However, based on the manuscript alone, reproducibility appears only partial.</p>
            <p> </p>
            <p> The repository is described as containing change-level data, release-level data, RLHF learning curve data, statistical test outputs, diagrams, and code. This is useful. However, full reproducibility requires the data dictionary, exact variable definitions, scripts used for statistical analysis, preprocessing steps, and the pipeline configuration used to generate the tables and figures. The manuscript should also clarify whether the source data are raw logs, anonymized operational data, or processed datasets.</p>
            <p> </p>
            <p> Points that must be addressed: 
                <list list-type="bullet">
                    <list-item>
                        <p>Include a clear data dictionary for all CSV files.</p>
                    </list-item>
                    <list-item>
                        <p>Provide the scripts used to generate the reported statistics, tables, and figures.</p>
                    </list-item>
                    <list-item>
                        <p>Clarify whether the repository contains raw or processed data.</p>
                    </list-item>
                    <list-item>
                        <p>Explain any anonymization or aggregation procedures.</p>
                    </list-item>
                    <list-item>
                        <p>Ensure that the reported results in the manuscript can be reproduced directly from the shared data and scripts.</p>
                    </list-item>
                </list> 6. Support for conclusions</p>
            <p> The conclusions are partly supported by the results. The reported data suggest that the AI-augmented sprint had lower lead time and improved several operational metrics. However, the strength of the conclusions should be moderated.</p>
            <p> </p>
            <p> The manuscript sometimes implies that on-premises generative AI significantly reduces DevSecOps lead time while preserving governance and quality. This is plausible within the studied context, but the evidence comes from a short, single-organization, two-sprint evaluation. Therefore, the conclusion should be limited to this specific organizational setting and should avoid broad claims about regulated enterprise environments in general.</p>
            <p> </p>
            <p> The claim that the findings challenge the traditional speed-security trade-off is interesting but somewhat overstated. The evidence supports a preliminary indication of simultaneous improvement in speed and selected reliability indicators within one case study, but not a general challenge to the broader speed-security trade-off across DevSecOps contexts.</p>
            <p> </p>
            <p> Points that must be addressed: 
                <list list-type="bullet">
                    <list-item>
                        <p>Narrow the conclusion to the studied organization and sprint context.</p>
                    </list-item>
                    <list-item>
                        <p>Avoid broad generalization to all regulated enterprise environments.</p>
                    </list-item>
                    <list-item>
                        <p>Reframe the study as preliminary empirical evidence rather than definitive proof.</p>
                    </list-item>
                    <list-item>
                        <p>Align the conclusion with the limitations of the two-sprint quasi-experimental design.</p>
                    </list-item>
                </list> Minor comments: 
                <list list-type="order">
                    <list-item>
                        <p>The abstract is informative, but it should be checked after correcting all numerical inconsistencies.</p>
                    </list-item>
                    <list-item>
                        <p>The Introduction contains repeated paragraphs and should be shortened.</p>
                    </list-item>
                    <list-item>
                        <p>The Methods section should distinguish more clearly between planned methodology and executed methodology.</p>
                    </list-item>
                    <list-item>
                        <p>Table numbering should be checked because Table 1 appears to be used for different purposes in different parts of the manuscript.</p>
                    </list-item>
                    <list-item>
                        <p>The Results section should not include sentences suggesting that the data are hypothetical.</p>
                    </list-item>
                    <list-item>
                        <p>Figure 2 and Table 3 should be reconciled because they report different change failure rate values.</p>
                    </list-item>
                    <list-item>
                        <p>The Conclusion is too long and repetitive. It should be shortened and focused on the key findings, limitations, and future work.</p>
                    </list-item>
                    <list-item>
                        <p>Some terminology should be standardized, including &#x201c;DevOps,&#x201d; &#x201c;DevSecOps,&#x201d; &#x201c;lead time for changes,&#x201d; &#x201c;release readiness,&#x201d; and &#x201c;production deployment.&#x201d;</p>
                    </list-item>
                    <list-item>
                        <p>The authors should proofread the manuscript carefully for grammar, spacing, and formatting inconsistencies.</p>
                    </list-item>
                    <list-item>
                        <p>The references should be checked for relevance and formatting consistency.</p>
                    </list-item>
                </list> </p>
            <p> I recommend Approved with Reservations. The article addresses a relevant and timely topic and provides potentially useful empirical evidence on AI-augmented DevSecOps. However, the authors must address the issues above, especially the inconsistent numerical results, the remaining proposal-style language, the limited description of the AI intervention, the short two-sprint design, and the need for clearer reproducibility details. Once these issues are corrected, the article would make a useful contribution to the literature on AI-assisted DevSecOps and Agile release management.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Partly</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>The article has academic value and addresses an important topic, but it still needs specific revisions related to numerical consistency, methodological clarity, reproducibility, and interpretation of findings.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
    </sub-article>
</article>
