The Effects of Artificial Intelligence-Based Interventions on Depression and Anxiety: A Systematic Review

Ni Made Adinda Putri Puspitarani; I Gusti Ayu Agung Istri Risna Prajna Devi; Modesta Windyarti Natalia Toa Ngey; Kevin Putri Novera Tanaem; Radita Sonixtus Arauna; Wiwin Hendriani

doi:10.12688/f1000research.181969.1

Systematic Review

[version 1; peer review: awaiting peer review]

Previously titled: The Effects of Artificial Intelligence-Based Intervention on Depression and Anxiety: A Systematic Review

Ni Made Adinda Putri Puspitarani¹, I Gusti Ayu Agung Istri Risna Prajna Devi¹, Modesta Windyarti Natalia Toa Ngey¹, Kevin Putri Novera Tanaem¹, Radita Sonixtus Arauna², Wiwin Hendriani¹

PUBLISHED 18 Jun 2026

¹ Department of Psychology, Faculty of Psychology, Universitas Airlangga, Surabaya, East Java, 60286, Indonesia
² Faculty of Business and Economics, Monash University, Tangerang, Indonesia, 15339, Indonesia

Ni Made Adinda Putri Puspitarani
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Supervision, Validation, Visualization, Writing – Original Draft Preparation

I Gusti Ayu Agung Istri Risna Prajna Devi
Roles: Investigation, Methodology, Validation, Writing – Original Draft Preparation

Modesta Windyarti Natalia Toa Ngey
Roles: Investigation, Validation, Writing – Original Draft Preparation

Kevin Putri Novera Tanaem
Roles: Investigation, Validation, Writing – Original Draft Preparation

Radita Sonixtus Arauna
Roles: Formal Analysis, Investigation, Validation, Writing – Original Draft Preparation

Wiwin Hendriani
Roles: Writing – Review & Editing

This article is included in the Artificial Intelligence and Machine Learning gateway.

Abstract

Introduction

Depression and anxiety remain major global mental health challenges that continue to increase across populations. Conventional treatments are often limited by cost, accessibility, stigma, and the availability of professionals. Artificial intelligence (AI)-based interventions have emerged as a potential approach to address these gaps. However, the growing body of evidence across diverse contexts calls for further synthesis. This study aims to examine research characteristics, evaluate effects, and analyse the implementation issues of AI-based interventions for depression and anxiety.

Methods

This systematic review was conducted in accordance with the PRISMA guidelines. Fourteen randomised controlled trials (RCTs) were identified from four databases (Scopus, Web of Science, PubMed, and EBSCO), covering publications from 6 November 2015 to 6 November 2025. Study quality was assessed using the Cochrane Risk of Bias 2 tool, and findings were synthesised using a narrative approach.

Results

The findings indicate that AI-based interventions, such as chatbots, large language models, and integrated platforms, were generally associated with reduced symptoms of depression and anxiety across various populations. However, results remain heterogeneous, with some studies showing outcome-specific or within-group improvements only. Implementation issues were identified, including limited human support, recruitment bias, and short follow-up periods, which may reduce adherence and generalisability and limit the assessment of long-term effects.

Conclusions

AI-based interventions may offer accessible and scalable mental health solutions, with outcomes comparable to conventional care in certain contexts. However, their effects are shaped by implementation-related challenges, including variability in engagement, technological limitations, and ethical considerations. Future research should prioritise more standardised methodologies, longer intervention durations with follow-up, and greater attention to implementation design and sustainability.

Systematic Review Registration

Registered in PROSPERO on 16 February 2026 (Registration number CRD420261308648). Available from: https://www.crd.york.ac.uk/PROSPERO/view/CRD420261308648.

Keywords

anxiety, artificial intelligence, depression, interventions, systematic review

Corresponding author: Ni Made Adinda Putri Puspitarani

Competing interests: No competing interests were disclosed.

Grant information: The research was funded by the Indonesia Endowment Fund for Education (LPDP), Ministry of Finance, Republic of Indonesia.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2026 Puspitarani NMAP et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Puspitarani NMAP, Devi IGAAIRP, Ngey MWNT et al. The Effects of Artificial Intelligence-Based Interventions on Depression and Anxiety: A Systematic Review [version 1; peer review: awaiting peer review]. F1000Research 2026, 15:964 (https://doi.org/10.12688/f1000research.181969.1) First published: 18 Jun 2026, 15:964 (https://doi.org/10.12688/f1000research.181969.1) Latest published: 18 Jun 2026, 15:964 (https://doi.org/10.12688/f1000research.181969.1)

1. Introduction

Depression and anxiety have become two of the most pervasive global mental health challenges, with prevalence rates continuing to rise across age groups and regions.¹ These conditions not only diminish quality of life but are also associated with increased risks of chronic illness,² impaired social functioning,³ and substantial economic burden on healthcare systems.⁴ Conventional treatment approaches, such as face-to-face therapy and psychiatric services, are often constrained by limited availability of mental-health professionals, high costs, stigma, and geographical barriers.^1,5 These complexities underscore the urgent need for innovative strategies to expand the reach, accessibility, and effects of mental health interventions.⁶

Depression and anxiety are mental disorders that contribute to a significant portion of the global disease burden.⁷ The National Health Interview Survey shows that one in five adults experienced symptoms of depression (21.4%) and anxiety (18.2%) during the past two weeks.⁸ These disorders are caused by multiple factors, including biological, psychological, and social factors.⁹ These disorders often co-occur with physical problems, such as chronic physical pain, migraines, insomnia, low pain tolerance, extreme fatigue, and worsening physical and mental conditions.¹⁰ Conventional therapies such as Cognitive Behavioural Therapy (CBT) and medications such as antidepressants and anxiolytics are often used as treatment strategies.¹¹ However, stigma, high costs, limited availability of mental health services, and long waiting times often lead individuals to seek self-help.¹² To address this gap, AI offers 24/7 services, anonymity, and low costs. Through integration with CBT approaches, AI can help track mood, provide psychoeducation, and develop problem-solving skills through conversational interactions that mimic human interaction.¹³

Artificial Intelligence (AI)-based interventions have emerged as potential solutions to address the limitations of traditional mental-health services.¹⁴ These technologies can take various forms, including mobile applications, text-based chatbots, conversational agents, and even passive digital-behavior monitoring systems that help detect early signs of psychological stress.¹⁵ Studies have shown that AI-driven tools can deliver emotional support, psychoeducation, and cognitive-behavioral exercises in a consistent, scalable, and personalized manner.¹⁶ In cases of depression and anxiety, technology-based interventions are used to deliver more interactive and empathetic digital Cognitive Behavioural Therapy (CBT), such as AI chatbots (Therabot, ChatGPT, Psy-Bot, Woebot), Facebook Messenger, and mobile health applications (TEO).^17–21 Early evidence indicates that certain AI-based approaches can produce clinically meaningful improvements and, in some contexts, perform comparably to or even better than conventional interventions,^6,14,22 providing strong justification for further scientific investigation.

However, although previous studies have shown that AI-based conversational agents have a significant impact on reducing symptoms of depression and emotional distress,^6,16,18,23 further research is needed to synthesise the effects of AI in reducing depression and anxiety. A systematic review conducted by Joshi et al.¹ highlights AI-based interventions for anxiety and depression involving individuals with psychological problems as the population. However, the article search was conducted only through 2024 and included articles not indexed in Scopus. A systematic review of AI Chatbots was also conducted by Nyakhar & Wang,¹³ which focused on improving students’ psychological well-being, including anxiety and depression. However, there has been no comprehensive synthesis evaluating the effects of AI-based interventions in simultaneously reducing depression and anxiety in various populations. This study included research articles published in reputable Scopus-indexed journals, indicating high scientific quality.

The rapid and widespread integration of AI into digital health systems worldwide has accelerated its development. As AI-based mental health tools become increasingly integrated into telehealth platforms, they assist with patient monitoring, technology-enabled healthcare, diagnostic support, and data analysis,²⁴ thereby enhancing their clinical impact. Although previous reviews have highlighted the effects of AI for depression and anxiety,^13,25 a new systematic review across diverse populations and contexts is needed to assess the effects of AI-based interventions.²⁶ This systematic review aims to determine the effects of AI-based interventions in reducing depression and anxiety, as well as to examine the implementation issues associated with these interventions.

2. Methods

This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.²⁷ Page et al.²⁷ state that PRISMA reflects advances in methods for identifying, selecting, appraising, and synthesising studies. This study applied the PRISMA guidelines through four stages, namely (1) identifying research questions, (2) identifying literature sources, (3) conducting a literature search that answered the research questions, and (4) analysing the findings. The protocol was registered in PROSPERO (International Prospective Register of Systematic Reviews; CRD420261308648).

2.1 Identifying research questions (RQ)

This systematic review will answer research questions regarding the effects of AI-based interventions on anxiety and depression. The three research questions include:

RQ 1. What are the effects of AI-based interventions in reducing depression and anxiety?

RQ 2. What are the characteristics and patterns of research on AI-based interventions for anxiety and depression in the last 10 years?

RQ 3. What are the implementation issues of AI-based interventions for depression and anxiety?

2.2 Identifying literature sources

Eligibility criteria were developed based on the PICOS (Population, Intervention, Comparator, Outcome, and Study design) framework.²⁸ The population was the general population, including students, parents, patients, adults, and workers, because depression and anxiety occur across demographic groups and are not limited to specific clinical groups; this enables the effects of AI interventions to be evaluated across a range of real-world contexts and users. The intervention criteria focused on AI, including AI chatbots, ChatGPT, AI-based platforms, and other AI-based interventions, to distinguish the effects of AI from those of conventional interventions. The comparator was a non-AI intervention or an alternative AI-based design used as a control condition. Outcome measures focused on anxiety and depression, as these two conditions are the most common psychological disorders and are the most frequently targeted in AI-based interventions in the literature. Only studies using a randomised controlled trial (RCT) design were included, to maximise internal validity. Studies were excluded if they addressed only depression or only anxiety; used interventions that were not AI-based, such as other digital or conventional interventions; were cross-sectional studies, experimental studies other than RCTs, quasi-experimental studies, case reports, or non-empirical articles; or were written in languages other than English.

2.3 Conducting a literature search that answers the research questions

The literature search was conducted across four main databases: Scopus, CINAHL (EBSCO), Web of Science (WoS), and MEDLINE (PubMed), all of which are indexed in Scopus. The search covered the last 10 years (6 November 2015–6 November 2025) to ensure that the included studies remain relevant today. The literature search focused on articles relevant to the PICOS framework that discussed the population (general), intervention (AI-based), outcomes (depression and anxiety), and study design (experimental, RCT). The Boolean search string was (“artificial intelligence” OR AI) AND (depression) AND (anxiety) AND (“randomised controlled trial”). In addition, the search was limited to English-language articles. The inclusion and exclusion criteria are described in Table 1.

Table 1. Inclusion and exclusion criteria.

Inclusion	Exclusion
Articles published between 6 November 2015 and 6 November 2025	Articles published outside the range of 6 November 2015–6 November 2025
Articles written in English	Articles not written in English
Articles discussing anxiety and depression simultaneously	Articles that do not discuss anxiety and depression simultaneously
Study design using Randomised Controlled Trial (RCT)	Study design does not use a Randomised Controlled Trial (RCT)
Keywords using Boolean operators (“artificial intelligence” OR AI) AND (depression) AND (anxiety) AND (“randomised controlled trial”)	Articles other than those appearing in the Boolean search
Full-paper articles are available	Full-text articles are not available
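
The criteria in Table 1 can be read as a simple record filter. The following Python sketch is illustrative only: the search string and date window are those reported above, but the `Record` fields, the `is_eligible` helper, and the sample records are hypothetical and are not taken from the review's dataset or from Rayyan.

```python
from dataclasses import dataclass
from datetime import date

# Search string as reported in the Methods section.
SEARCH_STRING = (
    '("artificial intelligence" OR AI) AND (depression) AND (anxiety) '
    'AND ("randomised controlled trial")'
)

# Publication window reported in Table 1.
WINDOW_START = date(2015, 11, 6)
WINDOW_END = date(2025, 11, 6)


@dataclass
class Record:
    """Hypothetical bibliographic record; all field names are assumptions."""
    title: str
    published: date
    language: str
    design: str  # e.g. "RCT", "quasi-experimental", "review"
    covers_depression: bool
    covers_anxiety: bool
    full_text_available: bool


def is_eligible(r: Record) -> bool:
    """Apply the Table 1 inclusion criteria to a single record."""
    return (
        WINDOW_START <= r.published <= WINDOW_END
        and r.language == "English"
        and r.covers_depression
        and r.covers_anxiety
        and r.design == "RCT"
        and r.full_text_available
    )


# Made-up records, for illustration only.
records = [
    Record("Chatbot RCT", date(2024, 3, 1), "English", "RCT", True, True, True),
    Record("Old trial", date(2014, 5, 9), "English", "RCT", True, True, True),
    Record("Anxiety-only RCT", date(2023, 7, 2), "English", "RCT", False, True, True),
]

eligible = [r.title for r in records if is_eligible(r)]
print(eligible)
```

Only the first made-up record passes: the second falls outside the publication window, and the third does not address depression and anxiety together.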

Study selection was conducted by five reviewers (NMAPP, IGAAIRPD, RSA, KPNT, and MWNTN) using Rayyan AI. Articles obtained from the four databases were imported into Rayyan and automatically deduplicated. Initial screening was conducted by four reviewers (NMAPP, KPNT, IGAAIRPD, and MWNTN), who screened abstracts against the PICOS framework. This was followed by full-text screening, yielding 14 final articles. When disagreements arose, the senior reviewer (RSA) made the final decision on which articles were included.

2.4 Analysing the findings

The extraction stage was carried out using NotebookLM together with manual extraction by the author. For each of the 14 selected articles, the following data were extracted: publication information (author name, year of publication, country, study design, sample size, Scopus quartile), population characteristics, sampling techniques, interventions (duration and type), and findings. Data synthesis in this study was conducted using a narrative approach to interpret and integrate the findings, due to the conceptual and methodological diversity of the included studies.²⁹ The synthesis involved organizing study characteristics and results into comparative tables, identifying similar and recurring patterns, and examining relationships among interventions across the literature.²⁹

Data synthesis was performed narratively by organising the findings under the three research questions: the characteristics and patterns of research on AI interventions for anxiety and depression over the last 10 years; the effects of AI-based interventions; and the implementation issues of AI-based interventions for anxiety and depression. The extracted data are presented in tables and descriptive narratives. Reliability was supported by checking each article against the inclusion and exclusion criteria, conducting systematic selection using the PRISMA guidelines, extracting data using a standard format, and providing direct citations for all articles. The 14 selected papers are described in Table 2.

Table 2. 14 Selected papers.

Citation	Author name and year	Publisher	Scopus quartile
³²	(Akdogan et al., 2025)	Elsevier	Q1
³⁸	(Chen et al., 2025)	JMIR Publications	Q1
¹⁷	(Wang et al., 2025)	JMIR Publications	Q1
³⁶	(Xu & Ma, 2025)	Elsevier	Q1
¹⁸	(Heinz et al., 2025)	NEJM AI	Q1
³³	(Sharp et al., 2025)	JMIR Publications	Q1
³⁷	(Gan et al., 2025)	Wolters Kluwer	Q1
³⁵	(Zhao et al., 2024)	Wiley	Q1
¹⁹	(Karkosz et al., 2024)	JMIR Publications	Q2
⁶	(Sadeh-Sharvit et al., 2023)	JMIR Publications	Q1
²⁰	(Suharwardy et al., 2023)	Elsevier	Q2
²¹	(Danieli et al., 2022)	JMIR Publications	Q1
⁴⁰	(Klos et al., 2021)	JMIR Publications	Q2
³⁴	(Fulmer et al., 2018)	JMIR Publications	Q1

2.5 Risk of bias

The risk of bias in the selected studies was assessed using the Cochrane Risk of Bias 2 (RoB 2) tool, which is designed for randomised controlled trials (RCTs). The assessment covered five domains: (D1) bias arising from the randomisation process, (D2) bias due to deviations from the intended interventions, (D3) bias due to missing outcome data, (D4) bias in measurement of the outcome, and (D5) bias in selection of the reported result. For each domain, each study was categorised as low risk, some concerns, or high risk.³⁰

The classification was determined based on the signalling questions in RoB 2, including the information available in the research report. Then, the overall risk-of-bias assessment was conducted in accordance with the official RoB 2 guidelines, ensuring that the final decision remained consistent and accountable. To maintain consistency in the assessment, the assessment process was carried out systematically by recording the reasons behind each decision in each domain (e.g., whether the randomisation procedure was described, whether there was potential for intervention deviation, and so on). If unclear information was found in an article, it was noted as a consideration in determining the relevant domain category, in accordance with the principle of caution in RoB 2.³¹
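
The step from domain-level judgements to an overall judgement can be sketched in Python as follows. This is a minimal illustration of the rule in the RoB 2 guidance: overall risk is low only if all five domains are low, high if any domain is high, and some concerns otherwise. The guidance also allows an overall high rating when several domains raise some concerns in a way that substantially lowers confidence in the result; that step requires reviewer judgement and appears here only as an optional flag. The function name, the flag, and the example ratings are hypothetical and do not reproduce the assessments made in this review.

```python
from typing import Dict

DOMAINS = ("D1", "D2", "D3", "D4", "D5")
LEVELS = {"low", "some concerns", "high"}


def overall_rob2(
    judgements: Dict[str, str],
    concerns_substantially_lower_confidence: bool = False,
) -> str:
    """Derive an overall RoB 2 judgement from the five domain judgements."""
    missing = [d for d in DOMAINS if d not in judgements]
    if missing:
        raise ValueError(f"missing domain judgements: {missing}")

    values = [judgements[d] for d in DOMAINS]
    invalid = [v for v in values if v not in LEVELS]
    if invalid:
        raise ValueError(f"unknown judgement levels: {invalid}")

    # Any high-risk domain makes the overall judgement high.
    if "high" in values:
        return "high"

    n_concerns = values.count("some concerns")
    # All five domains low: overall low.
    if n_concerns == 0:
        return "low"

    # Several domains with some concerns may be rated high overall,
    # but only when the reviewer judges that confidence is substantially lowered.
    if n_concerns > 1 and concerns_substantially_lower_confidence:
        return "high"
    return "some concerns"


# Hypothetical example ratings, not taken from the included studies.
example = {"D1": "low", "D2": "some concerns", "D3": "low", "D4": "low", "D5": "low"}
print(overall_rob2(example))
```

Recording the domain judgements in a structure like this also keeps the reasoning behind each overall rating easy to audit.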

3. Results

RQ 1. What are the effects of AI-based interventions in reducing depression and anxiety?

Artificial intelligence (AI) holds potential for reducing symptoms of depression and anxiety due to its accessibility and complementary role in conventional care. Various AI tools, such as Large Language Model (LLM)-based agents, chatbots, and mobile applications, have shown effects in reducing symptoms of depression and anxiety.²⁶ Although AI shows significant effects in addressing mental health issues, it is a complement to, not a substitute for, professionals or therapists, and its safety and long-term effects still need to be considered.²⁶ The effects of AI reported in the 14 studies included in this systematic review are outlined in Table 4.

The effects of artificial intelligence-based interventions for depression and anxiety

Most studies suggest that AI-based interventions may help reduce symptoms of depression and anxiety, although the evidence remains heterogeneous.^6,18,32–36 Several controlled trials reported improvements in both outcomes; for example, ChatGPT-4.0 in digital counselling for cancer patients was associated with significant reductions in depression and anxiety compared with a control group.³² However, findings were not consistent across all studies. Some interventions were effective for only one outcome: Psy-Bot reduced depression but not anxiety,¹⁷ while ChatGPT in preoperative education reduced anxiety but not depression.³⁷ In addition, several studies reported improvements only within groups, with no significant differences compared with control conditions.^21,38 Overall, these findings indicate that AI-based interventions have potential, but their effects appear to vary depending on population, intervention type, and study context.

Individual outcomes

Primary outcomes

The primary focus of this study was to measure the effects of AI interventions in reducing symptoms of mental disorders. Thus, the main outcomes were as follows: (1) reduction in depression symptoms measured using clinical scales such as the Patient Health Questionnaire-9 (PHQ-9), Centre for Epidemiologic Studies Depression Scale (CES-D), Hospital Anxiety and Depression Scale (HADS-depression), and Edinburgh Postnatal Depression Scale (EPDS); (2) reduction in anxiety symptoms measured using instruments such as the Generalised Anxiety Disorder-7 (GAD-7), HADS-Anxiety, State-Trait Anxiety Inventory (STAI), and perioperative anxiety scales; and (3) clinically meaningful changes, assessed by whether AI-based tools provide improvements in symptoms that are equal to or greater than those achieved with conventional interventions.

Secondary outcomes

Secondary outcomes in this study extended beyond the use of AI in reducing symptoms of depression and anxiety, providing additional insights into improving life satisfaction and general well-being, as reported by 14.3% (2/14) of studies. A study conducted by Karkosz et al.¹⁹ revealed that the “Fido” application was not only effective in relieving anxiety symptoms but also significantly helped participants feel more satisfied with their daily lives. Other reported outcomes included reduced loneliness, improved mood regulation, and enhanced social functioning, primarily among students following a short-term chatbot intervention.^17,19,34,35

RQ 2. What are the characteristics and patterns of research on AI-based interventions for anxiety and depression over the past 10 years?

A systematic literature search was conducted²⁷ using four sources, namely Web of Science, Scopus, PubMed, and EBSCO, yielding 355 articles. Deduplication was performed using Rayyan AI, after which 287 articles underwent title and abstract screening. Of these, 270 articles were excluded for not meeting the inclusion criteria: inappropriate study design (n = 59), irrelevant interventions (n = 124), not focusing on depression and anxiety (n = 39), retracted articles (n = 1), and review articles (n = 47). A total of 17 articles were read in full, of which 3 were excluded because of irrelevant study designs (n = 2) or high risk of bias (n = 1). Thus, 14 studies were included in the final analysis, as illustrated in Figure 1.

Figure 1. PRISMA Flow diagram of study selection.

This figure presents the study selection process conducted in accordance with the PRISMA guidelines. It shows the number of records identified, the records screened by title and abstract, the full-text articles assessed, and the final studies included in this review, following the identification, screening, and inclusion stages.
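
The counts reported for the selection process can be checked for internal consistency with a few lines of Python. The sketch below uses only the numbers stated above. The number of duplicates removed is derived here as the difference between records identified and records screened, on the assumption that deduplication was the only step between them; it is not reported explicitly in the text.

```python
# Counts as reported in the study selection paragraph.
identified = 355
screened = 287

excluded_at_screening = {
    "inappropriate study design": 59,
    "irrelevant intervention": 124,
    "not focused on depression and anxiety": 39,
    "retracted article": 1,
    "review article": 47,
}
excluded_at_full_text = {
    "irrelevant study design": 2,
    "high risk of bias": 1,
}

# Derived value (assumption: the gap is entirely due to deduplication).
duplicates_removed = identified - screened

total_excluded_at_screening = sum(excluded_at_screening.values())
full_text_assessed = screened - total_excluded_at_screening
included = full_text_assessed - sum(excluded_at_full_text.values())

print(duplicates_removed, total_excluded_at_screening, full_text_assessed, included)
```

The exclusion reasons sum to the 270 records excluded at screening, leaving 17 full-text articles and 14 included studies, which matches the reported flow.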

Study characteristics

This section presents the main characteristics of the studies included in the systematic review, providing an overview of the research context analysed. The presentation of these characteristics is an important component of systematic reviews, helping to understand variations in study design, population, and interventions that underlie the interpretation of results,²⁷ as well as to explain the results of individual studies, as highlighted in previous systematic reviews.³⁹ Table 3 summarises the research design, sample size, population characteristics, and sampling techniques, types of AI-based interventions, duration, and main outcomes reported.

Table 3. Study characteristics.

Citation	Author, year of publication	Study design	Sample size	Population characteristics	Sampling techniques	Intervention type	Duration	Primary outcomes
³²	(Akdogan et al., 2025)	Two-Center RCT	n = 150 (75 control, 75 intervention)	Chemotherapy-naïve cancer patients. Median age: 64 years; 53.3% female	Randomized 1:1 (ChatGPT vs control)	ChatGPT 4.0	3 months	Reduction in anxiety (HADS-anxiety) and depression (HADS-depression) scores
³⁸	(Chen et al., 2025)	Pilot RCT	n = 103	Parents (general population)	Block randomization	AI Chatbot	5 months	Reduction of anxiety (GAD-7) and depression (PHQ-9) levels
¹⁷	(Wang et al., 2025)	RCT	n = 100 (50 control, 50 intervention)	University students. Mean age = 20.8; 62% female	Randomized 1:1 (Intervention vs Waitlist)	AI Chatbot named “Psy-Bot”	7 days	Depression (CES-D) and loneliness (UCLA Loneliness scale) and anxiety (GAD-7)
³⁶	(Xu & Ma, 2025)	Open-label RCT	n = 84 (HSC vs LSC chatbot)	College students; aged 18–28 years; 51.2% male	SPSS random number generator	Neil, an Artificial Intelligence (AI)-driven chatbot	16 weeks	Reduction in depression (PHQ-9) and anxiety (GAD-7) scores, including WAI-SR and CSQ-8
¹⁸	(Heinz et al., 2025)	RCT	n = 210 (intervention 106, waitlist control group 104)	Mean age 33.86 years; 59.52% female; positive CHR-FED	Computer-generated sequence	Therabot, a text-based multithreaded chat	4 weeks, with follow-up at 8 weeks	Changes in symptoms of MDD (PHQ-9), GAD (GAD-7), and CHR-FED (WCS)
³³	(Sharp et al., 2025)	Two-arm RCT	n = 60 (intervention 30, control 30)	People on waitlists for eating disorder treatment. Age: ≥ 16 years	This multicenter 2-armed RCT	The ED ESSI chatbot	4 months and three days	Eating disorder pathology
³⁵	(Zhao et al., 2025)	RCT	n = 865 (intervention 269, control 388)	Mean age 20.59 years; 61.8% female	Simple randomization	Douyin companion bot	28 days	Depression, anxiety, positive and negative moods
³⁷	(Gan et al., 2025)	Single-blind, pilot RCT	n = 55 (intervention 27, control 28)	Patients with knee osteoarthritis. Age: 45–80 years	Single-blind, randomized controlled pilot study	ChatGPT 4.0	3 months	Perioperative anxiety and patient satisfaction
¹⁹	(Karkosz et al., 2024)	Two-arm, open-label RCT	n = 81 (intervention 40, control 41)	Participants with subclinical depression or anxiety	Two-arm, open-label RCT	Fido chatbot	2 weeks intervention and 1 month follow up	Depression (CESD-R, PHQ-9), anxiety (STAI), worry tendencies (PSWQ), satisfaction with life (SWLS), and loneliness (R-UCLA)
⁶	(Sadeh-Sharvit et al., 2023)	RCT	n = 47 total adult consented (AI group n = 23; TAU group n = 24)	Adults with depression or anxiety. Mean age = 30.64 years; 72% female	Therapist-level randomization	AI Platform (Eleos Health)	2 months	Feasibility and acceptability of AI platform, changes in depression (PHQ-9) and anxiety (GAD-7) symptoms
²⁰	(Suharwardy et al., 2023)	Single-center RCT	n = 192 (intervention 96, control 96)	Postpartum women aged ≥18 years; mean age 34 years	Block randomization	Woebot (mental health chatbot)	6 weeks	Depression measured by PHQ-9 and EPDS
²¹	(Danieli et al., 2022)	RCT	n = 60 (SMT-CBT 16, SMT-CBT PHA 16, PHA 14, test only 14)	Active workers with stress and anxiety. Age ≥ 55 years; 78% female	RCT random number generator	Traditional psychotherapy CBT, AI agent, and TEO	8 weeks	Symptoms related to stress, anxiety, and depression
⁴⁰	(Klos et al., 2021)	Pilot RCT	n = 181 (82 control, 99 intervention); completers: 34 control, 39 intervention	College students. Age 18–33 years; 87.2% female	Simple randomization	Tess, an Artificial Intelligence (AI)-based chatbot	8 weeks	Preliminary data comparison of depression (PHQ-9) and anxiety (GAD-7) symptoms, focusing on viability and acceptability
³⁴	(Fulmer et al., 2018)	RCT	n = 74 (2 test n = 50, 1 control n = 24)	College students. Mean age 22.9 years; 70% female	Computer-based randomization	Tess, an Artificial Intelligence (AI)-based chatbot	group 1: 2 weeks, group 2: 4 weeks	Reduction of symptoms of depression (PHQ-9) and anxiety (GAD-7) and measured PANAS

Table 4. The effects of AI-based interventions in reducing depression and anxiety.

Citation	Author, year	Intervention	Comparator	Primary outcomes	Outcome interpretation
³²	(Akdogan et al., 2025)	ChatGPT 4.0	Standard clinician-led education group	Anxiety (HADS-anxiety) and depression (HADS-depression)	Effective for both outcomes
³⁸	(Chen et al., 2025)	AI Chatbot	Nurse hotline	Anxiety (GAD-7) and depression (PHQ-9)	Significant within-group improvement only
¹⁷	(Wang et al., 2025)	AI Chatbot “Psy-Bot”	Waitlist control	Depression (CES-D) and loneliness (UCLA Loneliness scale) and anxiety (GAD-7)	Effective for depression only
³⁶	(Xu & Ma, 2025)	Neil, AI chatbot (text + voice + animations)	LSC group (text only)	Depression (PHQ-9) and anxiety (GAD-7)	Effective for both outcomes
¹⁸	(Heinz et al., 2025)	Therabot, a text-based multithreaded chat	Waitlist	MDD (PHQ-9), GAD (GAD-7), and CHR-FED (WCS)	Effective for both outcomes
³³	(Sharp et al., 2025)	The ED ESSI chatbot	Web-based information	Eating disorder pathology (EDE-Q), Psychosocial impairment (CIA), depression, anxiety, stress (DASS-21)	Effective for both outcomes
³⁵	(Zhao et al., 2025)	Douyin companion bot	Waiting list group	Depression (PHQ-9), anxiety (GAD-7), positive and negative moods (PANAS)	Effective for both outcomes
³⁷	(Gan et al., 2025)	ChatGPT 4.0	Traditional physician explanation	Anxiety/Depression (HADS), Perioperative Apprehension Scale-7 (PAS-7), and Visual Analogue Scales for Anxiety (VAS-A, VAS-P)	Effective for anxiety only
¹⁹	(Karkosz et al., 2024)	Fido chatbot	Self-help book	Depression (CESD-R, PHQ-9), anxiety (STAI), worry tendencies (PSWQ), satisfaction with life (SWLS), and loneliness (R-UCLA)	Both groups improved; null between-group effect
⁶	(Sadeh-Sharvit et al., 2023)	AI Platform (Eleos Health)	Treatment as usual	Depression (PHQ-9) and anxiety (GAD-7) symptoms	Effective for both outcomes
²⁰	(Suharwardy et al., 2023)	Woebot (mental health chatbot)	Usual postpartum care	Depression measured by PHQ-9 and EPDS	Effective for depression only
²¹	(Danieli et al., 2022)	AI agent and TEO	Traditional therapy	Stress, anxiety, and depression	Null between-group; some within-group improvements
⁴⁰	(Klos et al., 2021)	Tess, an AI-based chatbot	Psychoeducation book	Depression (PHQ-9) and anxiety (GAD-7)	Null between-group; anxiety decreased within group
³⁴	(Fulmer et al., 2018)	Tess, an AI-based chatbot	Information-only control	Depression (PHQ-9), anxiety (GAD-7), and PANAS	Effective for both outcomes
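
The outcome interpretations in Table 4 can be tallied to show how the evidence is distributed. In the Python sketch below, the per-study entries follow Table 4, but collapsing them into four categories is a simplification introduced here for illustration: studies with within-group improvement only and studies with a null between-group effect are grouped together as `"no between-group difference"`.

```python
from collections import Counter

# Citation number -> simplified outcome category (grouping is an assumption).
outcome_by_study = {
    32: "both outcomes",
    38: "no between-group difference",
    17: "depression only",
    36: "both outcomes",
    18: "both outcomes",
    33: "both outcomes",
    35: "both outcomes",
    37: "anxiety only",
    19: "no between-group difference",
    6: "both outcomes",
    20: "depression only",
    21: "no between-group difference",
    40: "no between-group difference",
    34: "both outcomes",
}

tally = Counter(outcome_by_study.values())
n_studies = len(outcome_by_study)

# Print each category with its count and share of the included studies.
for category, count in tally.most_common():
    print(f"{category}: {count}/{n_studies} ({100 * count / n_studies:.1f}%)")
```

Under this grouping, half of the included trials report effects on both outcomes, while the remainder are outcome-specific or show no advantage over the comparator, which is the heterogeneity described in the narrative synthesis.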

Research design

All 14 included studies were randomised controlled trials (RCTs), encompassing variations such as two-centre RCTs,³² single-centre RCTs,²⁰ and two-arm RCTs.^19,33 Some studies were designed as pilot RCTs,^37,38,40 while other studies were reported simply as RCTs.^6,17,18,21,34,35

Geographic distribution of studies

Fourteen articles published between 2015 and 2025 consistently examined the effects of AI-based interventions in reducing depression and anxiety. The distribution of publications across years was as follows: 2018 (7%), 2021 (7%), 2022 (7%), 2023 (14%), 2024 (7%), and the majority in 2025 (57%). Four studies were conducted in the United States^6,18,20,34 and one study was conducted in Argentina.⁴⁰ Studies in Europe were conducted in Poland¹⁹ and Italy.²¹ Studies in Asia were conducted in Turkey,³² Hong Kong,³⁸ and China.^17,35–37 Oceania was represented by one study in Australia.³³ Figure 2(A) presents the distribution of publication years of the 14 selected articles from 2015 to 2025. Meanwhile, Figure 2(B) illustrates the geographical distribution of studies on AI-based interventions for depression and anxiety between 2015 and 2025.

Figure 2. (A) Distribution of studies by publication year. (B) Geographical distribution of studies.

Panel A illustrates the distribution of the 14 studies included in the review according to their year of publication between 2015 and 2025. Panel B shows the countries in which the included studies on AI-based interventions for depression and anxiety were conducted.

Sample size and demographics

The sample sizes across the 14 studies ranged from small to moderate. Only one study had a moderate-sized sample of more than 500 participants (n = 865),³⁵ while all other studies involved fewer than 500 participants. Gender distribution varied across studies, with most studies reporting predominantly female samples. The age range spanned from adolescents and young adults (university students) to adults and the elderly. The populations were heterogeneous and included students, people in specific medical contexts (e.g., cancer, knee osteoarthritis, postpartum care), individuals with specific psychological problems (eating disorders, depression and anxiety, and work-related stress), and general populations such as parents, adults, and workers.

AI intervention

These digital interventions take various forms and are designed to address the limitations of traditional mental health services. The identified interventions include: (1) chatbots and conversational agents, the most common form, including text-based applications such as AI Chatbot, Tess, Woebot, Psy-Bot, Fido, Therabot, and ED ESSI^{17–20,33,34,36,38,40}; (2) large language models (LLMs) such as ChatGPT (version 4.0), which are used as digital counselling agents or companions to provide medical information and emotional support^32,35,37; (3) integrated AI platforms, such as the Eleos Health system, which supports conventional therapy by monitoring patient progress and improving therapist efficiency⁶; and (4) passive behaviour monitoring systems, which detect early signs of psychological stress through passive digital behaviour tracking.²¹ The interventions from the selected articles are visualised in Figure 3.

Figure 3. Type of AI-based interventions.

This figure illustrates the types of AI-based interventions identified in the 14 studies included in the review.

Duration

The duration of AI-based tool use in the review was categorised into three time frames: (1) Short term (7 days to 4 weeks), in which interventions were designed to provide rapid emotional support or triage; for example, Psy-Bot was used for 7 days, Tess for 3–4 weeks, and LLM-based chatbots for 28 days. Interventions using Socratic questioning and Therabot also lasted 2–4 weeks. (2) Medium term (6 weeks to 3 months), which is typically used to assess more stable clinical effects; for example, Woebot was used for 6 weeks, Tess and the TEO platform for 8 weeks, and the Eleos Health platform for 2 months. ChatGPT 4.0 in medical contexts (e.g., cancer and orthopaedic patients) was generally used for 3 months. (3) Long term (about 4 months or longer), which involves more complex or monitoring-based interventions; for example, the Neil chatbot (16 weeks) and the ED ESSI chatbot (over 4 months). The duration of interventions from the 14 selected articles is visualised in Figure 4.
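The three time frames can be expressed as a simple classification rule. The following is a minimal sketch, not the procedure used in the review: the cut-offs of 28 and 92 days are assumptions chosen here to close the gaps between the bands described above, a month is taken as 30 days, and `duration_band` is a hypothetical helper name.

```python
# Cut-offs in days. These are assumptions chosen to close the gaps between
# the bands described in the text; they are not taken from the review.
SHORT_MAX_DAYS = 28    # up to 4 weeks
MEDIUM_MAX_DAYS = 92   # up to roughly 3 months


def duration_band(days):
    """Return 'short', 'medium', or 'long' for an intervention length in days."""
    if days <= 0:
        raise ValueError("duration must be positive")
    if days <= SHORT_MAX_DAYS:
        return "short"
    if days <= MEDIUM_MAX_DAYS:
        return "medium"
    return "long"


# Durations as reported in the text, converted to days (1 month = 30 days).
reported_days = {
    "Psy-Bot": 7,
    "LLM-based chatbot": 28,
    "Woebot": 6 * 7,
    "Eleos Health": 2 * 30,
    "Neil": 16 * 7,
}

bands = {name: duration_band(days) for name, days in reported_days.items()}
for name, band in bands.items():
    print(f"{name}: {reported_days[name]} days -> {band}")
```

Under these assumed cut-offs, a 16-week intervention falls in the long band, which matches how the Neil chatbot is grouped in the text.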

Figure 4. AI Intervention duration.

This figure shows the duration of AI-based interventions reported in the 14 studies included in the review.

Risk of bias

Figure 5(B) summarises the risk-of-bias assessment for the 14 trials included in this review. Overall, 9 studies were classified in the “some concerns” category (64.3%), 3 studies were assessed as low risk (21.4%), and 2 studies (14.3%) were judged to have a high risk of bias. These findings indicate that although the available evidence is generally promising, several studies still present methodological limitations, so their results should be interpreted with caution.

Figure 5. (A) Summary of risk of bias assessments for individual studies. (B) Distribution of risk of bias across studies.

Panel A summarises the risk of bias assessments for each included study across the evaluated domains. Panel B illustrates the proportion of studies rated as low risk, some concerns, or high risk across each risk of bias domain and overall risk of bias.

Across domains, the most notable limitation was bias arising from deviations from intended interventions (D2), which was the only domain contributing to the high-risk ratings in this review. In contrast, bias due to missing outcome data (D3) was less problematic, with most studies classified as low risk in this domain. For the remaining domains—bias arising from the randomisation process (D1), bias in measurement of the outcome (D4), and bias in selection of the reported result (D5)—the most common judgement was “some concerns”, generally reflecting incomplete or unclear reporting of methodological procedures rather than clear evidence of serious bias. The detailed distribution of risk-of-bias judgements is presented in Figures 5(A) and 5(B).
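The relationship between domain-level and overall ratings can be sketched in code. The example below is a simplified illustration under stated assumptions: it rates a trial as high risk if any domain is high, as “some concerns” if any domain raises some concerns, and as low risk otherwise. It omits the RoB 2 provision that several “some concerns” domains may together justify a high-risk rating, which requires reviewer judgement. The three example trials are hypothetical and invented for illustration; only the overall counts (3, 9, and 2) come from this review.

```python
from collections import Counter

LOW, SOME, HIGH = "low", "some concerns", "high"


def overall_judgement(domains):
    """Simplified overall rating from the five domain ratings (D1 to D5)."""
    ratings = list(domains.values())
    if HIGH in ratings:
        return HIGH
    if SOME in ratings:
        return SOME
    return LOW


# Hypothetical trials, invented for illustration only (not review data).
example_trials = {
    "Trial A": {"D1": LOW, "D2": LOW, "D3": LOW, "D4": LOW, "D5": LOW},
    "Trial B": {"D1": SOME, "D2": LOW, "D3": LOW, "D4": SOME, "D5": SOME},
    "Trial C": {"D1": SOME, "D2": HIGH, "D3": LOW, "D4": LOW, "D5": SOME},
}
overall = {name: overall_judgement(d) for name, d in example_trials.items()}
tally = Counter(overall.values())

# Overall counts as reported in the review (n = 14), converted to percentages.
reported = {LOW: 3, SOME: 9, HIGH: 2}
total = sum(reported.values())
reported_pct = {k: round(100 * n / total, 1) for k, n in reported.items()}

print(overall)
print(reported_pct)
```

Because D2 was the only domain rated high in this review, a rule of this kind would assign the two high-risk overall ratings on the basis of D2 alone.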

RQ 3. What are the implementation issues of AI-based interventions for depression and anxiety?

Most studies employed passive control conditions (e.g., usual care or waitlist), which may limit the ability to control for placebo effects. Only a small number of studies used active control groups (e.g., psychoeducation, books, or nurse hotlines),^19,38,40 and this scarcity may reduce inferential strength. Moreover, only one study involved a therapist in a face-to-face setting when delivering AI-based CBT,⁶ while another study involved direct responses from a physician.³⁷ Human support, however, appears to play an important role in influencing adherence and intervention effects.

Digital and social media–based recruitment methods tend to attract self-selected, technologically literate populations. As a result, there is a risk of selection bias, whereby the findings may not fully represent populations with lower digital literacy or those with limited access to devices due to economic constraints. In addition, study samples are often drawn from a single population segment. For example, postpartum studies may include only women without severe depression, while eating disorder studies may recruit only adolescents on waiting lists, thereby limiting generalisability. Furthermore, follow-up periods are relatively short, typically ranging from 2 to 8 weeks, making it difficult to assess long-term effects.

Heinz et al.¹⁸ highlighted another issue related to engagement, characterised by a decline in user participation over time (i.e., low retention). This pattern is often attributed to a “novelty effect”, which diminishes after the initial sessions. In addition, Sharp et al.³³ indicated challenges in integrating chatbots into standard clinical care systems, particularly for patients on waiting lists. Other findings suggest that the transition from rule-based chatbots to those based on large language models (LLMs), such as ChatGPT, introduces new challenges related to personalisation and safety. Although generative AI enables more natural interactions, concerns regarding data privacy and the potential for medical “hallucinations” remain prominent in implementation within formal healthcare settings.³²

4. Discussion

This review suggests that AI-based interventions have the potential to reduce symptoms of depression and anxiety in both general and clinical populations. Several studies reported short-term symptom improvement, indicating that AI may be considered a supportive tool in mental health services.^6,18,34,38 However, these findings should be interpreted with caution due to the limited number of studies and the substantial heterogeneity observed. While some interventions demonstrated greater symptom reduction compared to standard care, others reported non-significant results. For example, text-based interventions did not consistently provide additional benefits compared to established self-help approaches.¹⁹

Design factors play an important role in determining intervention effects. Approaches that incorporate richer social cues, such as voice or visual elements, tend to produce better outcomes than purely text-based approaches.^35,38 The sustainability of intervention effects also warrants attention. Several studies have reported a decline in effects during follow-up periods, particularly in the absence of human support.¹⁷ In addition, technical limitations, such as repetitive responses and failures in user intent recognition, may disrupt the therapeutic alliance and reduce user adherence.¹⁹

This review has several limitations that should be considered. The relatively small number of included studies, combined with substantial heterogeneity in intervention types, study designs, and outcome measures, limits the generalisability of the findings. In particular, the reviewed studies encompassed a wide range of AI approaches, including rule-based chatbots such as Tess, large language models (LLMs) such as ChatGPT-4.0 and Therabot, AI platforms that support clinical practice such as Eleos Health, and passive behavioural monitoring systems that detect or estimate psychological conditions through digital behavioural data. These differences suggest that each technology operates through distinct mechanisms, varies in its level of autonomy, and is applied to different clinical purposes. Accordingly, the core components contributing to intervention effects may not be consistent across studies. As a result, differences in effect outcomes between studies may not solely reflect whether an intervention is effective, but also the diversity of technologies being evaluated. Therefore, this review is more appropriately understood as an examination of diverse forms of AI-based mental health interventions, rather than an evaluation of a single uniform AI model.

Despite the potential of AI-based interventions to reduce symptoms of depression and anxiety, their real-world implementation remains constrained by several practical challenges. User engagement is a recurring concern, as many interventions demonstrate strong short-term outcomes but declining adherence over time, suggesting a novelty effect and limited sustained interaction. In addition, technological limitations, particularly in simpler chatbot systems, such as repetitive responses, limited contextual understanding, and failures in intent recognition, may weaken user trust and reduce the quality of the therapeutic experience. These issues highlight that effects observed under controlled conditions do not always translate directly into consistent real-world use. Another limitation of the statistical findings in most of the studies was the absence of reported confidence intervals, which limited the precision with which effect sizes could be interpreted. In addition, recruitment procedures lacked standardisation, allowing for the possibility of confounding variables that may have influenced the findings. Although all 14 studies were published in Scopus Q1–Q2 indexed journals, their findings should still be interpreted with caution due to methodological limitations, as the majority were rated as having “some concerns” regarding risk of bias and two of the fourteen studies were classified as having a high risk of bias. Therefore, future research is recommended to employ more rigorous selection and randomisation procedures in order to strengthen the findings.

Beyond technical and behavioural factors, implementation is further shaped by clinical, ethical, and contextual constraints. The integration of AI into existing healthcare workflows remains limited, with unclear role definitions between AI systems and human practitioners, often positioning AI as a supplementary rather than a fully embedded tool. At the same time, concerns related to data privacy, clinical safety, and accountability persist, particularly in high-risk situations where AI may not adequately respond to severe psychological distress. Furthermore, the predominance of studies conducted in digitally literate populations raises questions about generalisability across broader and more diverse contexts. These findings suggest that successful implementation depends not only on technological capability but also on sustained engagement design, ethical safeguards, and alignment with clinical practice.

5. Strengths, limitations, and recommendations for future studies on AI-based interventions for anxiety and depression

5.1 Strengths of this review

This systematic review has several clear strengths. First, it follows PRISMA guidelines, making the review more transparent, organised, and easier to trace. Second, it deliberately includes only RCTs, yielding higher-quality evidence than a mix of study designs would. Third, rather than addressing mental health in general, it specifically examines AI interventions that target two outcomes simultaneously: depression and anxiety. Fourth, the scope of AI interventions is broad, ranging from chatbots and large language models to integrated AI platforms, machine learning-based prediction systems, and passive behaviour monitoring, so the interpretation of results is not confined to a single technology type. Finally, this review does not stop at summarising the results; it also includes a formal assessment of study quality using the Cochrane Risk of Bias 2 (RoB 2) tool. Assessing potential bias across five domains allows the reported results to be read more fairly, making it easier to identify which findings are strong and which require more careful interpretation. These strengths are summarised in Figure 6.

Figure 6. Research strengths.

This figure presents the key strengths identified across the 14 studies included in the review.

5.2 Study limitations

Although this review has several strengths, it also has limitations that need to be acknowledged. First, the included studies show considerable heterogeneity in the types of AI interventions, exposure durations, outcome measurement instruments, and participant characteristics. Second, although all studies used an RCT design, implementation quality was not always consistent. The risk of bias assessment shows that most trials remain in the “some concerns” category, and a small number are at high risk. Third, the scope for generalising the findings appears to be limited. The majority of trials were conducted in middle- and high-income countries, with participants who tended to be younger, mostly female, and digitally literate. Fourth, most studies had relatively short to medium follow-up periods, so the sustainability of the effects has not been fully addressed. Finally, this review included only English-language publications, so there may be a language bias, and some relevant studies in other languages may not have been accessible. These limitations are illustrated in Figure 7.

Figure 7. Limitations of the research.

This figure summarises the key limitations identified across the 14 studies reviewed.

5.3 Recommendations for future research

Several recommendations were made in the studies included in this review. First, future research should involve larger samples and more diverse populations. Second, the effects of conventional therapy should be compared with those of technology-based, face-to-face complementary therapies, such as those integrating virtual reality, teletherapy, website-based therapy, and other AI interventions. Third, longer intervention durations should be accompanied by follow-up sessions to assess the intervention’s long-term effects. Fourth, attention should be paid to participant safety and to testing effects under strict protocols. Fifth, interventions should be provided for higher-risk and more severe clinical disorders. Sixth, the sustained impact of the interventions should be explored. Finally, human involvement alongside AI should be increased to enhance treatment impact, user satisfaction, and intervention usefulness. The recommendations from the 14 included articles are summarised in Figure 8.

Figure 8. Recommendations for future research.

This figure explains the recommendations for future research based on the evidence identified in the 14 included studies.

6. Conclusion

AI-based interventions show potential for reducing symptoms of depression and anxiety; however, the current evidence remains preliminary and heterogeneous. The reviewed studies varied substantially in intervention type, study design, population, and implementation context, and several raised concerns regarding risk of bias. Accordingly, the findings should not be interpreted as evidence that AI is broadly superior to standard care. Rather, AI appears to be a potentially useful supportive approach, with effects dependent on context, therapeutic design, and implementation quality. Future studies should employ more rigorous and standardised methods, include more diverse populations, and report long-term, safety, and implementation outcomes more clearly.

Data availability

All data and materials supporting the findings of this systematic review, including the PRISMA flow diagram, PRISMA checklist, and extracted data, are openly available in Open Science Framework (DOI: https://doi.org/10.17605/OSF.IO/7FAUP)⁴¹ under a CC-By Attribution 4.0 license.

Acknowledgements

The authors would like to express their gratitude to the Indonesia Endowment Fund for Education (LPDP) and the Ministry of Finance, Republic of Indonesia, for funding their master’s and doctoral studies. The authors also sincerely thank Adhan Efendi, M.Pd., for his valuable insights and constructive feedback during the preparation of this manuscript.

References

1. Joshi AC, Ghogare AS, Madavi PB: Systematic review of artificial intelligence enabled psychological interventions for depression and anxiety: A comprehensive analysis. Ind. Psychiatry J. 2025 May; 34(2): 158–166.
2. Bhattacharya R, Shen C, Sambamoorthi U: Excess risk of chronic physical conditions associated with depression and anxiety. BMC Psychiatry. 2014 Dec; 14(1): 10.
3. Karustis JL, Power TJ, Rescorla LA, et al.: Anxiety and depression in children with ADHD: Unique associations with academic and social functioning. J. Atten. Disord. 2000 Nov 1; 4(3): 133–149.
4. Armbrecht E, Shah A, Schepman P, et al.: Economic and humanistic burden associated with noncommunicable diseases among adults with depression and anxiety in the United States. J. Med. Econ. 2020 Sep 1; 23(9): 1032–1042.
5. Rollwage M, Habicht J, Juechems K, et al.: Using Conversational AI to Facilitate Mental Health Assessments and Improve Clinical Efficiency Within Psychotherapy Services: Real-World Observational Study. JMIR AI. 2023 Dec 13; 2: e44358.
6. Sadeh-Sharvit S, Camp TD, Horton SE, et al.: Effects of an Artificial Intelligence Platform for Behavioral Interventions on Depression and Anxiety Symptoms: Randomized Clinical Trial. J. Med. Internet Res. 2023 Jul 10; 25: e46781.
7. Saha S, Lim CCW, Cannon DL, et al.: Co-morbidity between mood and anxiety disorders: A systematic review and meta-analysis. Depress. Anxiety. 2021 Mar; 38(3): 286–306.
8. Terlizzi E, Zablotsky, Benjamin B: Symptoms of Anxiety and Depression Among Adults: United States, 2019 and 2022. National Center for Health Statistics (U.S.); 2024 Nov [cited 2026 Jan 23].
9. Remes O, Mendes JF, Templeton P: Biological, Psychological, and Social Determinants of Depression: A Review of Recent Literature. Brain Sci. 2021 Dec 10; 11(12): 1633.
10. Arango-Dávila CA, Rincón-Hoyos HG: Depressive disorder, anxiety disorder and chronic pain: Multiple manifestations of a common clinical and pathophysiological core. Revista Colombiana de Psiquiatría (English ed). 2018 Jan; 47(1): 46–55.
11. Lkhagvasuren B, Hiramoto T, Bat-Erdene E, et al.: Anxiety, depression, and brain overwork in the general population of Mongolia. Sci. Rep. 2024 Jan 30; 14(1): 2484.
12. Ebert DD, Mortier P, Kaehlke F, et al.: Barriers of mental health treatment utilization among first-year college students: First cross-national results from the WHO World Mental Health International College Student Initiative. 2019; 28(2): e1782.
13. Nyakhar S, Wang H: Effectiveness of artificial intelligence chatbots on mental health & well-being in college students: a rapid systematic review. Front. Psych. 2025 Oct 21; 16: 1621768.
14. Palmer CE, Marshall E, Millgate E, et al.: Combining Artificial Intelligence and Human Support in Mental Health: Digital Intervention With Comparable Effectiveness to Human-Delivered Care. J. Med. Internet Res. 2025 May 13; 27: e69351.
15. Lau Y, Ang WHD, Ang WW, et al.: Artificial Intelligence–Based Psychotherapeutic Intervention on Psychological Outcomes: A Meta-Analysis and Meta-Regression. Lamela D, editor. Depress. Anxiety. 2025 Jan; 2025(1): 8930012.
16. Li H, Zhang R, Lee YC, et al.: Systematic review and meta-analysis of AI-based conversational agents for promoting mental health and well-being. npj Digit Med. 2023 Dec 19; 6(1): 236.
17. Wang Y, Li X, Zhang Q, et al.: Effect of a Cognitive Behavioral Therapy–Based AI Chatbot on Depression and Loneliness in Chinese University Students: Randomized Controlled Trial With Financial Stress Moderation. JMIR Mhealth Uhealth. 2025 Aug 29; 13: e63806.
18. Heinz MV, Mackin DM, Trudeau BM, et al.: Randomized Trial of a Generative AI Chatbot for Mental Health Treatment. NEJM AI. 2025 Mar 27; 2(4).
19. Karkosz S, Szymański R, Sanna K, et al.: Effectiveness of a Web-based and Mobile Therapy Chatbot on Anxiety and Depressive Symptoms in Subclinical Young Adults: Randomized Controlled Trial. JMIR Form Res. 2024 Mar 20; 8: e47960.
20. Suharwardy S, Ramachandran M, Leonard SA, et al.: Feasibility and impact of a mental health chatbot on postpartum mental health: a randomized controlled trial. AJOG Global Reports. 2023 Aug; 3(3): 100165.
21. Danieli M, Ciulli T, Mousavi SM, et al.: Assessing the Impact of Conversational Artificial Intelligence in the Treatment of Stress and Anxiety in Aging Adults: Randomized Controlled Trial. JMIR Ment Health. 2022 Sep 23; 9(9): e38067.
22. He Y, Yang L, Zhu X, et al.: Mental Health Chatbot for Young Adults With Depressive Symptoms During the COVID-19 Pandemic: Single-Blind, Three-Arm Randomized Controlled Trial. J. Med. Internet Res. 2022 Nov 21; 24(11): e40719.
23. Villarreal-Zegarra D, Reategui-Rivera CM, García-Serna J, et al.: Self-Administered Interventions Based on Natural Language Processing Models for Reducing Depressive and Anxiety Symptoms: Systematic Review and Meta-Analysis. JMIR Ment Health. 2024 Aug 21; 11: e59560.
24. Amjad A, Kordel P, Fernandes G: A Review on Innovation in Healthcare Sector (Telehealth) through Artificial Intelligence. 2023; 15(8): 6655.
25. Feng Y, Hang Y, Wu W, et al.: Effectiveness of AI-Driven Conversational Agents in Improving Mental Health Among Young People: Systematic Review and Meta-Analysis. J. Med. Internet Res. 2025 May 14; 27: e69639.
26. Pavlopoulos A, Rachiotis T, Maglogiannis I: An Overview of Tools and Technologies for Anxiety and Depression Management Using AI. Appl. Sci. 2024 Oct 8; 14(19): 9068.
27. Page MJ, McKenzie JE, Bossuyt PM, et al.: The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. 2021; 372.
28. Higgins JPT, Thomas J, Chandler J, et al.: Cochrane handbook for systematic reviews of interventions. Version 6.5, 2024. London, United Kingdom: Cochrane; 2024.
29. Popay J, Roberts H, Sowden A, et al.: Guidance on the conduct of narrative synthesis in systematic reviews: A product from the ESRC Methods Programme. 2006.
30. Nejadghaderi SA, Balibegloo M, Rezaei N: The Cochrane risk of bias assessment tool 2 (RoB 2) versus the original RoB: A perspective on the pros and cons. Health Science Reports. 2024 Jun; 7(6): e2165.
31. Smith KW, Freeman NLB, Bir A: Assessing risk of bias in the meta-analysis of round 1 of the Health Care Innovation Awards. Syst. Rev. 2024 Jan 22; 13(1): 36.
32. Akdogan O, Uyar GC, Yesilbas E, et al.: Effect of a ChatGPT-based digital counseling intervention on anxiety and depression in patients with cancer: A prospective, randomized trial. Eur. J. Cancer. 2025 May; 221: 115408.
33. Sharp G, Dwyer B, Randhawa A, et al.: The Effectiveness of a Chatbot Single-Session Intervention for People on Waitlists for Eating Disorder Treatment: Randomized Controlled Trial. J. Med. Internet Res. 2025 May 21; 27: e70874.
34. Fulmer R, Joerin A, Gentile B, et al.: Using Psychological Artificial Intelligence (Tess) to Relieve Symptoms of Depression and Anxiety: Randomized Controlled Trial. JMIR Ment Health. 2018 Dec 13; 5(4): e64.
35. Zhao Y, Qian W, Chen Y, et al.: Effect of an AI agent trained on a large language model (LLM) as an intervention for depression and anxiety symptoms in young adults: A 28-day randomized controlled trial. Applied Psych Health & Well. 2025 Oct; 17(5): e70067.
36. Xu S, Ma T: Depression intervention using AI chatbots with social cues: a randomized trial of effectiveness. J. Affect. Disord. 2025 Nov; 389: 119760.
37. Gan W, Ouyang J, She G, et al.: ChatGPT’s role in alleviating anxiety in total knee arthroplasty consent process: a randomized controlled trial pilot study. Int. J. Surg. 2025 Mar; 111(3): 2546–2557.
38. Chen C, Lam KT, Yip KM, et al.: Comparison of an AI Chatbot With a Nurse Hotline in Reducing Anxiety and Depression Levels in the General Population: Pilot Randomized Controlled Trial. JMIR Hum. Factors. 2025 Mar 6; 12: e65785.
39. Ilmaskal R, Prabandari YS, Oktaria V, et al.: Impact of Digital-based Intervention on Smoking Abstinence and Cessation among Adolescents: A Systematic Review. Asian Journal of Social Health and Behavior. 2026 Jan; 9(1): 1–14.
40. Klos MC, Escoredo M, Joerin A, et al.: Artificial Intelligence–Based Chatbot for Anxiety and Depression in University Students: Pilot Randomized Controlled Trial. JMIR Form Res. 2021 Aug 12; 5(8): e20678.
41. Puspitarani NMAP, Devi IGAAIRP, Ngey MWNT, et al.: Data and supplementary materials for: The Effects of Artificial Intelligence-Based Interventions on Depression and Anxiety: A Systematic Review. 2026.

Wiwin Hendriani
Roles: Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

The research was funded by the Indonesia Endowment Fund for Education (LPDP), Ministry of Finance, Republic of Indonesia.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Published: 18 Jun 2026, 15:964

https://doi.org/10.12688/f1000research.181969.1

Copyright

© 2026 Puspitarani NMAP et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

how to cite this article

Puspitarani NMAP, Devi IGAAIRP, Ngey MWNT et al. The Effects of Artificial Intelligence-Based Interventions on Depression and Anxiety: A Systematic Review [version 1; peer review: awaiting peer review]. F1000Research 2026, 15:964 (https://doi.org/10.12688/f1000research.181969.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 18 Jun 2026

Open Peer Review

Reviewer Status

AWAITING PEER REVIEW

Comments on this article

All Comments(0)

Add a comment

Sign up for content alerts

Browse by related subjects

1. Joshi AC, Ghogare AS, Madavi PB: Systematic review of artificial intelligence enabled psychological interventions for depression and anxiety: A comprehensive analysis. Ind. Psychiatry J. 2025 May; 34(2): 158–166.

2. Bhattacharya R, Shen C, Sambamoorthi U: Excess risk of chronic physical conditions associated with depression and anxiety. BMC Psychiatry. 2014 Dec; 14(1): 10.

3. Karustis JL, Power TJ, Rescorla LA, et al.: Anxiety and depression in children with ADHD: Unique associations with academic and social functioning. J. Atten. Disord. 2000 Nov 1; 4(3): 133–149.

4. Armbrecht E, Shah A, Schepman P, et al.: Economic and humanistic burden associated with noncommunicable diseases among adults with depression and anxiety in the United States. J. Med. Econ. 2020 Sep 1; 23(9): 1032–1042.

5. Rollwage M, Habicht J, Juechems K, et al.: Using Conversational AI to Facilitate Mental Health Assessments and Improve Clinical Efficiency Within Psychotherapy Services: Real-World Observational Study. JMIR AI. 2023 Dec 13; 2: e44358.

6. Sadeh-Sharvit S, Camp TD, Horton SE, et al.: Effects of an Artificial Intelligence Platform for Behavioral Interventions on Depression and Anxiety Symptoms: Randomized Clinical Trial. J. Med. Internet Res. 2023 Jul 10; 25: e46781.

7. Saha S, Lim CCW, Cannon DL, et al.: Co-morbidity between mood and anxiety disorders: A systematic review and meta-analysis. Depress. Anxiety. 2021 Mar; 38(3): 286–306.

8. Terlizzi E, Zablotsky B: Symptoms of Anxiety and Depression Among Adults: United States, 2019 and 2022. National Center for Health Statistics (U.S.); 2024 Nov [cited 2026 Jan 23].

9. Remes O, Mendes JF, Templeton P: Biological, Psychological, and Social Determinants of Depression: A Review of Recent Literature. Brain Sci. 2021 Dec 10; 11(12): 1633.

10. Arango-Dávila CA, Rincón-Hoyos HG: Depressive disorder, anxiety disorder and chronic pain: Multiple manifestations of a common clinical and pathophysiological core. Revista Colombiana de Psiquiatría (English ed). 2018 Jan; 47(1): 46–55.

11. Lkhagvasuren B, Hiramoto T, Bat-Erdene E, et al.: Anxiety, depression, and brain overwork in the general population of Mongolia. Sci. Rep. 2024 Jan 30; 14(1): 2484.

12. Ebert DD, Mortier P, Kaehlke F, et al.: Barriers of mental health treatment utilization among first-year college students: First cross-national results from the WHO World Mental Health International College Student Initiative. 2019; 28(2): e1782.

13. Nyakhar S, Wang H: Effectiveness of artificial intelligence chatbots on mental health & well-being in college students: a rapid systematic review. Front. Psych. 2025 Oct 21; 16: 1621768.

14. Palmer CE, Marshall E, Millgate E, et al.: Combining Artificial Intelligence and Human Support in Mental Health: Digital Intervention With Comparable Effectiveness to Human-Delivered Care. J. Med. Internet Res. 2025 May 13; 27: e69351.

15. Lau Y, Ang WHD, Ang WW, et al.: Artificial Intelligence–Based Psychotherapeutic Intervention on Psychological Outcomes: A Meta-Analysis and Meta-Regression. Lamela D, editor. Depress. Anxiety. 2025 Jan; 2025(1): 8930012.

16. Li H, Zhang R, Lee YC, et al.: Systematic review and meta-analysis of AI-based conversational agents for promoting mental health and well-being. npj Digit Med. 2023 Dec 19; 6(1): 236.

17. Wang Y, Li X, Zhang Q, et al.: Effect of a Cognitive Behavioral Therapy–Based AI Chatbot on Depression and Loneliness in Chinese University Students: Randomized Controlled Trial With Financial Stress Moderation. JMIR Mhealth Uhealth. 2025 Aug 29; 13: e63806.

18. Heinz MV, Mackin DM, Trudeau BM, et al.: Randomized Trial of a Generative AI Chatbot for Mental Health Treatment. NEJM AI. 2025 Mar 27; 2(4).

19. Karkosz S, Szymański R, Sanna K, et al.: Effectiveness of a Web-based and Mobile Therapy Chatbot on Anxiety and Depressive Symptoms in Subclinical Young Adults: Randomized Controlled Trial. JMIR Form Res. 2024 Mar 20; 8: e47960.

20. Suharwardy S, Ramachandran M, Leonard SA, et al.: Feasibility and impact of a mental health chatbot on postpartum mental health: a randomized controlled trial. AJOG Global Reports. 2023 Aug; 3(3): 100165.

21. Danieli M, Ciulli T, Mousavi SM, et al.: Assessing the Impact of Conversational Artificial Intelligence in the Treatment of Stress and Anxiety in Aging Adults: Randomized Controlled Trial. JMIR Ment Health. 2022 Sep 23; 9(9): e38067.

22. He Y, Yang L, Zhu X, et al.: Mental Health Chatbot for Young Adults With Depressive Symptoms During the COVID-19 Pandemic: Single-Blind, Three-Arm Randomized Controlled Trial. J. Med. Internet Res. 2022 Nov 21; 24(11): e40719.

23. Villarreal-Zegarra D, Reategui-Rivera CM, García-Serna J, et al.: Self-Administered Interventions Based on Natural Language Processing Models for Reducing Depressive and Anxiety Symptoms: Systematic Review and Meta-Analysis. JMIR Ment Health. 2024 Aug 21; 11: e59560.

24. Amjad A, Kordel P, Fernandes G: A Review on Innovation in Healthcare Sector (Telehealth) through Artificial Intelligence. 2023; 15(8): 6655.

25. Feng Y, Hang Y, Wu W, et al.: Effectiveness of AI-Driven Conversational Agents in Improving Mental Health Among Young People: Systematic Review and Meta-Analysis. J. Med. Internet Res. 2025 May 14; 27: e69639.

26. Pavlopoulos A, Rachiotis T, Maglogiannis I: An Overview of Tools and Technologies for Anxiety and Depression Management Using AI. Appl. Sci. 2024 Oct 8; 14(19): 9068.

27. Page MJ, McKenzie JE, Bossuyt PM, et al.: The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. 2021; 372.

28. Higgins JPT, Thomas J, Chandler J, et al.: Cochrane handbook for systematic reviews of interventions. Version 6.5, 2024. London, United Kingdom: Cochrane; 2024.

29. Popay J, Roberts H, Sowden A, et al.: Guidance on the conduct of narrative synthesis in systematic reviews: A product from the ESRC Methods Programme. 2006.

30. Nejadghaderi SA, Balibegloo M, Rezaei N: The Cochrane risk of bias assessment tool 2 (RoB 2) versus the original RoB: A perspective on the pros and cons. Health Science Reports. 2024 Jun; 7(6): e2165.

31. Smith KW, Freeman NLB, Bir A: Assessing risk of bias in the meta-analysis of round 1 of the Health Care Innovation Awards. Syst. Rev. 2024 Jan 22; 13(1): 36.

32. Akdogan O, Uyar GC, Yesilbas E, et al.: Effect of a ChatGPT-based digital counseling intervention on anxiety and depression in patients with cancer: A prospective, randomized trial. Eur. J. Cancer. 2025 May; 221: 115408.

33. Sharp G, Dwyer B, Randhawa A, et al.: The Effectiveness of a Chatbot Single-Session Intervention for People on Waitlists for Eating Disorder Treatment: Randomized Controlled Trial. J. Med. Internet Res. 2025 May 21; 27: e70874.

34. Fulmer R, Joerin A, Gentile B, et al.: Using Psychological Artificial Intelligence (Tess) to Relieve Symptoms of Depression and Anxiety: Randomized Controlled Trial. JMIR Ment Health. 2018 Dec 13; 5(4): e64.

35. Zhao Y, Qian W, Chen Y, et al.: Effect of an AI agent trained on a large language model (LLM) as an intervention for depression and anxiety symptoms in young adults: A 28-day randomized controlled trial. Applied Psych Health & Well. 2025 Oct; 17(5): e70067.

36. Xu S, Ma T: Depression intervention using AI chatbots with social cues: a randomized trial of effectiveness. J. Affect. Disord. 2025 Nov; 389: 119760.

37. Gan W, Ouyang J, She G, et al.: ChatGPT’s role in alleviating anxiety in total knee arthroplasty consent process: a randomized controlled trial pilot study. Int. J. Surg. 2025 Mar; 111(3): 2546–2557.

38. Chen C, Lam KT, Yip KM, et al.: Comparison of an AI Chatbot With a Nurse Hotline in Reducing Anxiety and Depression Levels in the General Population: Pilot Randomized Controlled Trial. JMIR Hum. Factors. 2025 Mar 6; 12: e65785.

39. Ilmaskal R, Prabandari YS, Oktaria V, et al.: Impact of Digital-based Intervention on Smoking Abstinence and Cessation among Adolescents: A Systematic Review. Asian Journal of Social Health and Behavior. 2026 Jan; 9(1): 1–14.

40. Klos MC, Escoredo M, Joerin A, et al.: Artificial Intelligence–Based Chatbot for Anxiety and Depression in University Students: Pilot Randomized Controlled Trial. JMIR Form Res. 2021 Aug 12; 5(8): e20678.

41. Puspitarani NMAP, Devi IGAAIRP, Ngey MWNT, et al.: Data and supplementary materials for: The Effects of Artificial Intelligence-Based Interventions on Depression and Anxiety: A Systematic Review. 2026.

