MOODMIND: Artificial Intelligence for Major Depressive Disorder Screening in Tuberculosis Patients

Erlina Wijayanti; Ammar Abror; Ummi Azizah Rachmawati; Citra Fitri Agustina; Helwiah Umniyati; Diana Batara Munti; Exir Najib Rahmat; Athoillah Ahkam Diansyah

doi:10.12688/f1000research.168964.1

Software Tool Article

[version 1; peer review: 1 not approved]

Erlina Wijayanti¹, Ammar Abror², Ummi Azizah Rachmawati², Citra Fitri Agustina³, Helwiah Umniyati⁴, Diana Batara Munti¹, Exir Najib Rahmat⁵, Athoillah Ahkam Diansyah⁵

PUBLISHED 13 Oct 2025

¹ Family Medicine Primary Care Study Program, Faculty of Medicine, Yarsi University, Central Jakarta, Jakarta, Indonesia
² Faculty of Information Technology, Yarsi University, Jakarta, Indonesia
³ Department of Psychiatry, Faculty of Medicine, Yarsi University, Jakarta, Indonesia
⁴ Faculty of Dentistry, YARSI University, Jakarta, Indonesia
⁵ Faculty of Medicine, Yarsi University, Jakarta, Indonesia

Erlina Wijayanti
Roles: Conceptualization, Data Curation, Funding Acquisition, Methodology, Supervision, Writing – Original Draft Preparation, Writing – Review & Editing

Ammar Abror
Roles: Software, Writing – Original Draft Preparation

Ummi Azizah Rachmawati
Roles: Conceptualization, Methodology, Validation

Citra Fitri Agustina
Roles: Conceptualization, Methodology, Validation

Helwiah Umniyati
Roles: Investigation, Project Administration

Diana Batara Munti
Roles: Investigation, Resources

Exir Najib Rahmat
Roles: Investigation, Resources

Athoillah Ahkam Diansyah
Roles: Investigation, Resources

This article is included in the Artificial Intelligence and Machine Learning gateway.

This article is included in the AI in Medicine and Healthcare collection.

Abstract

Background

Major Depressive Disorder (MDD) can occur in patients with tuberculosis. The purpose of this research was to develop an early detection system for MDD and conduct an accuracy test.

Methods

The MOODMIND application uses Natural Language Processing (NLP) with sentiment analysis techniques. MOODMIND offers both speech and text options and is available in Indonesian and English. The screening results were compared with those of the doctor’s autoanamnesis. Single blinding was used so that the doctor was unaware of the application’s results.

Results

The app asks open- and closed-ended questions for MDD identification based on the DSM-5. The test results were divided into non-depressive (none or at-risk) and suspected depression groups. MOODMIND showed 67% sensitivity and 100% specificity.

Conclusions

Ease of use is an advantage because the steps are simple. MOODMIND showed sufficient accuracy, which could be improved by adding depression-related words to the lexicon.

Keywords

Artificial intelligence, depression, detection, tuberculosis, Natural Language Processing 

Corresponding author: Erlina Wijayanti

Competing interests: No competing interests were disclosed.

Grant information: This research was supported by a grant from YARSI University (number 1365/REK/PN.00/VII/2024).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2025 Wijayanti E et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Wijayanti E, Abror A, Rachmawati UA et al. MOODMIND: Artificial Intelligence for Major Depressive Disorder Screening in Tuberculosis Patients [version 1; peer review: 1 not approved]. F1000Research 2025, 14:1079 (https://doi.org/10.12688/f1000research.168964.1) First published: 13 Oct 2025, 14:1079 (https://doi.org/10.12688/f1000research.168964.1) Latest published: 13 Oct 2025, 14:1079 (https://doi.org/10.12688/f1000research.168964.1)

1. Introduction

Tuberculosis (TB) is a chronic infectious disease that requires at least 6 months of therapy. Psychiatric conditions are important because patients with TB can experience social stigma, worries about their illness, or difficulties during treatment. Depression has a strong effect on negative outcomes.¹ Individuals who undergo treatment with second- and third-line medications are at a greater risk of stigma and depression.²

Depression also affects the immune system by lowering CD3, CD4, CD8, and lymphocyte counts.³ Low serum anti-inflammatory cytokine levels are observed in patients with comorbid Major Depressive Disorder (MDD) and TB. Recognizing MDD in patients with TB supports more appropriate diagnosis, treatment, and prognosis.⁴

Major depression affects an estimated 322 million people worldwide,⁵ and some patients do not seek help. Major depression has the potential to lead to suicide. Questionnaires and screening tools have been developed, but most, such as the Mental Health Screening Tool for Depressive Disorders (MHS:D), use closed-ended questions.⁶

Natural Language Processing (NLP) is a branch of artificial intelligence capable of analyzing and interpreting words.⁷ NLP can be used remotely for the real-time detection of depression. Studies have built NLP systems, such as Mental-Health, that analyze signs of depression in comments on social media. The researchers compared Mental-Health with the PHQ-9 to determine the accuracy of the system.⁸

The NLP techniques used include sentiment analysis, linguistic markers, word embedding, convolutional neural networks, recurrent neural networks, and large language models. Sentiment analysis examines the emotional tone of a text and indicates possible depression when negative language is identified.⁹

Based on the above description, a web-based application was built to screen for MDD using sentiment analysis. The software provides an alternative with open-ended questions on the two key symptoms for the diagnosis of major depression in both Indonesian and English. Through early detection, it is hoped that depression can be treated immediately and that this will increase the chances of successful treatment.

2. Methods

A. MOODMIND development

The project is part of an effort to examine tuberculosis patients holistically by developing AI-based tools for detecting MDD.

1. Ethical considerations

The ethics committee of YARSI University reviewed this study under ethical clearance number 114/KEP-UY/EA.20/III/2025.

2. Implementation

MDD is diagnosed when five or more symptoms are present for at least two weeks, including at least one of the two key symptoms (symptom a or b).¹⁰ Figures 1 and 2 illustrate the MOODMIND application concept and the conceptual framework for its development, respectively.

Figure 1. MOODMIND application concept for Major Depressive Disorder (MDD) screening.

Figure 2. Conceptual framework for MOODMIND application development.
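The diagnostic rule stated above can be written as a small check. The following is a minimal sketch, not the MOODMIND source code: the symptom labels "a" to "i" (with "a" and "b" as the two key symptoms), the `ScreeningAnswers` shape, and the function name `meetsMddRule` are assumptions made for illustration.

```typescript
// Hypothetical labels for the nine DSM-5 MDD symptoms.
// "a" and "b" stand for the two key symptoms named in the text.
type SymptomKey = "a" | "b" | "c" | "d" | "e" | "f" | "g" | "h" | "i";

// Hypothetical input shape, not taken from the MOODMIND repository.
interface ScreeningAnswers {
  present: SymptomKey[]; // symptoms reported as present
  durationWeeks: number; // how long the symptoms have lasted
}

// Returns true when five or more distinct symptoms are present,
// at least one of them is a key symptom, and they have lasted two weeks or more.
function meetsMddRule(answers: ScreeningAnswers): boolean {
  const unique = new Set<SymptomKey>(answers.present);
  const hasKeySymptom = unique.has("a") || unique.has("b");
  return unique.size >= 5 && hasKeySymptom && answers.durationWeeks >= 2;
}
```

Under these assumptions, five symptoms without "a" or "b" do not satisfy the rule, and neither do five symptoms lasting only one week.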

3. Operation

The software can be accessed via the following link: https://moodmind-two.vercel.app/.

3.1 Technologies

MOODMIND uses Next.js as the frontend framework, Tailwind CSS for styling, and the Web Speech API for speech recognition. The programming languages are TypeScript and JavaScript.

3.2 Main components

VoiceChat.tsx manages voice input, transcripts, and conversation flow control. UseSpeech.ts is a custom hook that controls the speech recognition state. The script files provide the questions and responses.

3.3 Depression detection methodology

The detection approach was based on several text-based indicators derived from voice transcription, namely, language patterns and depression-related keywords.

3.4 User experience flow

Users open the web-based application and answer system questions using voice or text. The system processes the transcription using sentiment analysis. The results of the analysis are displayed in visual and narrative forms.

3.5 Adaptation for tuberculosis

MOODMIND was adapted with a custom sentiment dictionary, focusing on common terms in Bahasa Indonesia that were reported by patients with TB when experiencing emotional distress.

3.6 Implementation details in sentiment analysis integration

As part of its natural language processing features, this system is equipped with a sentiment analysis module to evaluate the emotions contained in voice recognition transcripts. Sentiment analysis aimed to identify the emotional orientation (positive, negative, or neutral) of a statement, which, in this context, was used to detect indications of mood and enthusiasm in patients. Sentiment analysis was performed using the sentiment library, an open-source JavaScript library that supports lexicon-based analysis.

Lexicon adjustments for Indonesian

By default, the sentiment library supports English. To support Indonesian, a special dictionary (lexicon), consisting of a list of words and their sentiment scores, was compiled.

This list of words was based on commonly used terminology to express negative emotional states, and was obtained through discussions between research members (Figure 3).

Figure 3. Special dictionary related to depression in Indonesian.

Sentiment analysis process

After the user provides voice input, it is transcribed into text and the system performs sentiment analysis on that text. The following function was used to perform the analysis (Figure 4). The getSentiment function accepts three parameters: the transcribed text, the sentiment dictionary, and the language code (“id” for Indonesian or “en” for English). If the selected language is Indonesian, the specially compiled dictionary is registered with the library.

Figure 4. Sentiment analysis process in MOODMIND.

Analysis results

The result object returned by the analyze() function contains several attributes that provide an overview of the emotional content of the analyzed text: score (the overall sentiment of the text: positive, negative, or neutral), comparative (the score normalized by the number of tokens), tokens (the result of text segmentation), words (the tokens identified as carrying sentiment), and positive/negative (the positive and negative words recognized in the text).
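To make these attributes concrete, the sketch below re-implements lexicon-based scoring from scratch. It is an illustration under stated assumptions, not the code of the sentiment library or of MOODMIND: the function name `scoreText`, the tokenization rule, and the placeholder lexicon entries and weights are all hypothetical and do not reproduce the dictionary in Figure 3.

```typescript
type LanguageCode = "id" | "en";

interface SentimentResult {
  score: number;       // sum of the weights of all matched words
  comparative: number; // score divided by the number of tokens
  tokens: string[];    // all tokens from text segmentation
  words: string[];     // tokens found in the lexicon
  positive: string[];  // matched tokens with a positive weight
  negative: string[];  // matched tokens with a negative weight
}

// Placeholder lexicons for illustration only; words and weights are assumptions.
const PLACEHOLDER_LEXICONS: Record<LanguageCode, Map<string, number>> = {
  id: new Map<string, number>([["sedih", -3], ["lelah", -2], ["senang", 3]]),
  en: new Map<string, number>([["sad", -3], ["tired", -2], ["happy", 3]]),
};

function scoreText(text: string, language: LanguageCode): SentimentResult {
  const lexicon = PLACEHOLDER_LEXICONS[language];
  // Simple segmentation: lowercase, then split on anything that is not a-z.
  const tokens = text.toLowerCase().split(/[^a-z]+/).filter((t) => t.length > 0);
  const words: string[] = [];
  const positive: string[] = [];
  const negative: string[] = [];
  let score = 0;
  for (const token of tokens) {
    const weight = lexicon.get(token);
    if (weight === undefined) continue; // token carries no sentiment
    score += weight;
    words.push(token);
    if (weight > 0) positive.push(token);
    if (weight < 0) negative.push(token);
  }
  // Guard against division by zero for empty input.
  const comparative = tokens.length > 0 ? score / tokens.length : 0;
  return { score, comparative, tokens, words, positive, negative };
}
```

With these placeholder weights, `scoreText("Saya merasa sedih dan lelah", "id")` yields five tokens, a score of -5, and a comparative value of -1.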

By integrating this sentiment analysis, the system automatically detects emotional indicators and provides additional data for the depression-screening process. If negative sentiment about mood or interest over the last two weeks is found, the system follows up with closed questions.
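The transition from open to closed questions can be pictured as a simple gate. This is a hedged sketch only: the article does not state the threshold or the data structure used, so the `OpenAnswerScores` shape, the function name `shouldAskClosedQuestions`, and the default threshold of 0 are assumptions.

```typescript
// Hypothetical shape: one sentiment score per open-ended key-symptom question.
interface OpenAnswerScores {
  mood: number;     // sentiment score of the answer about feelings
  interest: number; // sentiment score of the answer about interests
}

// Returns true when either open-ended answer is scored as negative,
// i.e. below the threshold (assumed here to be 0).
function shouldAskClosedQuestions(scores: OpenAnswerScores, threshold: number = 0): boolean {
  return scores.mood < threshold || scores.interest < threshold;
}
```

Under this assumption, a neutral or positive score on both open questions ends the screening without closed questions.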

B. Accuracy test

A quantitative cross-sectional study was carried out to test the accuracy of MOODMIND. The study population was patients with drug-sensitive TB who were accompanied by YARSI TB Care cadres. The inclusion criteria were age 17-65 years, TB treatment for more than 1 month, and willingness to participate in the study. Written informed consent was obtained using an electronic questionnaire. For patients aged 17 years, written consent was requested from parents or guardians (using an electronic questionnaire). Participants were recruited by purposive sampling from May to July 2025.

Data were collected by interview, and the MOODMIND detection results were compared with the doctor’s anamnesis. The doctor used the DSM-5 as the guide for establishing MDD.¹⁰ Univariate analysis was performed in Microsoft Excel to calculate sensitivity, specificity, positive predictive value, and negative predictive value. The doctor was blinded (single blinding) so that she did not know the MOODMIND detection results.

4. Results

4.1 Use cases

MOODMIND users can select the language (English or Indonesian) (Figure 5a) and choose either the written or voice mode of conversation (Figure 5b). Users’ answers were categorized into three groups: not depressed (score = 0), at risk of depression (score = 1-4), and suspected depression (score ≥ 5) (Figure 5c). The term “suspected depression” was used because the diagnosis must be made by a doctor and the patient should receive the necessary consultation. AI cannot replace the role of a doctor or health officer, which involves empathy and direct interaction with a human being.

Figure 5. a. Front page of MOODMIND. b. Conversation flow in MOODMIND. c. Result of test in MOODMIND.
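The three score bands can be expressed as a short mapping. This is a minimal sketch assuming the score is a non-negative integer; the names `categorize` and `isScreenPositive` are hypothetical, and how the score itself is derived is not covered here.

```typescript
type ScreeningCategory =
  | "not depressed"
  | "at risk of depression"
  | "suspected depression";

// Maps a screening score to the three categories described in the text:
// 0 is not depressed, 1-4 is at risk, 5 or more is suspected depression.
function categorize(score: number): ScreeningCategory {
  if (!Number.isInteger(score) || score < 0) {
    throw new RangeError("score must be a non-negative integer");
  }
  if (score === 0) return "not depressed";
  if (score <= 4) return "at risk of depression";
  return "suspected depression";
}

// For the accuracy test, "not depressed" and "at risk" were grouped as
// non-depressive, so only "suspected depression" counts as a positive screen.
function isScreenPositive(score: number): boolean {
  return categorize(score) === "suspected depression";
}
```

A score of 4 therefore counts as a negative screen in the accuracy test, while a score of 5 counts as positive.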

4.2 Accuracy test

We tested MOODMIND on 21 patients with TB in Central Jakarta between May and July 2025. The mean age of the patients was 41.4 years (range 19-64 years). Patients were guided by the researcher when using MOODMIND, whereas the doctor was blinded and did not know the results of the software detection.

Table 1 shows a comparison of MOODMIND detection with the doctor’s autoanamnesis, while Table 2 shows the accuracy level of the software.

Table 1. MOODMIND screening and doctor’s examination test results.

MOODMIND result	Autoanamnesis negative	Autoanamnesis positive	Total
Negative	18	1	19
Positive	0	2	2
Total	18	3	21

Table 2. Analysis of MOODMIND screening results on doctor’s examination.

Test	Percentage
Sensitivity	67%
Specificity	100%
Positive predictive value	100%
Negative predictive value	95%
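
The percentages in Table 2 follow directly from the counts in Table 1. The sketch below reproduces the arithmetic; the interface and function names are hypothetical, and the authors performed this calculation in Microsoft Excel rather than in code.

```typescript
interface ConfusionCounts {
  truePositive: number;  // MOODMIND positive, doctor positive
  falseNegative: number; // MOODMIND negative, doctor positive
  falsePositive: number; // MOODMIND positive, doctor negative
  trueNegative: number;  // MOODMIND negative, doctor negative
}

interface AccuracyMetrics {
  sensitivity: number;
  specificity: number;
  positivePredictiveValue: number;
  negativePredictiveValue: number;
}

// Standard 2x2 diagnostic accuracy formulas, returned as proportions (0 to 1).
function diagnosticAccuracy(c: ConfusionCounts): AccuracyMetrics {
  return {
    sensitivity: c.truePositive / (c.truePositive + c.falseNegative),
    specificity: c.trueNegative / (c.trueNegative + c.falsePositive),
    positivePredictiveValue: c.truePositive / (c.truePositive + c.falsePositive),
    negativePredictiveValue: c.trueNegative / (c.trueNegative + c.falseNegative),
  };
}

// Counts taken from Table 1.
const TABLE_1: ConfusionCounts = {
  truePositive: 2,
  falseNegative: 1,
  falsePositive: 0,
  trueNegative: 18,
};

const metrics = diagnosticAccuracy(TABLE_1);
console.log(metrics);
```

Here sensitivity is 2/3, specificity is 18/18, positive predictive value is 2/2, and negative predictive value is 18/19, which round to the 67%, 100%, 100%, and 95% reported in Table 2.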

5. Discussion

MOODMIND uses sentiment analysis that searches for keywords and analyzes sentiment in Indonesian. The lexicon technique builds a list of words and assigns a sentiment score to each word.¹¹ Other studies have identified depression-related keywords, symbols, and expressions on social media.¹²,¹³ Existing depression detection systems and applications include “Mental Care”, which asks respondents 21 questions;¹⁴ MGL-CNN (Multi-Gated LeakyReLU CNN), which processes depressive language using a convolutional neural network;¹⁵ and an approach that analyzes expressions that do not directly use specific words.¹⁶

Artificial intelligence tools usually require a certain level of user skill.¹⁷ MOODMIND, however, is very easy to operate, which can reduce issues related to human resources. The main requirements are a device and an internet connection. This tool could inspire the development of similar tools in other countries, adapted to the local language, to narrow the gap between detected cases and the actual number of cases. The words used to express depression change over time, so the lexicon must be updated continuously to improve sensitivity.

Research with a larger sample is needed to determine the accuracy of MOODMIND in a real-world setting. In addition, linking the screening results to electronic medical records could be a useful way to monitor the mental health of patients with chronic diseases such as tuberculosis.

6. Conclusion

MOODMIND, an artificial intelligence tool based on Natural Language Processing, can be used as an MDD detection tool. The level of accuracy was adequate (67% sensitivity and 100% specificity). This tool supports mental health monitoring but does not replace the role of doctors. It could also serve as a model for AI development in other countries to detect MDD as early as possible.

Software availability

Source code available from: https://github.com/incrementalstudios/mood-mind

Archived software available from: https://doi.org/10.5281/zenodo.16793110¹⁸

License: MIT License

Data availability

The dataset underlying the accuracy test findings is available at https://doi.org/10.5281/zenodo.17114938.¹⁹ The approval sheets and interview guides are also included at this link.

Data are available under the terms of the Creative Commons Zero v1.0 Universal license.

Acknowledgements

We thank the YARSI Foundation for supporting this study.

References

1. Ruiz-Grosso P, Cachay R, De La Flor A, et al.: Association between tuberculosis and depression on negative outcomes of tuberculosis treatment: A systematic review and meta-analysis. PLoS One. 2020; 15(1): e0227472–e0227413.
2. Sweetland AC, Kritski A, Oquendo MA, et al.: Addressing the tuberculosis–depression syndemic to end the tuberculosis epidemic. Int. J. Tuberc. Lung Dis. 2017; 21(8): 852–861.
3. Liu X, Bai X, Ren R, et al.: Association between depression or anxiety symptoms and immune-inflammatory characteristics in in-patients with tuberculosis: A cross-sectional study. Front. Psych. 2022; 13.
4. Alvarez-Sekely M, Lopez-Bago A, Báez-Saldaña R, et al.: Major Depressive Disorder and Pulmonary Tuberculosis Comorbidity Exacerbates Proinflammatory Immune Response: A Preliminary Study. Pathogens. 2023; 12(3).
5. World Health Organization: Depression and Other Common Mental Disorders Global Health Estimates. 2017.
6. Park K, Yoon S, Cho S, et al.: Final validation of the mental health screening tool for depressive disorders: A brief online and offline screening tool for major depressive disorder. Front. Psychol. 2022; 13(October): 1–12.
7. Tyagi N, Bhushan B: Demystifying the Role of Natural Language Processing (NLP) in Smart City Applications: Background, Motivation, Recent Advances, and Future Research Directions. Wirel. Pers. Commun. 2023; 130: 857–908.
8. Salas-Zárate R, Alor-Hernández G, Paredes-Valverde MA, et al.: Mental-Health: An NLP-Based System for Detecting Depression Levels through User Comments on Twitter (X). Mathematics. 2024; 12(13): 1–30.
9. Teferra BG, Rueda A, Pang H, et al.: Screening for Depression Using Natural Language Processing: Literature Review. Interact. J. Med. Res. 2024; 13: e55067.
10. American Psychiatric Association: Diagnostic and Statistical Manual of Mental Disorders (DSM-5). Encyclopedia of Applied Psychology, Three-Volume Set. 2013; 1: 160–168.
11. Herdiansyah H, Roestam R, Kuhon R, et al.: Their post tell the truth: Detecting social media users mental health issues with sentiment analysis. Procedia Comput. Sci. 2023; 216: 691–697.
12. Cha J, Kim S, Park E: A lexicon-based approach to examine depression detection in social media: the case of Twitter and university community. Humanit. Soc. Sci. Commun. 2022; 9(1): 1–10.
13. Li G, Li B, Huang L, et al.: Automatic construction of a depression-domain lexicon based on microblogs: Text mining study. JMIR Med. Informatics. 2020; 8(6): 1–17.
14. Gustiadi A, Lazuardi L, Kedokteran F, et al.: PENGEMBANGAN APLIKASI SKRINING KESEHATAN MENTAL. J. Inf. Kesehat Indones. 2024; 10(2): 67–77.
15. Rao G, Zhang Y, Zhang L, et al.: MGL-CNN: A Hierarchical Posts Representations Model for Identifying Depressed Individuals in Online Forums. IEEE Access. 2020; 8: 32395–32403.
16. Chiong R, Budhi GS, Dhakal S, et al.: A textual-based featuring approach for depression detection using machine learning classifiers and social media texts. Comput. Biol. Med. 2021; 135: 104499.
17. Zafar F, Fakhare Alam L, Vivas RR, et al.: The Role of Artificial Intelligence in Identifying Depression and Anxiety: A Comprehensive Literature Review. Cureus. 2024; 16(3).
18. Abror A, Wijayanti E, Rachmawati UA, et al.: Moodmind. Zenodo. 2025.
19. Wijayanti E: Data Availability MOODMIND. Zenodo. 2025.

Open Peer Review

Reviewer Report 30 Dec 2025

Masab Mansoor, Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, USA

Not Approved

https://doi.org/10.5256/f1000research.186213.r435555

PEER REVIEW REPORT
SUMMARY
This manuscript presents MOODMIND, a web-based Natural Language Processing (NLP) application designed to screen for Major Depressive Disorder (MDD) in tuberculosis patients. The tool employs lexicon-based sentiment analysis on both voice and text inputs in Indonesian and English, utilizing DSM-5 criteria. A pilot validation study with 21 TB patients demonstrated 67% sensitivity and 100% specificity compared to physician assessment.
While the authors address a clinically relevant problem and present an accessible tool, significant methodological and technical limitations require substantial revision before indexing.
DETAILED RESPONSES TO REVIEW QUESTIONS
1. Is the rationale for developing the new software tool clearly explained?
Answer: PARTLY
Strengths:

The co-morbidity between TB and MDD is well-established in the introduction
The need for accessible screening tools is apparent
The rationale for using open-ended questions is mentioned

Weaknesses:

Lacks justification for why existing validated tools (PHQ-9, GAD-7, BDI-II) are inadequate for this population
Does not explain why sentiment analysis is superior to structured questionnaires
Insufficient epidemiological data on MDD prevalence specifically in Indonesian TB patients
No discussion of barriers to current screening practices that this tool addresses
Missing cost-effectiveness or accessibility arguments compared to existing digital mental health tools

Recommendations:

Provide systematic comparison of existing MDD screening tools and their limitations in TB populations
Clarify specific advantages of MOODMIND over validated instruments
Include data on mental health service gaps in the target population

2. Is the description of the software tool technically sound?
Answer: NO
Critical Technical Deficiencies:
A. Sentiment Analysis Methodology:

Oversimplified approach: Lexicon-based sentiment analysis is outdated; modern approaches utilize transformer-based models (BERT, GPT) or at minimum, more sophisticated machine learning classifiers
No description of sentiment score thresholds for categorization into non-depressed/at-risk/suspected depression
Lack of validation for the custom Indonesian lexicon (Figure 3 shows word list but no validation process)
No explanation of how sentiment scores correlate with DSM-5 diagnostic criteria for MDD

B. Algorithm Description:

Figure 4 shows code snippet but insufficient detail for replication:
- What constitutes a "negative sentiment" threshold?
- How are comparative scores normalized?
- What happens with neutral sentiments?
Missing information on decision tree logic for transitioning from open to closed questions
No description of how the algorithm weighs different DSM-5 symptoms

C. Speech Recognition:

Uses Web Speech API but no validation of transcription accuracy for Indonesian language
No discussion of handling dialectal variations or accented speech
Error handling procedures for misrecognition not addressed

D. Scoring System:

Critical gap: The manuscript states results are categorized as score 0, 1-4, or ≥5, but never explains:
- How these numeric scores are generated from sentiment analysis
- What each point represents (symptom count? Severity weighting?)
- How DSM-5's requirement of "5 or more symptoms with at least 1 core symptom" maps to the scoring

Recommendations:

Provide detailed algorithmic flowchart from input to classification
Specify all threshold values and their empirical basis
Consider upgrading to modern NLP architectures
Validate Indonesian lexicon against clinical datasets
Provide comprehensive technical documentation in supplementary materials

3. Are sufficient details provided for replication?
Answer: NO
Missing Critical Information:
A. Lexicon Development:

Figure 3 shows word list but:
- No systematic methodology for term selection
- "Obtained through discussions between research members" is insufficient
- No validation against clinical depression corpora
- Sentiment weights/scores for each term not provided
- No inter-rater agreement statistics for lexicon development

B. Software Implementation:

While GitHub repository is referenced:
- Version control information missing (which commit/release tested?)
- Dependency versions not specified
- Deployment environment specifications absent
- Browser compatibility not documented

C. Validation Protocol:

Insufficient detail on physician assessment:
- What specific questions did physicians ask?
- How long were clinical interviews?
- What documentation was completed?
- Single physician or multiple (inter-rater reliability)?
Sampling procedure: "Purposive sampling" requires more specificity
Inclusion/exclusion criteria need expansion (e.g., cognitive impairment, substance use, psychotic disorders)

D. Statistical Analysis:

No sample size calculation or power analysis
Confidence intervals not provided for sensitivity/specificity
Missing information on handling:
- Incomplete responses
- Technical failures
- Participant withdrawals

Recommendations:

Publish complete lexicon with sentiment weights as supplementary data
Provide structured interview guide used by physicians
Include detailed statistical analysis plan
Add flowchart of participant recruitment and assessment
Specify all software versions and system requirements

4. Is sufficient information provided to interpret expected outputs?
Answer: PARTLY
Strengths:

Figure 5c shows example output interface
Three-category classification (not depressed/at-risk/suspected) is clear
Appropriate disclaimer about need for professional diagnosis

Weaknesses:
A. Output Interpretation:

No guidance on clinical action for each category:
- What should clinicians do with "at-risk" patients?
- Referral pathways not discussed
- Urgency assessment absent
Score explanation missing: Users see categorical result but not underlying score or contributing factors
No feedback on specific symptoms identified

B. Clinical Utility:

Unclear integration with clinical workflow:
- When in TB treatment should screening occur?
- How often should rescreening happen?
- Documentation recommendations?
False negative implications not discussed (with 67% sensitivity, 33% of MDD cases missed)

C. Limitations Communication:

Tool appropriately states it doesn't replace clinical diagnosis, but:
- Doesn't explain to users why (especially for low-literacy populations)
- Risk of delayed care if patients with negative screens don't seek help
- Suicidality assessment completely absent

Recommendations:

Provide detailed clinical implementation guidelines
Include symptom-level feedback in output
Add explicit suicidality screening and immediate referral protocols
Create user education materials explaining tool limitations
Develop clinician interpretation guide with case examples

5. Are conclusions adequately supported by findings?
Answer: NO
Major Concerns:
A. Overstated Claims:

"Adequate accuracy" (67% sensitivity) is debatable for screening tool:
- Missing 1 in 3 cases is problematic for MDD screening
- Compare to PHQ-9: sensitivity ~88%, specificity ~88% for MDD diagnosis
- No justification for why 67% is acceptable
"100% specificity" is misleading:
- Based on zero false positives in sample of 18 true negatives
- With 95% confidence interval, true specificity could be as low as ~82%
- Overfitting likely with such small sample
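
The lower bound of roughly 82% can be checked with a Wilson score interval for 18 correct results out of 18. The sketch below is one way to compute it; the choice of the Wilson method and the function name `wilsonInterval` are assumptions, since the report does not state which interval was used.

```typescript
interface Interval {
  lower: number;
  upper: number;
}

// Wilson score interval for a binomial proportion.
// z = 1.96 corresponds to a two-sided 95% confidence level.
function wilsonInterval(successes: number, n: number, z: number = 1.96): Interval {
  if (n <= 0 || successes < 0 || successes > n) {
    throw new RangeError("require n > 0 and 0 <= successes <= n");
  }
  const p = successes / n;
  const z2 = z * z;
  const denominator = 1 + z2 / n;
  const center = (p + z2 / (2 * n)) / denominator;
  const halfWidth =
    (z * Math.sqrt((p * (1 - p)) / n + z2 / (4 * n * n))) / denominator;
  // Clamp to the valid range for a proportion.
  return {
    lower: Math.max(0, center - halfWidth),
    upper: Math.min(1, center + halfWidth),
  };
}

// Specificity: 18 true negatives out of 18 doctor-negative patients.
const specificityInterval = wilsonInterval(18, 18);
console.log(specificityInterval.lower.toFixed(3));
```

For 18 of 18 the lower bound is about 0.82. Applying the same function to the sensitivity estimate (2 of 3) gives a much wider interval, which illustrates how little three positive cases can establish.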

B. Statistical Limitations:

Sample size (n=21) severely underpowered:
- Only 3 MDD-positive cases
- Cannot reliably estimate diagnostic accuracy
- No subgroup analyses possible
- Results not generalizable
Selection bias: Patients "accompanied by YARSI TB Care cadres" may not represent broader TB population
Verification bias: Single physician assessment without structured interview or validated scales

C. Comparative Evidence:

No comparison with validated screening tools in same population
No benchmark against PHQ-9, which is:
- Free, brief (9 items)
- Extensively validated in medical populations
- Available in Indonesian

D. Generalizability:

Single-center study in Central Jakarta
TB patients only - MDD presentation may differ in other chronic diseases
Researcher-supervised administration - real-world performance likely lower
Indonesian language validation insufficient (no dialectal testing)

E. Missing Discussions:

False negative consequences: Untreated depression worsens TB outcomes, adherence
Screening frequency: Optimal timing during 6-month TB treatment unclear
Cost-effectiveness: Not analyzed
Implementation barriers: Internet access, device availability, digital literacy

Recommendations:

Revise conclusions to acknowledge substantial limitations
Clearly state this is a proof-of-concept pilot requiring extensive validation
Compare performance to PHQ-9 in future studies
Conduct multi-center validation with ≥300 participants
Include external validation cohort
Perform head-to-head comparison with validated instruments

ADDITIONAL MAJOR CONCERNS
Methodological Issues:

Gold Standard Inadequate:
- Physician "autoanamnesis" is not validated
- Should use Structured Clinical Interview for DSM-5 (SCID) or MINI
- Consider including validated self-report measures as convergent validity
Blinding Incomplete:
- Researchers present during MOODMIND administration could influence responses
- Should be independently administered without researcher presence
Missing Data on:
- TB disease characteristics (only drug-sensitive status is stated; severity and treatment phase are not reported)
- Psychiatric history (first episode vs. recurrent MDD?)
- Current psychotropic medications
- Comorbid psychiatric conditions
- Sociodemographic factors

Ethical Concerns:

Vulnerable Population:
- TB patients with MDD are doubly stigmatized
- Data security measures not described
- Privacy protections for voice recordings unclear
Suicidality Risk:
- No suicide risk assessment in the tool
- Critical safety gap: MDD screening without suicide screening is dangerous
- No crisis referral pathway described

Technical Concerns:

Open-Ended Question Analysis:
- How are diverse, unstructured responses converted to binary symptom presence/absence?
- Inter-rater reliability for human coding not established
- Automated coding validation absent
Closed Question Integration:
- Manuscript states both open and closed questions used
- Logic for triggering closed questions not explained
- Scoring methodology for combined responses unclear
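
To illustrate the level of detail being requested, the following is a hypothetical sketch of an explicit scoring rule. It is not the MOODMIND algorithm, which the manuscript does not describe. It assumes that each of the nine DSM-5 criterion A symptoms has already been coded as present or absent, maps the symptom count onto the three reported categories (0, 1-4, ≥5), and enforces the DSM-5 requirement of at least one core symptom. The symptom keys, the `classify` function, and the suicidality flag are all illustrative choices.

```python
from dataclasses import dataclass

# DSM-5 criterion A symptoms; the two core symptoms are listed separately.
CORE_SYMPTOMS = {"depressed_mood", "anhedonia"}
OTHER_SYMPTOMS = {
    "appetite_or_weight_change",
    "sleep_disturbance",
    "psychomotor_change",
    "fatigue",
    "worthlessness_or_guilt",
    "poor_concentration",
    "suicidal_ideation",
}
ALL_SYMPTOMS = CORE_SYMPTOMS | OTHER_SYMPTOMS


@dataclass(frozen=True)
class ScreeningResult:
    score: int             # number of symptoms coded as present
    category: str          # one of the three reported output categories
    urgent_referral: bool  # hypothetical flag: any suicidal ideation


def classify(present: set[str]) -> ScreeningResult:
    """Map a set of present symptoms to a screening category (hypothetical rule)."""
    unknown = present - ALL_SYMPTOMS
    if unknown:
        raise ValueError(f"Unknown symptom keys: {sorted(unknown)}")
    score = len(present)
    has_core = bool(present & CORE_SYMPTOMS)
    if score >= 5 and has_core:
        category = "suspected depression"
    elif score >= 1:
        category = "at risk"
    else:
        category = "not depressed"
    return ScreeningResult(score, category, "suicidal_ideation" in present)


example = classify(
    {"depressed_mood", "fatigue", "sleep_disturbance", "poor_concentration", "suicidal_ideation"}
)
print(example)
```

In this sketch, five symptoms without a core symptom fall into "at risk" rather than "suspected depression". That is one possible reading of the DSM-5 rule and is exactly the kind of decision the manuscript would need to state explicitly.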

SPECIFIC CORRECTIONS REQUIRED
Methods Section:

Line describing sample: Change "21 patients" to "21 patients (pilot feasibility study)"
Add: Structured clinical interview protocol used by physician
Add: Sample size justification or acknowledge as convenience sample
Add: Inter-rater reliability assessment (if multiple raters) or acknowledge single-rater limitation

Results Section:

Add: 95% confidence intervals for all diagnostic accuracy metrics
Add: Participant flowchart (STARD guidelines)
Add: Description of any technical failures or incomplete assessments
Modify Table 2: Include confidence intervals

Discussion Section:

Add: Direct comparison of 67% sensitivity to literature values for validated tools
Add: Clinical implications of 33% false negative rate
Add: Limitations section discussing:
- Small sample size and wide confidence intervals
- Lack of external validation
- Single-center, single-assessor design
- Absence of comparison to validated instruments
Add: Implementation research needs before clinical deployment

Conclusion Section:

Modify: Change "adequate accuracy" to "preliminary accuracy estimates requiring validation in larger studies"
Add: Explicit statement: "This tool requires extensive validation before clinical implementation"
Add: Specific next steps for validation research

MINOR ISSUES
Writing Quality:

Generally clear but some grammatical errors
"Autoanamnesis" - unusual term; clarify or use "clinical interview"
Inconsistent terminology (MDD vs. depression vs. major depression)

Figures:

Figure 1: Simplistic - could be removed or enhanced with algorithm specifics
Figure 2: Helpful conceptual framework
Figure 3: Shows the word list but with insufficient explanation
Figure 4: Code snippet needs more context
Figure 5: Good interface examples but needs annotation

References:

Appropriate selection
Missing key references on digital mental health tools
Should cite PHQ-9 validation studies in TB populations (if available)

VERDICT AND RECOMMENDATIONS
Overall Assessment: MAJOR REVISIONS REQUIRED
Must Be Addressed for Scientific Soundness:

Acknowledge severe limitations of 21-patient pilot study throughout manuscript
Revise conclusions to reflect preliminary nature of findings
Add confidence intervals to all diagnostic accuracy estimates
Provide complete technical documentation sufficient for replication
Include suicide risk assessment in tool or acknowledge dangerous omission
Explain scoring algorithm in detail
Validate lexicon using established methodology
Compare to validated screening tools (add as limitation if not done)

Strongly Recommended:

Conduct adequately powered validation study (n≥300) before claiming clinical utility
Perform external validation in different TB treatment settings
Include structured clinical interview as gold standard
Assess inter-rater reliability
Upgrade NLP methodology to modern standards
Publish complete source code and lexicon with version control

Suggested Title Revision:
"MOODMIND: A Pilot Feasibility Study of Artificial Intelligence for Major Depressive Disorder Screening in Tuberculosis Patients"
Alternative Publication Path:
Given the early-stage development and small sample, authors might consider:

Repositioning the article as a "Software Tool Note" rather than a validation study
Focusing on the technical description, with clear acknowledgment that clinical validation is pending
Presenting the diagnostic accuracy data as preliminary feasibility findings only

CONCLUSION
While MOODMIND addresses an important clinical need and demonstrates a creative application of NLP to mental health screening, the manuscript requires substantial revision to meet scientific standards for a clinical validation study. The combination of outdated NLP methodology, an inadequate technical description, a severely underpowered validation study, and the absence of critical safety features (suicide screening) precludes recommendation for approval in its current form.
The authors should be commended for open-source development and bilingual implementation, but must conduct rigorous validation research before clinical deployment recommendations can be supported.
Recommendation: MAJOR REVISIONS REQUIRED before this manuscript can be considered for indexing.

Is the rationale for developing the new software tool clearly explained?
Partly

Is the description of the software tool technically sound?
No

Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?
No

Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?
Partly

Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?
No

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Machine Learning, artificial intelligence, health informatics, Large language models, Machine vision

I confirm that I have read this submission and believe that I have an appropriate level of expertise to state that I do not consider it to be of an acceptable scientific standard, for reasons outlined above.


30 Dec 2025 | for Version 1

Masab Mansoor, Edward Via College of Osteopathic Medicine, Blacksburg, Virginia, USA


Not Approved


References

[1] Ruiz-Grosso P, Cachay R, De La Flor A, et al.: Association between tuberculosis and depression on negative outcomes of tuberculosis treatment: A systematic review and meta-analysis. PLoS One. 2020; 15(1): e0227472–e0227413.

[2] Sweetland AC, Kritski A, Oquendo MA, et al.: Addressing the tuberculosis–depression syndemic to end the tuberculosis epidemic. Int. J. Tuberc. Lung Dis. 2017; 21(8): 852–861.

[3] Liu X, Bai X, Ren R, et al.: Association between depression or anxiety symptoms and immune-inflammatory characteristics in in-patients with tuberculosis: A cross-sectional study. Front. Psych. 2022; 13.

[4] Alvarez-Sekely M, Lopez-Bago A, Báez-Saldaña R, et al.: Major Depressive Disorder and Pulmonary Tuberculosis Comorbidity Exacerbates Proinflammatory Immune Response—A Preliminary Study. Pathogens. 2023; 12(3).

[5] World Health Organization: Depression and Other Common Mental Disorders Global Health Estimates. 2017.

[6] Park K, Yoon S, Cho S, et al.: Final validation of the mental health screening tool for depressive disorders: A brief online and offline screening tool for major depressive disorder. Front. Psychol. 2022; 13(October): 1–12.

[7] Tyagi N, Bhushan B: Demystifying the Role of Natural Language Processing (NLP) in Smart City Applications: Background, Motivation, Recent Advances, and Future Research Directions. Wirel. Pers. Commun. 2023; 130: 857–908.

[8] Salas-Zárate R, Alor-Hernández G, Paredes-Valverde MA, et al.: Mental-Health: An NLP-Based System for Detecting Depression Levels through User Comments on Twitter (X). Mathematics. 2024; 12(13): 1–30.

[9] Teferra BG, Rueda A, Pang H, et al.: Screening for Depression Using Natural Language Processing: Literature Review. Interact. J. Med. Res. 2024; 13: e55067.

[10] American Psychiatric Association: Diagnostic and Statistical Manual of Mental Disorders (DSM-5). Encyclopedia of Applied Psychology, Three-Volume Set. 2013; 1: 160–168.

[11] Herdiansyah H, Roestam R, Kuhon R, et al.: Their post tell the truth: Detecting social media users mental health issues with sentiment analysis. Procedia Comput. Sci. 2023; 216: 691–697.

[12] Cha J, Kim S, Park E: A lexicon-based approach to examine depression detection in social media: the case of Twitter and university community. Humanit. Soc. Sci. Commun. 2022; 9(1): 1–10.

[13] Li G, Li B, Huang L, et al.: Automatic construction of a depression-domain lexicon based on microblogs: Text mining study. JMIR Med. Informatics. 2020; 8(6): 1–17.

[14] Gustiadi A, Lazuardi L, Kedokteran F, et al.: PENGEMBANGAN APLIKASI SKRINING KESEHATAN MENTAL [Development of a mental health screening application]. J. Inf. Kesehat Indones. 2024; 10(2): 67–77.

[15] Rao G, Zhang Y, Zhang L, et al.: MGL-CNN: A Hierarchical Posts Representations Model for Identifying Depressed Individuals in Online Forums. IEEE Access. 2020; 8: 32395–32403.

[16] Chiong R, Budhi GS, Dhakal S, et al.: A textual-based featuring approach for depression detection using machine learning classifiers and social media texts. Comput. Biol. Med. 2021; 135: 104499.

[17] Zafar F, Fakhare Alam L, Vivas RR, et al.: The Role of Artificial Intelligence in Identifying Depression and Anxiety: A Comprehensive Literature Review. Cureus. 2024; 16(3).

[18] Abror A, Wijayanti E, Rachmawati UA, et al.: Moodmind. Zenodo. 2025.

[19] Wijayanti E: Data Availability MOODMIND. Zenodo. 2025.
