Keywords
MHPSS, psychosocial, psychometrics, measurement, humanitarian, Rohingya, Bangladesh
Psychosocial research in humanitarian settings has been historically dominated by a focus on distress and disorder. As such, there is a need to establish the validity of instruments for a broad range of psychosocial outcomes, particularly among highly affected and under-represented populations. The current study describes the adaptation and testing of multiple psychosocial instruments among displaced Rohingya mothers in Bangladesh.
We used baseline data from 600 mothers of malnourished children aged 2 and under enrolled in an intervention study in Cox’s Bazar, Bangladesh. Instruments assessed distress (items from the International Depression Symptom Scale [IDSS] and Kessler-6 [K-6]); functional impairment (World Health Organization Disability Assessment Schedule [WHODAS]); subjective wellbeing (global Satisfaction With Life [SWL] and Personal Wellbeing Index [PWI]; and, coping (items from the Brief COPE and locally developed coping items). Instruments were piloted and refined, then used for data collection by Bangladeshi-Rohingya interviewer pairs. We conducted exploratory factor analysis, evaluated internal consistency, examined construct validity through correlation with other scales, and used regression models to explore demographic factors associated with psychosocial health.
Both the WHODAS and coping items fit 2-factor models; other scales were unidimensional. Cronbach’s alphas ranged from .76 to .90 for the refined scales. With the exception of coping, scale correlations supported construct validity; separate measures of the same construct were highly correlated, distress and impairment were moderately correlated, and both were inversely correlated with wellbeing. Correlates of poorer psychosocial health included relative socioeconomic disadvantage, current pregnancy, and being unmarried.
Most of the standard psychosocial assessment tools performed adequately, but they did not appear to fully capture local experiences and included items of little local relevance. Findings highlight the need for further mixed methods research to develop a rich battery of instruments with cross-cultural validity, particularly for positive outcomes such as coping which was particularly challenging to assess.
MHPSS, psychosocial, psychometrics, measurement, humanitarian, Rohingya, Bangladesh
This version provides more background information about the Rohingya refugee context, with new citations of recent research describing mental health and psychosocial support needs of Rohingya refugees. It also clarifies some questions in the methods related to the instrument adaptation and translation process, identifying important limitations of our approach and the implications for interpretation of scale scores. It also expands the discussion to highlight the need for more research focused on not only broad epidemiologic assessment instruments but also those with cutpoints for clinical utility in case identification.
See the authors' detailed response to the review by Yubaraj Adhikari
See the authors' detailed response to the review by Pieter Ventevogel
The myriad ways in which culture shapes both experiences and expressions of mental wellbeing and distress have been increasingly recognized, leading to cautions against an assumption that psychosocial measures are equally valid across a range of cultures and contexts.1 With this recognition, there has been increased attention to culturally and contextually specific measurements of distress within global mental health research; for example, recent efforts have sought to study the relative consistency of mental health constructs and their indicators across cultures,2–4 identify and share validated instruments,5,6 and provide guidance and innovative approaches to validation of measures.7–9 While most of the above work has been focused on screening or assessment tools measuring psychological distress, recent reviews have highlighted that there is less consensus on how to best measure other key mental health and psychosocial domains (e.g., wellbeing, coping, social behavior, and social connectedness).10 This discrepancy in the focus of psychosocial measurement has also been raised in calls for greater conceptual clarity when considering both clinical and social-environmental intervention models,11 while efforts from researchers and organizations (e.g., the Inter-Agency Standing Committee [IASC]’s Common Monitoring and Evaluation Framework12) have catalyzed new progress in instrument validation and evaluation. To improve the evidence-base for prevention and promotion programming targeting a broader array of psychosocial outcomes among highly impacted populations, increased attention to their measurement is needed.13 Even as the IASC has recently released updates to the Monitoring and Evaluation (M&E) Framework that include guidance on means of verification,14 there is a continued need to disseminate findings on the validity, reliability, feasibility, and relevance of the most commonly used measurement tools in under-studied populations to advance both programming and research.15
Refugees often reside in complex environments that makes psychosocial measurement even more difficult.16 Spurred by ethnic violence in 2017, nearly a million Rohingya forcibly displaced from Myanmar have taken refuge in Cox’s Bazar Bangladesh – dubbed the world’s largest refugee settlement – where they remain highly dependent on humanitarian aid and are living in overcrowded, difficult conditions with serious impacts on health and wellbeing.17 Both prior exposure to traumatic events as well as current stressors related to food insecurity, mobility, and safety concerns are highly prevalent and associated with poor mental health in this population.18 A recent survey found that more than 90% of Rohingya adults living in refugee camps in Bangladesh reported prior exposure to the following potentially traumatic events: experiencing gunfire, seeing dead bodies, and having their home destroyed.19 This same survey that found 61% of Rohingya adults reported symptoms of distress at severe enough levels to indicate probable PTSD and more than 80% of clinically meaningful depression/anxiety.19 Consistent with this finding, in a previous needs assessment conducted by the humanitarian organization Action contre La Faim in Northern Rakhine State, Myanmar, found that approximately a third of all Rohingya adults screened reported extreme levels of stress, and half of adults who underwent a full psychosocial evaluation reported suicidal ideation.20,21 This has led ACF to begin to build psychosocial support programing for Rohingya displaced in Myanmar as well as into Bangladesh.22,23
Given the magnitude of need for mental health and psychosocial support (MHPSS) for displaced Rohingya people, it is necessary to grow our inventory of valid assessment tools so that service organizations like ACF can evaluate a broader set of psychosocial outcomes that have relevance, meaning and importance to communities of Rohingya. Yet, at present, only limited information is available on the psychosocial wellbeing of the Rohingya population in Cox’s Bazar,24 and the outcomes of programs may be assessed qualitatively only.25,26 While qualitative outcome evaluation is critical for providing depth and context, quantitative tools can guide organizations in screening and identifying individuals at most need of intervention, tracking changes over time, and in gaining a larger picture of wellbeing at the community level. This is particularly important for sub-groups who are at higher risk of poor mental health outcomes and thus are prioritized for prevention and promotion interventions, or for whom standardized instruments may perform differently, such as pregnant and parenting mothers of young children.27 Language barriers that already complicate or delay provision of critical services28 could also complicate attempts to adapt or develop new measures. The Rohingya dialect, which is unwritten, has a vast vocabulary for emotions, behaviors, and idioms of distress, yet they are often not translated well, could vary between established refugees and more recent arrivals.24 Moreover, whereas some MHPSS personnel in Cox’s Bazar speak Chittagonian dialect, which has a high degree of overlap with the Rohingya language, few are fluent Rohingya speakers.24,29 These challenges point to a need to explore the psychometrics of common psychosocial measures in this population. While some efforts have begun in this area,30 it is important to extend these to quantitatively assessing well-being in addition to distress so that prevention and promotion programs can be evaluated in addition to treatment services.
The purpose of the current study was to adapt, translate, pilot, and assess the psychometric properties of a collection of instruments for use with Rohingya refugee mothers enrolled in an intervention study in Cox’s Bazar, Bangladesh. A second aim of this study was to explore demographic characteristics associated with psychosocial health among a sample of Rohingya refugee mothers accessing nutrition treatment services for their children under age two (i.e., women not necessarily seeking MHPSS support).
This study was carried out in Rohingya refugee camps in Cox’s Bazar, Bangladesh, where the study’s operational partner, ACF, has provided critical humanitarian services since 2008, including efforts to meet the expanded need from the 2017 refugee influx through a number of Integrated Nutrition Centers (INCs). As a core feature of their holistic approach to supporting child wellbeing, ACF also offered maternal psychosocial support at these INCs, including the Baby Friendly Spaces (BFS) program. BFS is a low-intensity psychosocial support intervention aimed at promoting maternal and child wellbeing and improving care practices among mothers of children under age two who are receiving nutritional treatment for acute malnutrition. During the study period, the BFS program completed approximately 50-100 intakes of mother-child dyads per month at each of the 10 included INCs. Resident populations across INC catchment areas were generally comparable, with the exception of length of time in displacement across camps; some camps primarily hosted people who were displaced in the 1990s and others from the more recent 2017 influx. For this study, we prioritized INCs located in camps hosting more recent arrivals. Data were collected from November, 2021 through January, 2022.
This paper uses baseline data from a clustered randomized controlled trial (RCT) of the BFS program collected among a sample of 600 Rohingya refugee mothers who were newly enrolling in BFS services at 10 participating INCs. RCT eligibility criteria included being a mother, age 18 or older, of a child under two years of age identified as suffering from moderate or severe acute malnutrition without complication by ACF as a part of their routine programming. Mothers who had cognitive impairment or psychosis that would preclude participation in program activities, as well as mothers of children with severe developmental disabilities or severe malnutrition with complications, were excluded. Additionally, while mothers who received individual counseling from a counselor or psychologist within ACF remained eligible, anyone receiving referral care outside of ACF for more severe mental health or protection needs (in accordance with standard ACF protocols) were excluded.
Measures were chosen to assess multiple indicators of person-focused mental health and psychosocial wellbeing as outlined in the IASC Common M&E Framework for MHPSS in emergencies.12 As this study was undertaken as one of five projects involved in the USAID-funded Health Evaluation and Applied Research Development (HEARD) learning collaborative of related MHPSS studies,31 key research domains and assessment instruments were aligned for consistent use across projects. Thus, selection prioritized existing, standardized instruments with demonstrated prior utility in trauma-affected populations across multiple cultures and contexts.
Prior to launching the study, all instruments were translated into Bengali language with independent back-translation (English-Bengali) to facilitate local IRB review. Following reconciliation, the Bengali versions were further translated and back-translated to create a written version in the Chittagonian Bengali dialect that is widely spoken in Cox’s Bazar and has a substantial degree of mutual intelligibility with the Rohingya dialect (which has no standard written form). Ten Chittagonian Bengali-speaking interviewers from the Cox’s Bazar district who had conversational fluency in the Rohingya dialect were then trained in the full assessment battery, and in partnership with Rohingya volunteers, collaboratively standardized Rohingya phrasing. The ten data collector pairs (comprised of a Bangladeshi interviewer and Rohingya volunteer; see procedures below) each subsequently piloted the instruments with 1-2 separate Rohingya mothers recruited from the BFS program at the same INCs that would be involved in study recruitment. After noting all feedback, the Bangladeshi interviewers met together with the research team to make collective recommendations on instrument revisions to clarify wording, reduce redundancy, remove problematic items, align response options, and shorten the overall survey.
Psychological distress was measured using two separate scales. To align across the HEARD learning collaborative, we used the Kessler 6-item scale (K-6), a standardized measure of general distress that has been used in diverse settings.32 Responses are based on frequency of experience over a 2-week recall period, with a 5-point Likert type response option of “none”, “a little’, “some”, “most”, and “all” of the time. We supplemented this primary instrument with items from the International Depression Symptom Scale (IDSS), a 29-item measure of depression developed to reflect symptoms that were commonly reported in a global review of qualitative studies on depression and showed reliable cross-cultural performance in quantitative analyses.33 The IDSS has previously been validated, refined, and tested among four distinct groups of people from Myanmar,33–36 though not with Rohingya people. That previous work included using item response theory to develop and further test the Myanmar-wide IDSS & AMHR Short Instrument,35,36 a 19-item abbreviated measure that performed similarly well and included a subset of IDSS items as well as other items assessing anxiety and post-traumatic stress. Although we initially piloted the 29-item IDSS, feedback regarding the length of the assessment battery led us to reduce the IDSS portion from 29 to 12 items to reduce participant burden. This included removing two items that were redundant to the K-6 (feelings of worthlessness and hopelessness) as well as many items that had been removed from the Myanmar-wide short version (e.g., items related to weight, appetite, and sleep, as well as a number of items describing heart-related symptoms).35,36 One item about concentration and three somatic items from the full IDSS (headache, stomach ache, and other bodily aches) that were not included in the Myanmar-wide short version were retained on an exploratory basis due to the potential relevance in this population. Responses to all versions of the IDSS are typically provided on a 4-point response scale, but for this administration we revised the response scale to match the K-6. Both the IDSS and K-6 were scored by taking the average of all contributing items to generate scale scores ranging from 0-4, with higher scores indicating greater distress. Given conceptual overlap between the K-6 and IDSS as measures of distress, we also explored the psychometric properties of a combined score using items from both scales, again calculated on the 0-4 score range.
Functional impairment was measured using the 12-item version of the WHO Disability Assessment Schedule 2.0 (WHODAS), a widely used assessment of health and disability developed to be applicable across a range of cultures and contexts37 and used previously with Rohingya adults.38 The WHODAS assesses six domains of functioning: cognition, mobility, self-care, getting along/social interactions, life activities, and participation. The WHODAS is typically administered by asking how much difficulty the respondent has had carrying out various activities in the past month, but we adjusted the recall period to two weeks to align with other scales on the assessment battery. Response options included “none”, “mild”, “moderate”, “severe”, and “extreme or cannot do”. Scores were calculated by taking the average of all contributing items for a range of 0-4, with higher scores indicating greater functional impairment.
Subjective wellbeing was measured using items recommended by the Organization for Economic Co-operation and Development (OECD),39 including a single-item rating of global Satisfaction With Life (SWL) that is widely used internationally and the 8-item Personal Wellbeing Index (PWI).40 Frequently administered with the PWI, the SWL item asks “Thinking about your own life and personal circumstances, how satisfied are you with your life as a whole these days?” The PWI has one item for each of seven core life domains (e.g., health, safety, and personal relationships) and an optional item on religion, which we included. Previous research with Rohingya people has used similar question stems, i.e., asking “how satisfied” the respondent is with personal relationships and support from friends.18 Both the SWL item and the PWI used the same 0-10 response scale, accompanied by the image of a ladder with ten rungs that correspond to the ratings with 0 at the bottom (i.e., “no satisfaction at all”) and ten at the top (i.e. “completely satisfied”). After piloting, one item “what you are achieving in life” was removed, as the question was not clear or meaningful to respondents. The PWI was scored by taking the average of all contributing items for a range of 0-10, with higher scores indicating greater subjective wellbeing.
Coping strategies were assessed using items from the Brief COPE,41 a 28-item instrument that has been translated and validated in numerous languages,42–44 and which is suggested as a means of verification in the IASC M&E framework.14 The Brief COPE assesses a broad range of coping strategies, such as denial, humor, active coping, and use of instrumental and emotional support. Responses were provided on a 4-point scale regarding how frequently each strategy has been used, ranging from 0 “I haven’t been doing this at all” to 3 “I’ve been doing this a lot”, with a 2-week recall period. The Brief COPE has previously been used to generate three overarching coping styles of problem-focused, emotion-focused, and avoidant coping. However, studies have shown the factor structure of the Brief COPE to vary across samples; for example, in a previous maternal sample the instrument consisted of two separate sub-scales: active coping and disengaged coping.45 For the current study, the items on the original Brief COPE were supplemented with several items depicting additional coping strategies found to be relevant locally through qualitative research with Rohingya adults (e.g., “thinking about a better future for the children”).18 After piloting, feedback regarding redundancy, item irrelevance, and respondent fatigue led us to remove some items resulting in a 24-item scale that retained 18 items from the Brief COPE and six locally developed coping items. Scores were generated by taking the average of contributing items for a range of 0-3, with higher scores indicating more frequent use of the included coping strategies.
As part of their programmatic intake, women enrolling in the BFS program were screened for study eligibility by BFS providers using the inclusion/exclusion criteria listed above. Eligible participants were informed of the study, and those who indicated an interest in study participation were referred to a trained data collector who obtained oral informed consent and administered the assessment interview. We used an oral consent process due to low literacy levels in the target population. As described above, these interviewers were native Chittagonian Bengali speakers who were also proficient in Rohingya, and who were additionally accompanied by a Rohingya volunteer to facilitate comfort and understanding. Engaging Rohingya volunteers has previously been recommended to help overcome language and cultural barriers29; our paired approach was due to both hiring restrictions in the camps as well as low literacy levels among Rohingya volunteers. As such, consent was obtained by the interviewer with the volunteer present as a witness, and assessment interviews were conducted in Rohingya, the participants’ native language. Baseline data collection lasted between 1-2 hours and was completed prior to the participant engaging in BFS program activities. Data were recorded in KoBo Toolbox46 using a handheld tablet. As consent was obtained at the same time as the interview, consent was also documented in Kobo Toolbox.
This study was approved by the Institutional Review Board of the Institute of Health Economics (approval number IHE-IRB/DU/2021/33/Final) at the University of Dhaka.
Analyses were conducted in Stata 17.0.47 Descriptive analyses included cross-tabulations of demographic variables and histograms of individual item distributions. We then examined each scale’s factor structure using exploratory factor analysis (EFA). An exploratory rather than confirmatory approach was used due the dearth of prior instrument validation research with this population, the substantial revisions made to some of the instruments prior to data collection that could impact factor structure, and prior evidence for different factor structures for some instruments.45,48 Because item responses were categorical, we first generated and examined polychoric correlation matrices for the items on each instrument. We determined the number of factors to extract based on number of eigenvalues over 1 and Horn’s parallel analysis for principal components. Factor models used iterated principal factor estimation with promax rotation. Items were retained based on high factor loadings (>.4) and low uniqueness. Where there was ambiguity in factor structure, decisions were made based on parsimony and meaningful interpretation. Once refined, scale scores were calculated and summarized to examine sample distributions, and each scale’s internal consistency was assessed using Cronbach’s alpha (α).
We then evaluated construct validity by examining correlations between scales, hypothesizing that different indicators of similar domains (e.g., K-6 and IDSS, both measures of distress) would be highly correlated; indicators of unique but related domains (e.g., IDSS and WHODAS, measuring distress and functional impairment) would be moderately correlated; and, indicators of positive wellbeing (e.g., PWI) would be negatively correlated with distress but positively correlated with each other. Lastly, we examined correlates of psychosocial health by regressing each scale (as separate models) onto a consistent set of demographic characteristics in multiple linear regression models with cluster-robust standard errors to account for clustering. Due to low missingness (e.g., <1%), listwise deletion was used.
A total of 600 Rohingya mothers, ranging in age from 18 to 46, were enrolled in the study and completed baseline assessments. Sample characteristics are reported in Table 1. Women had been living in the camp anywhere from 1-34 years (mean = 5.7, SD = 4.2). The vast majority of women were married (95.8%) and did not work outside the home (96.3%). Most also had no (74.3%) or limited (some primary; 20.5%) formal education. Women reported having anywhere from 1 to 9 children (mean = 3.2, SD = 1.7). Their index child (the child for whom they were receiving ACF supports and attending the BFS program) ranged in age from 6-24 months (mean = 11.4, SD = 4.5) and was more often female (65.5%) than male. Eleven percent (n = 66) of women were currently pregnant. Only 21% of women reported that their family ate meat on at least a weekly basis; 55% reported monthly meat consumption, and 24% less than once per month. The full raw data can be found under Underlying data.72
International Depression Symptom Scale Items. Parallel analysis identified two eigenvalues above 1, although the second was marginal (adjusted: 1.03) and fell below that of the simulated value in Horn’s analysis. To explore potential multidimensionality, we fit a 2-factor EFA model. Factor loadings and uniqueness are provided in Table 2. Most items loaded highly onto a single factor that accounted for 84% of the variance, with only the three exploratory somatic items loading onto the second factor. As these items were excluded from prior short versions of the IDSS in Myanmar, we exclude these items. Additionally, the item assessing recent thoughts of self-harm had high uniqueness (.82) with no substantial loading onto either factor. In prior use of the Myanmar short instrument this item has been retained for clinical purposes but not factored into scale scores; we opted to do the same for the current study. Thus, the final IDSS scale score was comprised of eight items (IDSS-8) with high internal consistency (α = .87).
Item | Factor 1 | Factor 2 | Uniqueness |
---|---|---|---|
.67 | .47 | ||
.66 | .45 | ||
.68 | .55 | ||
.45 | .33 | .50 | |
.68 | .54 | ||
-- | .58 | .56 | |
-- | .69 | .59 | |
-- | .65 | .56 | |
.77 | .38 | ||
.85 | .36 | ||
.79 | .37 | ||
| .36 | .82 | |
Correlation between F1 and F2 | .61 |
Kessler 6-item Scale. Parallel analysis of the K-6 identified only one eigenvalue above 1. In a single-factor model, all six items had satisfactory factor loadings (.57-.69) and satisfactory internal consistency (α = .78).
Combined IDSS/K6. Using the items retained from the separate IDSS and K-6 EFAs, parallel analysis of the combined items supported a single factor structure that accounted for nearly 94% of the variance. Factor loadings for contributing items ranged from .55 to .79, and internal consistency for the 14-item combined scale was high (α = .90).
WHO Disability Assessment Schedule 2.0. Parallel analysis identified three adjusted eigenvalues over 1, although the 3rd was marginal (1.01) and fell below the simulated value. The first two factors together accounted for 95% of variance. As such we fit a 2-factor EFA (Table 3). Six unique items loaded on each factor sufficiently, and internal consistency of the two subscales were .73 and .78, respectively. Factor 1 was comprised of items reflecting aspects of self-care (e.g., washing your whole body) and social interaction (e.g., joining in community activities), and appeared to present less difficulty (mean = .37, SD = .44), whereas Factor 2 was comprised of more task-oriented items (e.g., standing, walking, household activities) for which a higher level of impairment was reported (mean = .78, SD = .63). However, the factors were moderately correlated (r = .49), and internal consistency was improved for the full scale (α = .81); as such, we opted to explore the relative contribution of each factor to the overall scale score and construct validity, but retained the single full-scale score for regression analysis.
Item | Factor 1 | Factor 2 | Uniqueness |
---|---|---|---|
-- | .66 | .62 | |
-- | .68 | .46 | |
.65 | -- | .52 | |
.77 | -- | .38 | |
-- | .53 | .61 | |
-- | .60 | .50 | |
-- | .70 | .55 | |
.54 | .32 | .43 | |
.67 | -- | .33 | |
.75 | -- | .56 | |
.63 | -- | .46 | |
-- | .68 | .44 | |
Correlation Between F1 and F2 | .49 |
Personal Wellbeing Index. Parallel analysis clearly supported a single factor model with only one eigenvalue over 1 and X% of the variance explained. The six standard items all loaded highly, with factor loadings ranging from .61 to .78. The optional item assessing satisfaction with religion did not load at or above .4 and had high uniqueness (.96) and was thus excluded. Cronbach’s alpha with that item excluded was .85.
Brief COPE and Local Coping Items. Parallel analysis for all coping items (inclusive of locally developed items) indicated four adjusted eigenvalues greater than one, although two of these fell at or below those of the randomly generated set. We first fit 4- and 3-factor models, yet each of these models produced one factor with only a single sufficiently loading item. Thus, we fit a more parsimonious 2-factor (shown in Table 4) in which standard COPE items loaded on Factor 1 and locally developed items on Factor 2, with the exception that finding comfort in religion (a standard item) loaded with the locally developed items. A number of items did not load on either factor (e.g., substance use, making jokes). The internal consistency of Factor 1 was satisfactory (α = .84) and all contributing items had positive item-rest correlations. This was unexpected, given that the items reflected a blend of presumably adaptive (e.g., getting comfort, accepting reality) and maladaptive (e.g., criticizing yourself, giving up) strategies which would make a cumulative score not hold utility. Given this, we further examined scale psychometrics using theoretical or previously demonstrated subscales (e.g., emotion-focused, problem-focused, avoidant, adaptive, maladaptive), none of which produced satisfactory scale properties (results not shown). We then examined the internal consistency of only those theoretically adaptive items loading onto Factor 1, and were able to identify a theoretically useful 10-item subscale of adaptive Brief COPE items (i.e., “B-COPE”, α = 81). A similar effort to identify a maladaptive subset was unsuccessful (e.g., 3-items, α = .47). Using Factor 2, we were able to generate a second 4-item subscale of locally derived items (i.e., “L-Cope”) after excluding “solving disputes”, which had low factor loading and high uniqueness. Removal of that item increased the subscale’s internal consistency from α = .74 to α = .76. We then re-ran the parallel analysis and EFA using only those items retained across the two scales, and items continued to clearly organize themselves across the two distinct factors. As the factors were only minimally correlated, (r = .24), we opted to treat these as two separate constructs for analysis.
Item | Factor 1 | Factor 2 | Uniqueness |
---|---|---|---|
| .64 | -- | .59 |
| .60 | -- | .62 |
.63 | -- | .64 | |
| .62 | -- | .64 |
| .54 | -- | .73 |
.45 | -- | .76 | |
| .82 | -- | .35 |
| .61 | -- | .66 |
.50 | -- | .68 | |
| .63 | -- | .58 |
-- | 36 | .72 | |
| .60 | -- | .67 |
| .43 | -- | .75 |
| -- | .79 | .37 |
| .67 | -- | .52 |
-- | -- | .93 | |
-- | .78 | .43 | |
| -- | -- | .92 |
| .40 | .76 | |
| -- | -- | 1.00 |
-- | .67 | .56 | |
-- | .85 | .31 | |
| -- | -- | .90 |
Correlation Between F1 and F2 | .31 |
Descriptions of each of the final scales, including the number of contributing items, internal consistency, possible and observed score ranges, and sample distributions are reported in Table 5. Distress and functional impairment were relatively low, with moderate subjective wellbeing ratings. Comparing score distributions among like constructs, the Myanmar-specific IDSS items had a slightly higher mean relative to the K-6 (1.03 vs. .77). The two subjective wellbeing scores performed relatively similarly, as evidenced by similar means (6.89 vs. 6.67). There was a larger variation between the two measures of coping, with scores indicating more frequent use of strategies included in the local coping scale (mean = 2.3) relative to the Brief COPE items (mean =1.06).
Correlations between scores, as shown in Table 6, aligned with our hypotheses aside from coping. For example, the IDSS-derived scale and K-6 (i.e., measures of distress) were highly correlated with each other (r = .71), and both moderately correlated with total functional impairment (IDSS: r = .65; K-6: r = .58), although these correlations appeared to be driven largely by WHODAS Factor 2 (r range .61 to .70) rather than Factor 1 (r range .31 to .34). PWI scores, reflecting a composite score of satisfaction with various life domains, were likewise positively correlated with a global rating of life satisfaction (r = .68). Both measures of subjective wellbeing were negatively correlated with both measures of distress and functional impairment (r range: -.35 to -.44). However, the two coping measures were not highly correlated with each other (r = .27) and had seemingly divergent relationships with other scales. For example, the Brief COPE items appeared to be more highly – and positively – correlated with functional impairment and distress (r range from.26-.38), with very low – but negative – correlations with subjective wellbeing (r = -.09 and -.17). The local coping scale had lower positive correlations with distress, but also slight positive correlations with subjective wellbeing (r = .09 and .17).
Adjusted associations between demographic characteristics and scale scores are reported in Table 7. Controlling for other factors, demographic characteristics most strongly and consistently associated with psychosocial outcomes included household meat consumption, pregnancy status, and marital status. Relative to mothers from households with the least access to meat, mothers from households eating meat on either a monthly or more frequent basis reported significantly lower distress (β range from -.18 to -.30, most p < .05) and significantly higher subjective wellbeing (β range from .75 to 1.85, all p < .05). Mothers who were currently pregnant also reported significantly higher distress (β range from .18 to .22, most p < .05) and lower wellbeing (SWL: β = -.47; PWI: -.32), though not all estimates were not statistically significant at p < .05. Likewise, the small number of unmarried mothers in the sample reported seemingly higher distress (β range from .51 to .53) and functional impairment (β = .36), and lower subjective wellbeing (SWL: β = -1.42; PWI: -1.09), though these associations were not statistically significant (p < .10). Other characteristics were less associated with distress and wellbeing, but meaningfully associated with other outcomes. For example, mothers with higher educational attainment reported lower functional impairment (completed primary: β = -.23, p < .01) and higher use of coping strategies (any primary: β = .31; completed primary: β = .44; both p < .01). Length of time in displacement had a small but positive association both with indicators of distress (all β = .02, p < .05) and coping (β = .01 and.03, p < .05). Other correlates of coping strategies were less consistent and largely not significant.
IDSS | K-6 | IDSS/K-6 | WHODAS | SWL | PWI | B-COPE | L-COPE | |
---|---|---|---|---|---|---|---|---|
β (SE) | β (SE) | β (SE) | β (SE) | β (SE) | β (SE) | β (SE) | β (SE) | |
Mother’s Age | .01 (.01)† | .00 (.01) | .01 (.01) | -.00 (.00) | -.01 (.02) | .02 (.02) | .01 (.01) | .00 (.01) |
Years living in camp | .02 (.01)* | .02 (.01)** | .02 (.01)** | -.00 (.01) | -.01 (.03) | -.01 (.02) | .01 (.01)* | .03 (.01)* |
Number of Children | .03 (.02) | .01 (.02) | .02 (.02) | .02 (.01) | -.09 (.08) | -.14 (.07)† | .00 (.02) | .03 (.03) |
Child Sex | ||||||||
Male | REF | REF | REF | REF | REF | REF | REF | REF |
Female | .05 (.05) | .02 (.04) | .04 (.04) | .10 (.02)** | .15 (.21) | .03 (.07) | .12 (.05)* | .01 (.06) |
Child Age | .01 (.01) | -.00 (.01) | .00 (.01) | -.00 (.00) | .02 (.02) | -.01 (.02) | -.00 (.00) | .02 (.01) |
Education | ||||||||
None | REF | REF | REF | REF | REF | REF | REF | REF |
Some Primary | .10 (.07) | .09 (.06) | .10 (.07) | .05 (.10) | -.42 (.35) | -.12 (.27) | .31 (.09)** | -.04 (.13) |
Completed Primary | .05 (.12) | .11 (.13) | .08 (.11) | -.23 (.06)** | .17 (.64) | .33 (.59) | .44 (.15)** | .19 (.16) |
Marital Status | ||||||||
Married | REF | REF | REF | REF | REF | REF | REF | REF |
Other | .51 (.29) | .53 (.29)† | .52 (.29) | .36 (.17)† | -1.42 (.65)† | -1.09 (.48)† | .32 (.17)† | .23 (.14) |
Employment Status | ||||||||
No employment | REF | REF | REF | REF | REF | REF | REF | REF |
Any employment | .40 (.18)† | .17 (.11) | .30 (.14)† | .30 (.12)* | .08 (.28) | .03 (.22) | -.03 (.17) | .09 (.20) |
Household Meat Freq. | ||||||||
Less than monthly | REF | REF | REF | REF | REF | REF | REF | REF |
Monthly | -.21 (.10)† | -.18 (.07)* | -.20 (.08)* | -.03 (.06) | 1.13 (.29)** | .75 (.28)* | -.09 (.09) | .16 (.17) |
Weekly | -.30 (.11)* | -.28 (.07)** | -.29 (.08)** | -.15 (.08) | 1.85 (.32)** | 1.24 (.20)** | -.18 (.09)† | .37 (.23) |
Pregnant | ||||||||
No | REF | REF | REF | REF | REF | REF | REF | REF |
Yes | .22 (.07)* | .18 (.10)† | .20 (.08)* | .08 (.05) | -.47 (.24)† | -.32 (.16)† | .02 (.06) | .16 (.09)† |
MHPSS research in humanitarian settings has historically been dominated by a narrow focus on distress-oriented outcomes, particularly depression and post-traumatic stress. This focus has facilitated the availability of more frequently tested assessment tools for these outcomes, with demonstrated validity across a range of cultures and contexts,49 that have helped create a relatively strong evidence base for the treatment of distress with psychotherapy. However, even assessments of distress conducted with Rohingya people have typically not been validated against clinical criteria,18,19,38,50–55 resulting in a lack of case identification tools for use in practice. Moreover, to continue to build a similar evidence base for a broader range of prevention and promotion interventions, there is a great need for progress in establishing instrument validity and utility for the psychosocial outcomes that should more often be directly targeted by those interventions, such as subjective wellbeing, functioning, and coping. Further, there is a critical need to test these instruments that are underrepresented in psychometric studies to determine where and how to balance use of standardized tools that support comparability across studies with the need for capturing meaningful experiences that are highly contextual.
This study, conducted with a community-based, non-clinical sample of Rohingya refugee mothers, provides a valuable illustration of this tension. Though not locally validated, the included distress and wellbeing instruments generally showed remarkable consistency with both theoretical expectations and prior performance. For example, our finding that the three exploratory somatic items (headache, stomach ache, and other bodily aches) did not load together with the core IDSS items from the Myanmar-Wide Short Version aligns with prior research among different ethnic groups in Myanmar,35,36 demonstrating the consistency of this instrument across populations. The IDSS-derived scale and K6 were also highly correlated, items had good internal consistency as a single scale, and both measures aligned with hypothesized expectations about their relationships with functional impairment and subjective wellbeing. This consistency in performance likely reflects the long history of research that has informed the development of these distress-based measures;33 we note that in fact the IDSS typically does include items similar to those on the K-6, and the use of both in this study – and resulting removal of redundant IDSS items – was due to outcome alignment across the broader learning collaborative. However, the items retained on the IDSS-derived scale did appear to be more sensitive to maternal distress than the K-6 as evidenced by the higher mean scores.
The subjective wellbeing measures, likewise, were well correlated with one another and findings support their construct validity. Additionally, that the magnitude of correlation between measures of subjective wellbeing and distress was relatively moderate, supporting conceptualization of subjective wellbeing as a distinct construct rather than simply the inverse of distress.56 Yet, the higher variation on the global SWL rating may indicate that the domain-based PWI is failing to capture certain aspects of life among the Rohingya in Cox’s Bazar that they are factoring into their perceptions of wellbeing. Moreover, the PWI item on satisfaction with what achievements in life seemed so irrelevant and difficult to understand during piloting that it was not recommended for use. It is thus plausible that a scale developed based on locally sourced domains of relevance would hold notable variations from standard instruments; such work would be valuable in the future.
Similar to the PWI, the WHODAS also performed satisfactorily but not without some questions regarding the overall utility and optimal scale composition in this population. The WHODAS is one of the most common assessments of functional impairment worldwide,57 with good evidence for reliability and validity across a range of cultures, supporting use of full and comparable scale scores when possible. Yet there is also evidence that it is a multidimensional instrument for which the factor structure may vary, suggesting that different factors may be driving overall scores across different groups.48 In this study, it appeared that a subset of role or task-oriented items (walking, standing, household tasks, daily school, or work) for which impairment was higher loaded onto a separate factor and drove the relationships between distress and functional impairment, whereas basic self-care and social engagement were less impaired and less closely related to distress. This may reflect the influence of a community-based sample with relatively low rather than clinical levels of distress and impairment. It is noteworthy that the women in this study were recruited from a program seeking to improve child’s health and development by improving caregiver functioning; that role or task-oriented items were more likely to be impaired aligns with this theoretical approach. That tasks of daily living are particularly context- and role-driven, locally developed tools that value contextual relevance over cross-cultural comparability are likely to be particularly useful.58
The most problematic assessment we encountered was that of coping. Coping strategies, like expectations for functioning, are likely to be highly dependent on cultural beliefs and practices, as well as what is feasible and accessible in a given setting.59 Whereas we sought to include standard items from the Brief COPE, multiple items were found to be irrelevant or simply not endorsed and thus removed, while the remaining standard items did not cohere together in meaningful ways. While we pragmatically identified a subset of “adaptive” items that appeared to have good internal consistency, its relationship with other constructs, particularly temporal relationships that we could not account for in cross-sectional data, was unclear. For example, do observed positive correlations between coping and distress potentially support validity as the presence of distress necessitates leveraging coping strategies, or would construct validity be supported if coping and distress were inversely related because coping protects from distress? How are similar positive relationships between coping and both distress and subjective wellbeing accounted for? The endorsement of strategies included on the COPE was relatively limited compared to items added from local qualitative research18 these latter items reflected a combination of physical, mental, and spiritual self-care strategies that were highly endorsed, but seemingly irrespective of other characteristics and thus potentially less sensitive to change. Moreover, some locally reported coping strategies also seemed to conflate coping and functioning (e.g., solving disputes, practicing good hygiene); as improved coping is likely to be a key intermediate outcome for many psychosocial interventions.11 As such, there is a critical need to improve the conceptualization and measurement of coping in humanitarian settings, which our findings suggest will require in-depth qualitative research that seeks to understand how participants conceptualize the relationship between coping and distress and the types of strategies with the most relevance and utility in a given context.
Beyond scale psychometrics, the current study also identified factors associated with the measured psychosocial outcomes. The Rohingya community in Bangladesh faces substantial political and societal opposition to integration.59–61 This reception, paired with a rapid influx of newly displaced Rohingya refugees that have further strained relations and resources in recent years, has contributed to stressful very stressful living conditions.17 Within this context, among food-insecure mothers with limited education or employment opportunities, key risk factors associated with poorer psychosocial health included relative socioeconomic disadvantage, current pregnancy, and being unmarried. These findings align with previous research indicating that Rohingya refugee women are especially vulnerable and even more so if they are separated, widowed or divorced due to numerous reasons ranging from poorer economic opportunities to increased gender-based violence.62,63 Recalling that these women were all mothers to children aged 6-24 months, that current pregnancy emerged as a clear risk factor may indicate a lack of women’s empowerment in reproductive health and planning. Also related to female empowerment, higher educational attainment was associated with less functional impairment and greater use of coping strategies, consistent with previous studies amongst other refugee groups.59,64 While employment may have been expected to follow a similar pattern, this was not shown in our data potentially due to restrictions on employment in the camps such that very few women reported employment, and it is possible that women seeking such opportunities do so in the absence of other supports. Gender preferences favoring male children may further explain why this sample – presenting for child malnutrition – reflected a higher proportion of female children and why having a female child in some cases appeared to be associated with poorer maternal psychosocial health.65
A key limitation of this study is a lack of formative qualitative work that would have helped to better refine instruments and potentially explain difficult findings. Due to our participation in a broader consortium of research projects, we were limited in our choice of measures and faced time constraints that precluded more intensive instrument development work. Additionally, although the instruments were piloted and refined with careful consideration to the interview process and, for the most part, did show good psychometric properties, the complexities of the adaptation and translation process did not fully align with best practices and could have introduced inconsistencies.66,69 Overall, the field is in great need of more attention to the development of both qualitatively grounded instruments assessing a range of psychosocial outcomes of interest, as well as clinical validation of pathology-oriented measures,70,71 both needs we were unable to address.
This study also relied on a convenience sample of mothers enrolled in a longer intervention study, which imposed some inclusion restrictions that would not have otherwise been included (e.g., excluding women referred to outside support for severe distress or suicidality). Thus, this study likely failed to capture the full range of distress that would be observed in a representative community-based sample. This could alter the observed factor structures and also makes it an inappropriate sample to attempt to establish clinical cut-points. However, the sample does represent a population that could be targeted with prevention and promotion interventions.
Lastly, we took an exploratory approach to factor analysis given the changes made to the instruments and the limited prior research with this population; future research can build upon this work to take a more confirmatory approach. It is worth noting that given the extent of post-pilot instrument modifications to remove many items from the IDSS and the Brief COPE and the exploratory factor analytic approach, the resulting scales can only be considered as approximations of the originating tools and cannot be presumed to fully represent the constructs those tools were originally designed to measure.
Among a group of displaced Rohingya mothers seeking nutrition treatment for their malnourished young children, a set of standard assessment tools performed adequately to assess a range of psychosocial outcomes but did not appear to fully capture local experiences. Some items included on standard measures were seldom endorsed, suggesting less relevance. Most critically, both standard and locally developed coping items showed weak evidence of construct validity and utility. Testing and reporting on the validity of outcome instruments intended for use in program evaluation and intervention research is a critical step in addressing current knowledge gaps regarding the effectiveness of many commonly implemented, broadly supportive psychosocial programs that do not necessarily target mental illness. Findings highlight the need for further mixed methods research to develop a rich battery of instruments with cross-cultural validity, particularly for positive psychosocial wellness outcomes. Such efforts should focus on both culturally grounded measures of a range of psychosocial outcomes of interest, as well as clinical validation of tools intended to be used for case identification and treatment purposes.
OSF: Measuring the psychosocial wellbeing of Rohingya mothers in Cox’s Bazar, Bangladesh: Psychometric properties of an MHPSS assessment battery, https://doi.org/10.17605/OSF.IO/AMDF2.72
This project contains the following underlying data:
OSF: Measuring the psychosocial wellbeing of Rohingya mothers in Cox’s Bazar, Bangladesh: Psychometric properties of an MHPSS assessment battery, https://doi.org/10.17605/OSF.IO/AMDF2.72
This project contains the following extended data:
Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Refugee mental health, global mental health, humanitarian mental health and psychosocial support
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Partly
Are sufficient details of methods and analysis provided to allow replication by others?
Partly
If applicable, is the statistical analysis and its interpretation appropriate?
I cannot comment. A qualified statistician is required.
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
References
1. Borsa J, Damásio B, Bandeira D: Adaptação e validação de instrumentos psicológicos entre culturas: algumas considerações. Paidéia (Ribeirão Preto). 2012; 22 (53): 423-432 Publisher Full TextCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: Public Mental Health in Humanitarian settings, Cultural Validation and Adaptation of Psychological Measures, Task shifting - Training and Supervision, Moral Injury, and Mental Health at Work
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Partly
References
1. Riley A, Varner A, Ventevogel P, Taimur Hasan MM, et al.: Daily stressors, trauma exposure, and mental health among stateless Rohingya refugees in Bangladesh.Transcult Psychiatry. 2017; 54 (3): 304-331 PubMed Abstract | Publisher Full TextCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: Refugee mental health, global mental health, humanitarian mental health and psychosocial support
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | ||
---|---|---|
1 | 2 | |
Version 2 (revision) 30 Jan 24 |
read | |
Version 1 29 Nov 22 |
read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)