Keywords
Quality of life, WHOQOL-OLD, Validity, Reliability, Old age people, Ethiopia
This article is included in the Health Services gateway.
This article is included in the Global Public Health gateway.
Advances in the public health sector, along with changes in clinical interventions, have resulted in a rise in life expectancy in almost every area of the world.1 People are living longer around the world, but they are not necessarily healthier.2,3 At the same time, the number of years spent living with impairments and chronic illnesses is increasing. The health of old age people changes more frequently and more rapidly as they live longer, which affects their quality of life (QOL).4,5
The World Health Organization (WHO) described QOL as “individuals’ perception of their position in life in the context of the culture and value systems in which they live and in relation to their goals, expectations, standards, and concerns”.6 However, assessing and improving QOL in old age is a difficult undertaking. This is related to the complexity of the QOL concept, the large number of available instruments, and the subjectivity with which older people and healthcare practitioners judge health.7–9 Despite this, if old age people keep their independence, autonomy, and good physical health, remain active, find purpose in their lives, and fulfill their social obligations, their QOL may be good or at least maintained.10–12
The WHOQOL-OLD was developed specifically for measuring QOL in old age people,13 and its original form contains a total of 24 items assembled into six domains, each with four items: autonomy (AUT), past, present, and future activities (PPF), sensory abilities (SAB), social participation (SOP), death and dying (DAD), and intimacy (INT).14
There is a vast disparity across countries in the QOL reported by old age people aged 60 years and above.9 Sensory abilities and intimacy were the highest-scoring QOL sub-scales in high-income countries,15–17 while social participation (SOP) was the highest-scoring QOL sub-scale in low-income countries.18,19
Available studies in Ethiopia did not use the WHOQOL-OLD. They instead used other tools, such as the medication-related quality of life (MRQoL),20 Control, Autonomy, and Self-realization (CASP),21 and the World Health Organization Quality of Life-brief version (WHOQOL-BREF).22,23 To the best of our knowledge, these tools were neither developed for nor rigorously validated among Ethiopian old age people. Moreover, the accessible tools for evaluating QOL are usually designed and validated in developed nations, whose cultural, socio-economic, and living standards differ from those of African nations. Furthermore, the majority of old age Africans are illiterate, making it difficult to use QOL questionnaires that require users to read and write.24
The lack of validated instruments undermines the accuracy of the data generated and its extrapolation to a larger population, as well as the ability to compare findings across studies. Consequently, low-quality data can have a detrimental impact on policies and services, as well as on the efficient use of resources.25 Therefore, this study aimed to translate and validate the WHOQOL-OLD tool for Ethiopian old age people.
This study was conducted in Bahir Dar City, the capital of the Amhara Regional State in Northwest Ethiopia. Bahir Dar is 565 kilometers away from Addis Ababa, the capital city of Ethiopia.
A cross-sectional study was conducted from January 16 to March 13, 2021.
This study used two population groups. The first group comprised healthcare experts, who performed content validation, and the second comprised community-dwelling old age people, who took part in psychometric validation. For the expert judgment, 10 healthcare experts were purposively selected based on the guideline recommendation for the Delphi technique.26 For the psychometric validation, a participant-to-variable ratio of 10:1 was followed as a rule of thumb.27 Since the mini nutritional assessment tool has 18 items, a minimum of 180 study participants were selected, and the same study population was used for this WHOQOL-OLD validation study. Participants were community-dwelling old age people selected by multistage cluster sampling from Belay Zeleke, one of the sub-cities of Bahir Dar City, Northwest Ethiopia. Community-dwelling people aged 60 years and above who had lived in the city administration for at least six months, were capable of describing their lived experience, and were able to understand and speak the local Amharic language were included. Those with significant spine curvature (scoliosis or kyphosis) and those with both extremities amputated were excluded. The detailed methods for the study population, sample size, and sampling procedures were described in a previous study.28
This tool validation study was conducted in three stepwise phases. The first phase was to review existing QOL assessment tools for old age people. In the second phase, selection, translation, and review of the tool by experts were conducted. In the last phase, psychometric validation among community-dwelling old age people was performed.
Quality of life (QOL) has been conceived and assessed in a variety of ways based on the paradigm, discipline, target community, and time frame of the study investigating it.29 Around the world, numerous tools have been established for measuring QOL in adults and validated for the elderly.9,30 In Africa alone, 14 unique tools were identified from 22 studies to measure QOL in old age people.24 Furthermore, instruments have been developed specifically for measuring QOL in old age people, including the WHOQOL-OLD,13 the Elderly Quality of Life Index (EQLI),31 the Older People’s Quality of Life (OPQOL) questionnaire,32 and the World Health Organization Quality of Life-AGE questionnaire (WHOQOL-AGE).33
The original form of the WHOQOL-OLD contains a total of 24 items assembled into six domains, each with four items: autonomy (AUT), past, present, and future activities (PPF), sensory abilities (SAB), social participation (SOP), death and dying (DAD), and intimacy (INT). The module mostly covers a two-week recall period and can be self-reported or interviewer-administered. Although each item is rated on a Likert scale of 1 to 5, the items differ in their anchors. Each domain provides an individual score ranging from 4 to 20. The domain scores can also be converted to a scale of 0 to 100. Furthermore, summing the individual item values yields total scores from 24 to 120, with higher scores indicating better QOL.14
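The score ranges described above can be illustrated with a minimal Python sketch. It assumes a simple linear rescaling to the 0 to 100 range, which is one common way to do the conversion; the function names `domain_score` and `rescale_0_100` and the example responses are hypothetical and are not taken from the WHOQOL-OLD manual.

```python
def domain_score(item_values):
    """Sum the four 1-5 item responses of one domain (possible range 4-20)."""
    if len(item_values) != 4:
        raise ValueError("each WHOQOL-OLD domain has exactly four items")
    if any(v not in (1, 2, 3, 4, 5) for v in item_values):
        raise ValueError("item responses must be integers from 1 to 5")
    return sum(item_values)


def rescale_0_100(raw, lowest, highest):
    """Linearly map a raw score from [lowest, highest] onto [0, 100].

    Assumption: a plain linear transformation; check the scoring manual
    before using this for real data.
    """
    return (raw - lowest) / (highest - lowest) * 100


# Hypothetical responses for a single domain
raw = domain_score([3, 4, 5, 2])          # 14
print(raw, rescale_0_100(raw, 4, 20))     # domain score on both scales
print(rescale_0_100(96, 24, 120))         # a total score of 96 on the 0-100 scale
```

Under this assumption, the lowest possible total (24) maps to 0 and the highest (120) maps to 100.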
The WHOQOL-OLD instrument was chosen from the available QOL measurement tools to translate and culturally adapt for the context of our community because it: (1) is designed specifically for elderly people;13 (2) is the most comprehensive multidimensional instrument that covers multiple components of QoL;13,14,34 (3) contains items that are particularly relevant for old age people and are absent from the other instruments, such as autonomy, intimacy, and death and dying;13 (4) is subjective and culturally sensitive;35,36 (5) showed good reliability and validity in the assessment of QOL for older participants with multi-language versions;37,38 and (6) is freely available for research use.14
The English version of the WHOQOL-OLD questionnaire was first translated into Amharic, the local mother tongue, independently by a bilingual internist and a bilingual human nutritionist, both trained at master’s degree level. These two translators were selected because they are experienced in providing care for old age people and in nutrition research, respectively, and were expected to be familiar with the intent of each item and of the tool as a whole. The two Amharic versions were then combined, and any inconsistencies were settled by consensus. The translated Amharic version was next translated back into English to ensure the accuracy of the translation. This was done by two other independent bilingual, native Amharic-speaking language translators trained at master’s degree level. Finally, the expert group reviewed both versions of the translations and reached agreement on all items to produce the final version of the translated questionnaire (Figure 1).
Data were collected from two groups, healthcare experts and community-dwelling old age people, using an exploratory mixed-methods (qualitative and quantitative) approach. Each expert evaluated the content validity of the tool through face-to-face contact. The comments of the experts and old age people were used to refine the wording, grammar, clarity, scoring, and applicability of the items. After incorporating the experts’ comments, psychometric validation was conducted among community-dwelling old age people.
Six urban health extension workers and six bachelor of science nurses collected the data after two days of training. The principal investigator and a master’s degree trained nutritionist supervised the data collection process. The data were collected through face-to-face interviews using the standardized Amharic version of the questionnaires. Assistance from family members or caregivers was also used.
The International Business Machines Corporation Statistical Package for the Social Sciences (IBM SPSS) version 23 (ref. 39; RRID:SCR_002865, URL: http://www-01.ibm.com/software/uk/analytics/spss/) and its extension, Analysis of Moment Structures (AMOS), with the maximum likelihood estimation method,40 were used to analyze the data. Socio-demographic characteristics of the study participants were summarized with descriptive statistics. The statistical analysis of the WHOQOL-OLD tool was done in stages. First, the values for all negatively phrased items (items 1, 2, 6, 7, 8, 9, and 10 on the tool) were reverse-scored to match the values for positively phrased questions. Second, the statistical assumptions of normality and absence of outliers were verified. Using the squared Mahalanobis distance (d2) greater than 0.05 for each item,40 no severe multivariate outliers were discovered, and none were deleted. Furthermore, normalized kurtosis values and critical ratios of less than 5.00 indicated that the data were normally distributed.40 Third, total and mean scores were computed for each domain. Finally, the overall total score was transformed to a scale of 0 to 100.
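The reverse-scoring step can be sketched in Python as follows. The sketch assumes the usual convention for a 1 to 5 Likert scale, in which a reversed value is 6 minus the original value; the function name `reverse_score` and the example responses are hypothetical.

```python
# Item numbers listed in the text as negatively phrased
REVERSED_ITEMS = {1, 2, 6, 7, 8, 9, 10}


def reverse_score(responses):
    """Return a copy of `responses` with negatively phrased items flipped.

    `responses` maps item number -> raw answer on a 1-5 scale.
    Assumption: reversal is computed as 6 - value (1<->5, 2<->4, 3 stays 3).
    """
    return {
        item: (6 - value if item in REVERSED_ITEMS else value)
        for item, value in responses.items()
    }


# Hypothetical answers for three items
print(reverse_score({1: 1, 3: 4, 10: 5}))
```

Items that are not in the reversed set pass through unchanged, so the function can be applied to a full 24-item response record.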
Content validity and acceptability
To assess the acceptability of the Amharic version of the WHOQOL-OLD, the response rate and the floor and ceiling effects of summary scores were examined. Floor or ceiling effects were considered present if more than 15% of respondents obtained the lowest possible score (worst health) or the highest possible score (best health), respectively.41
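As an illustration of the 15% criterion, the following Python sketch computes the percentage of respondents at the lowest and highest possible scores of a scale. The function name `floor_ceiling` and the example scores are hypothetical; only the 15% threshold comes from the text.

```python
def floor_ceiling(scores, lowest, highest, threshold=15.0):
    """Percentage of respondents at the scale minimum and maximum.

    An effect is flagged when the percentage exceeds `threshold` (15% here).
    """
    n = len(scores)
    floor_pct = 100 * sum(s == lowest for s in scores) / n
    ceiling_pct = 100 * sum(s == highest for s in scores) / n
    return {
        "floor_pct": floor_pct,
        "ceiling_pct": ceiling_pct,
        "floor_effect": floor_pct > threshold,
        "ceiling_effect": ceiling_pct > threshold,
    }


# Hypothetical domain scores (possible range 4-20) for 20 respondents
scores = [4] * 2 + [12] * 17 + [20] * 1
print(floor_ceiling(scores, lowest=4, highest=20))
```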
Construct Validity
Exploratory and confirmatory factor analyses were performed, in that order, to check construct validity. Principal component analysis (PCA) with Promax rotation was performed to evaluate the sample adequacy and to check whether the items in the translated questionnaire were organized comparably to those in the original questionnaire. Oblique rotation was used rather than orthogonal rotation since we expected the factors of the tool to be intercorrelated, as previously verified by other studies.17,42 The Kaiser-Meyer-Olkin (KMO) test, at a minimum level of 0.60, was used to determine whether the items were sufficiently correlated to allow for factor analysis.43 Bartlett's test of sphericity, with a p-value less than 0.05, was used to examine the inter-correlations between items. In addition, the eigenvalue-greater-than-one rule and a visual review of the scree plot were employed to decide the number of factors to retain. To assess component affiliation, items had to be related to a single component, and each rotated component had to have at least four items. A proportion of explained variance of more than 60% was used to judge the factors' ability to describe the data.43
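For readers unfamiliar with Bartlett's test of sphericity, the sketch below computes its test statistic and degrees of freedom from a correlation matrix using the standard formula, chi-square = -[(n - 1) - (2p + 5)/6] ln|R| with p(p - 1)/2 degrees of freedom. It is a plain-Python illustration, not the SPSS routine used in the study; the function names and the small example matrix are hypothetical, and the p-value lookup is left out.

```python
import math


def log_det(matrix):
    """Log-determinant of a positive-definite matrix by Gaussian elimination."""
    a = [row[:] for row in matrix]
    size = len(a)
    total = 0.0
    for k in range(size):
        pivot = a[k][k]
        if pivot <= 0:
            raise ValueError("matrix is not positive definite")
        total += math.log(pivot)
        for i in range(k + 1, size):
            factor = a[i][k] / pivot
            for j in range(k, size):
                a[i][j] -= factor * a[k][j]
    return total


def bartlett_sphericity(corr, n_obs):
    """Return (chi-square statistic, degrees of freedom) for Bartlett's test."""
    p = len(corr)
    statistic = -((n_obs - 1) - (2 * p + 5) / 6) * log_det(corr)
    df = p * (p - 1) // 2
    return statistic, df


# Hypothetical 2 x 2 correlation matrix with n = 180 respondents
print(bartlett_sphericity([[1.0, 0.5], [0.5, 1.0]], n_obs=180))
```

With 24 items the formula gives 24 × 23 / 2 = 276 degrees of freedom.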
The data were then exported to AMOS version 23 for confirmatory factor analysis (CFA).40 A predefined six-factor model in first and second-order CFA was used to test the construct validity of the Amharic version of the WHOQOL-OLD tool. The first model is a congeneric measuring model that depicts the six-factor structure in which each item on the questionnaire was linked to the underlying latent construct of its predicted aspect. The second-order factor was introduced to see if the construct “QOL” could be represented by a single dimension.43
At least one test from each of the four typical categories of model fit indexes was used to judge the acceptability of the CFA models. These included the chi-squared test (X2) for overall model fit; the goodness-of-fit index (GFI), the root mean square error of approximation (RMSEA), or the standardized root mean square residual (SRMR) from the absolute fit indexes; the comparative fit index (CFI), the normed fit index (NFI), the non-normed fit index (NNFI), or the Tucker-Lewis index (TLI) from the relative or incremental fit indexes; and the Akaike Information Criterion (AIC) or Bayesian Information Criterion (BIC) from the predictive fit indicators.44–46 The recommended model is usually the one with the lowest AIC and BIC values46 and an RMSEA of less than 0.08.44,45 GFI, CFI, NFI, and NNFI scores above 0.90, especially those near one, indicated good fit.44,45
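The cut-offs stated above can be expressed as a small Python check. This is only a sketch of the decision rule (RMSEA below 0.08; GFI, CFI, NFI, and NNFI above 0.90); the function name `check_fit` is hypothetical, and the example values are the fit indices reported later in this article.

```python
def check_fit(indices):
    """Flag each fit index as acceptable (True) or not (False).

    RMSEA is acceptable below 0.08; every other index passed in is
    treated as acceptable above 0.90.
    """
    results = {}
    for name, value in indices.items():
        if name == "RMSEA":
            results[name] = value < 0.08
        else:
            results[name] = value > 0.90
    return results


# Fit indices reported for the Amharic WHOQOL-OLD in this article
print(check_fit({"RMSEA": 0.047, "CFI": 0.975, "GFI": 0.867, "NFI": 0.917}))
```

Applied to these values, only the GFI falls short of its cut-off.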
The CFA also took into account both convergent and divergent validity. Convergent validity was evaluated using the factor loadings, average variance extracted (AVE), and composite reliability (CR). Convergent validity was considered good if the item-total correlations and factor loadings exceeded 0.50 and the inter-item correlation values exceeded 0.30.43
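The AVE and composite reliability (CR) values were calculated as:

$$\mathrm{AVE} = \frac{\sum_{i=1}^{n} L_i^{2}}{n}, \qquad \mathrm{CR} = \frac{\left(\sum_{i=1}^{n} L_i\right)^{2}}{\left(\sum_{i=1}^{n} L_i\right)^{2} + \sum_{i=1}^{n} e_i}$$

where $L_i$ is the factor loading of the $i$th item of a construct, $n$ is the number of item indicators for the construct, and $e_i$ is the error variance term of the $i$th item.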
The values of AVE of 0.5 or more and composite reliability (CR) of 0.7 or higher were used to see if the items logged under each facet/domain were estimating the same concept.43
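A minimal Python sketch of these two quantities follows. It uses the commonly cited definitions (AVE as the mean of the squared standardized loadings; CR as the squared sum of loadings divided by that squared sum plus the summed error variances) and assumes, when error variances are not supplied, that each equals one minus the squared loading. The function names and the example loadings are hypothetical.

```python
def average_variance_extracted(loadings):
    """Mean of the squared standardized factor loadings of one construct."""
    return sum(l * l for l in loadings) / len(loadings)


def composite_reliability(loadings, error_variances=None):
    """Composite reliability of one construct.

    Assumption: if error variances are not given, each is taken as
    1 - loading**2, which holds for standardized loadings.
    """
    if error_variances is None:
        error_variances = [1 - l * l for l in loadings]
    squared_sum = sum(loadings) ** 2
    return squared_sum / (squared_sum + sum(error_variances))


# Hypothetical standardized loadings for a four-item domain
loadings = [0.80, 0.80, 0.80, 0.80]
ave = average_variance_extracted(loadings)
cr = composite_reliability(loadings)
print(round(ave, 3), round(cr, 3))
print(ave >= 0.5 and cr >= 0.7)  # thresholds used in this study
```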
The divergent or discriminant validity of the Amharic version of the WHOQOL-OLD construct was achieved when the coefficient of cross-loading (correlation among the components) did not exceed 0.85.43 Additionally, the value of maximum shared variance (MSV) being less than the value of AVE was used as an indication of divergent validity.43
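The two discriminant validity criteria can be sketched in Python as shown below. The sketch takes MSV to be the square of a construct's largest correlation with any other construct, which is the usual definition; the function name `discriminant_validity` is hypothetical. The example uses the SAB values reported in this article (AVE of 0.698 and its correlations with the other five domains).

```python
def discriminant_validity(ave, correlations, max_corr=0.85):
    """Check one construct against the two criteria described above.

    `correlations` holds the construct's correlations with every other
    construct. MSV is computed as the largest squared correlation.
    """
    msv = max(r * r for r in correlations)
    return {
        "msv": msv,
        "msv_below_ave": msv < ave,
        "correlations_ok": all(abs(r) <= max_corr for r in correlations),
    }


# SAB: AVE and inter-domain correlations as reported in this article
print(discriminant_validity(0.698, [0.536, 0.489, 0.519, 0.529, 0.528]))
```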
Reliability
Cronbach's alpha (α) was used to measure internal consistency, and a value greater than 0.7 was taken as a benchmark.47 In addition, composite reliability (CR) was computed from the factor loadings after CFA, and a coefficient of more than 0.70 was considered satisfactory.43 Furthermore, the Pearson correlation coefficient was used to correct the reliability coefficient for the 24 items of the Amharic version of the WHOQOL-OLD scale.
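For illustration, Cronbach's alpha can be computed with the standard formula α = k/(k − 1) × (1 − Σ item variances / variance of the total score), where k is the number of items. The Python sketch below uses only the standard library; the function name `cronbach_alpha` and the example responses are hypothetical.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a scale.

    `items` is a list of k lists; each inner list holds the responses of
    all respondents to one item, in the same respondent order.
    """
    k = len(items)
    if k < 2:
        raise ValueError("at least two items are required")
    totals = [sum(values) for values in zip(*items)]  # total score per respondent
    item_variance_sum = sum(variance(column) for column in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Hypothetical responses of four respondents to two items
alpha = cronbach_alpha([[1, 2, 3, 4], [2, 1, 4, 3]])
print(round(alpha, 3), alpha > 0.7)
```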
Data quality control
Data collection questionnaires were adapted from previously validated standards. The data collectors and supervisors took two days of training on the study’s purpose and the utilization of data collection tools. Statistical data assumptions were checked following the prescribed processes.
A total of 180 community-dwelling old age people aged from 60 to 90 years participated in this study. The mean age was 69.44, with a standard deviation of 6.8. The majority of the study participants were females (61.7%) and orthodox religious followers (73.9%). More than half (53.3%) of the respondents were married and lived with their spouses, and 40% of them could not read and write (Table 1).
Content validity and acceptability
According to the experts' review, every item in the tool was socially acceptable and contained no sensitive words. Minor changes, such as word and phrase expansion and substitution of more relevant Amharic terminology and phrases, were made to make the items clearer and more accurate. Moreover, no major difficulties were encountered throughout the data collection period, and the scale was completed for each participant in 25 to 35 minutes. The response rate was 100%, with no missing items. The participants' remarks raised no significant concern about the understandability of the questions and response options. The ceiling and floor effects of each domain in the Amharic version of WHOQOL-OLD varied from 1.8 to 7.7% and from 0 to 2.9%, respectively.
Construct validity
All variables of the tool were correlated with more than 0.306 in the matrix correlation, satisfying the requirement of the presence of two or more correlated variables with more than a 0.30 coefficient. In addition, the measure of sampling adequacy, located on the diagonal of the anti-image correlation matrix of SPSS, was greater than 0.80 for each variable in the first iteration. This is commendable and does not necessitate the removal of any items. Furthermore, the Kaiser-Meyer-Olkin (KMO) measure of sampling adequacy was 0.943, and Bartlett’s test of sphericity was statistically significant (X2 = 3,915.790; n = 180; df = 276; P<0.0001). These indicate that all the 24 variables that remained in the analysis satisfied the criteria for appropriateness of factor analysis.
Similarly, the 24 variables appeared to measure five underlying components under the latent root criterion, commonly known as the Kaiser criterion (eigenvalues greater than 1.0). These components account for 78.2% of the total variance. The results were identical when a fixed six-component solution based on prior knowledge was used, and the scree plot suggested that six factors would be appropriate when considering the changes in eigenvalues. Moreover, the communality value was satisfactory for all variables, with a minimum value of 0.687. Since every communality exceeded 0.50, no variable needed to be removed.
With four items on each component, all 24 items loaded strongly, with loadings above 0.5. Specifically, Factor 1 was loaded with the four PPF items, whereas the four autonomy (AUT) items loaded onto Factor 2. The factor loadings ranged from 0.649 to 0.846. Furthermore, there were no instances of cross-loading between the components.
Confirmatory factor analysis (CFA), as an extension of the EFA, was used to see whether the results fit the postulated measurement model. The results of the first- and second-order CFA showed that all WHOQOL-OLD facets are adequately represented by their linked items, with substantial standardized loadings above 0.5 (Figure 2). When the goodness-of-fit parameters of both models were compared using standard structural equation modelling (SEM) procedures, it was clear that adding the second-order common factor did not influence the model fit. The fit indices displayed an acceptable fit, except for the goodness-of-fit index (GFI) and adjusted goodness-of-fit index (AGFI), which were slightly below 0.90 (Table 2).

The CFA also took into account both convergent and divergent validity. The scaling analysis revealed that almost all of the items had good correlations with their respective sub-scales (r≥0.65), indicating that the instrument has strong convergent validity. Additionally, the findings confirmed that all item loadings on their own factor were greater than 0.800, which is required for convergent validity.
Furthermore, the AVE and composite reliability (CR) values for each construct of the Amharic version of WHOQOL-OLD were more than 0.5 and 0.7, respectively. The values for the total score of the tool were 0.68 and 0.92, respectively, both above the acceptable thresholds. The AVE estimates ranged from 69.8% for SAB to 77.6% for PPF activities. Thus, all constructs exceed the 50% rule of thumb, indicating that items measuring the same concept load onto one domain. The AVE values are also larger than the MSV values, which is important for divergent or discriminant validity. Additionally, the correlation coefficients calculated in IBM SPSS AMOS among the six components of the model do not exceed 0.85. As a result, we conclude that the Amharic version of the WHOQOL-OLD has attained divergent or discriminant validity (Table 3).
Reliability
The Cronbach’s Alpha (α) values of the Amharic version of WHOQOL-OLD were above 0.90, varying from 0.902 for SAB to 0.932 for PPF activities. The total scale has a Cronbach’s alpha (α) value of 0.963. Meanwhile, Cronbach’s alpha coefficient of each domain as well as the total scale did not increase when each item was deleted, indicating that all had constructive contributions to their facets as well as the total scale (Table 4).
In addition, the Pearson correlation revealed high correlation coefficients between items and their theorized domains (inter-item relations) and the six domains themselves as well (Table 5).
The correlation coefficients between items and their postulated domains were substantially higher than those with the other domains. Furthermore, the domains themselves were moderately correlated with each other. The lowest correlation was observed between SAB and PPF activities, with a correlation coefficient of 0.489. The highest correlations were observed between SOP and DAD and between INT and DAD, both with a correlation coefficient of 0.744. Additionally, all of the domains were highly correlated with the total QOL score, with SAB and INT having the lowest (0.726) and highest (0.867) correlation coefficients with the overall QOL score, respectively (Table 6).
| Domains/facets | Sensory abilities (SAB) | Autonomy (AUT) | Past, present, and future activities (PPF) | Social Participation (SOP) | Death and dying (DAD) | Intimacy (INT) | Overall score |
|---|---|---|---|---|---|---|---|
| Sensory abilities (SAB) | 1 | | | | | | 0.726** |
| Autonomy (AUT) | 0.536** | 1 | | | | | 0.780** |
| Past, present, and future activities (PPF) | 0.489** | 0.588** | 1 | | | | 0.825** |
| Social Participation (SOP) | 0.519** | 0.554** | 0.643** | 1 | | | 0.850** |
| Death and dying (DAD) | 0.529** | 0.590** | 0.617** | 0.744** | 1 | | 0.861** |
| Intimacy (INT) | 0.528** | 0.599** | 0.673** | 0.698** | 0.744** | 1 | 0.867** |
This is the first study to examine the psychometric properties of the WHOQOL-OLD in a representative sample of the Ethiopian population aged 60 years and older. The results revealed that all items in the Amharic version of the WHOQOL-OLD were simple to understand and respond to, indicating that the scale is practicable. Similar findings were reported from psychometric studies in Korea42 and Iran.48 In addition, all of the domain scores and the overall score showed less than 15.0% ceiling and floor effects, which is acceptable for all subscales.41 This indicates that the Amharic version of WHOQOL-OLD had no significant floor and ceiling effects, supporting its discriminant ability. This is consistent with the other cultural studies conducted in Korea42 and Iran.48
In terms of content validity, this study yielded statistically significant item-facet correlation coefficients that are comparable to those found in China.37 Moreover, the results of CFA for a six-factor model indicated acceptable construct validity that best fit the study data and was congruent with the a priori factor structure reported for the original scale13,14 and in the validation studies from Vietnam,17 Korea,42 Iran,48 and the Netherlands.15
Our analysis also revealed the psychometric qualities of the Amharic version of the WHOQOL-OLD, such as RMSEA of 0.047, CFI of 0.975, GFI of 0.867, and NFI of 0.917. These are comparable to, if not better than, those reported in the worldwide WHOQOL-OLD field research14 and those of other country versions in the Netherlands,15 Vietnam,17 Korea,42 and Iran.48
The CFA-based fit indices in this study are also acceptable as measures of divergent validity, which is a subtype of construct validity.43 There was no evidence of scaling error, as the tool’s items discriminate significantly between their own and other domains, demonstrating divergent validity.
Furthermore, all corrected item-total correlations and factor loadings based on the six-factor CFA model appear higher than 0.30, which is consistent with a study from Vietnam.17
The Cronbach's alpha values in the current study demonstrated high reliability coefficients and item-scale and inter-item correlations for the total scale and the subdomains of the Amharic version of WHOQOL-OLD. The values are higher than those of prior research conducted in Vietnam,17 Korea,42 and Iran.48 This could be because of socio-cultural differences between older people residing in different countries. There could also be a chance of reporting bias based on respondents' willingness and ability to provide accurate responses, especially given the length of the interview.
To our knowledge, this is the first study to adapt and validate the WHOQOL-OLD tool in Ethiopia. A strength of this study is that the data collection and the validation involved both experts and community-dwelling old age people, which could have decreased some bias. Data collection was conducted by experienced health extension workers and nurses.
Despite these strengths, this research has a few limitations. The primary weakness is the self-reported nature of the tool, which can lead to under- or overrepresentation of results. Second, it was conducted among community-dwelling old age people in urban locations; as a result, the findings may not apply to those living in rural or institutional settings. Third, test-retest reliability and sensitivity to change of the instrument could not be tested due to the study's cross-sectional design.
The current study found that the translated Amharic version of the WHOQOL-OLD tool showed robust internal consistency and construct validity. The instrument can be utilized in routine care provision activities among community-dwelling old age people in Bahir Dar, Northwestern Ethiopia. Other social care-providing organizations can also use the Amharic version of WHOQOL-OLD to estimate the impact that their policies, services, or targeted interventions might have on old age people. However, since Ethiopia is a country of socio-cultural diversity, more research on multiethnic and multi-cultural issues is required.
This research was conducted as part of a Ph.D. dissertation that received ethical approval from Bahir Dar University (R.N./IRB/003/2021). In addition, participation was entirely voluntary, and every participant gave informed consent.
Muhye Ahmed planned the research, analyzed the data, and wrote the paper. Fentahun Netsanet was involved in the design, data analysis, manuscript preparation, and critical evaluation of the study. Both authors read and approved the final manuscript.
Dryad: Data from: Validation of Quality-of-Life assessment tool for Ethiopian old age people. https://doi.org/10.5061/dryad.zkh1893dq. 49
This project contains the following underlying data:
- The study questionnaire.49
This project contains the following extended data:
- The study questionnaire.49
- STROBE checklist for quality of life as a cross-sectional study.49
Data are available under the terms of the CC0 1.0 Universal (CC0 1.0) Public Domain Dedication license.
We would like to thank Bahir Dar University for allowing us to perform this research. Our sincere thanks go to all of our study participants and experts who helped with all of the translations, assessed the semantic equivalence, and assessed the instruments for face and content validity.
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Epidemiology
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Gerontopsychology, Social Gerontology, Clinical Psychology