Dataset: A consolidated and harmonised Verbal Autopsy dataset from Health and Demographic Surveillance Sites in South Africa

Eilidh Cowan; Lucia D'Ambruoso; Jessica Price; Edward Fottrell; Kobus Herbst

doi:10.12688/f1000research.55377.1

Data Note

[version 1; peer review: 1 approved, 2 approved with reservations]

Eilidh Cowan¹,², Lucia D'Ambruoso²⁻⁵, Jessica Price⁴, Edward Fottrell⁶, Kobus Herbst⁷,⁸

PUBLISHED 19 May 2023

Author details

¹ School of Geosciences, University of Edinburgh, Edinburgh, UK
² Aberdeen Centre of Health Data Science (ACHDS), Institute of Applied Health Sciences, School of Medicine, Medical Sciences and Nutrition, University of Aberdeen, Aberdeen, UK
³ Department of Epidemiology and Global Health, Umea University, Umea, Sweden
⁴ MRC/Wits Rural Public Health and Health Transitions Research Unit (Agincourt), School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
⁵ National Health Service, Grampian, UK
⁶ Institute for Global Health, University College London, London, UK
⁷ Africa Health Research Institute, Johannesburg, South Africa
⁸ DSI-MRC South African Population Research Infrastructure Network (SAPRIN), Johannesburg, South Africa

Eilidh Cowan
Roles: Data Curation, Writing – Original Draft Preparation

Lucia D'Ambruoso
Roles: Conceptualization, Writing – Original Draft Preparation

Jessica Price
Roles: Writing – Review & Editing

Edward Fottrell
Roles: Writing – Review & Editing

Kobus Herbst
Roles: Data Curation, Supervision, Writing – Review & Editing

This article is included in the Health Services gateway.

Abstract

This data note describes the development of a Verbal Autopsy (VA) dataset produced with the South African Population Research Infrastructure Network (SAPRIN), drawing on datasets from health and socio-demographic surveillance sites (HDSS) covering a population of over 250,000 in two rural provinces in South Africa for the period 2012–2019. The purpose of the dataset was to refine an analytical tool within VA that provides unique information on care seeking and utilisation at and around the time of death, complementary to that on medical cause of death. For each individual, the dataset includes demographic data, probable cause of death, and data on care seeking and utilisation at or around the time of death, drawn from longitudinal population cohorts. The purpose of this publication is to describe both the dataset and the methods used to format and process the data, for other researchers who may be interested in similar data. The data described in this paper are available on request from the respective HDSS repositories.

Keywords

South Africa; Verbal Autopsy; Cause of death; Circumstances of Mortality

Corresponding author: Eilidh Cowan

Competing interests: No competing interests were disclosed.

Grant information: Conceptualisation of COMCAT was supported through a parent study funded by the Joint Health Systems Research Initiative from Department for International Development (DFID)/Medical Research Council (MRC)/Wellcome Trust/Economic and Social Research Council (ESRC) (MR/P014844/1). Support was also provided through the UKRI Covid-19 Extension Allocation Fund (RG15639-15) and by the University of Aberdeen and the Scottish Funding Council (SFC) (SF10206-45).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2023 Cowan E et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Cowan E, D'Ambruoso L, Price J et al. Dataset: A consolidated and harmonised Verbal Autopsy dataset from Health and Demographic Surveillance Sites in South Africa [version 1; peer review: 1 approved, 2 approved with reservations]. F1000Research 2023, 12:520 (https://doi.org/10.12688/f1000research.55377.1) First published: 19 May 2023, 12:520 (https://doi.org/10.12688/f1000research.55377.1) Latest published: 19 May 2023, 12:520 (https://doi.org/10.12688/f1000research.55377.1)

Introduction

Every year, the medical causes of approximately 30 million deaths, half of all deaths worldwide, are not formally registered¹. These deaths occur predominantly in low- and middle-income countries where there is a lack of complete and functioning civil registration and vital statistics (CRVS) systems². Verbal autopsy (VA) is currently the only realistic alternative to medical certification of deaths in settings where CRVS is incomplete or absent. VA is a pragmatic survey-based method in which trained fieldworkers gather information from final caregivers on signs and symptoms of the deceased prior to death. VA data are then interpreted, by physicians or computer models, to determine probable cause(s) of death³. The method is used to quantify levels and causes of death in otherwise unregistered populations. The World Health Organization (WHO) leads the development of international standards for VA.

This data note describes the development of a Verbal Autopsy dataset produced with the South African Population Research Infrastructure Network (SAPRIN), drawing on datasets from health and socio-demographic surveillance sites (HDSS). The purpose of the dataset was to refine an analytical tool within VA that provides unique information on care seeking and utilisation at and around the time of death, complementary to that on medical cause of death.

Acknowledging the social determinants of health as the fundamental causes of avoidable mortality and health inequalities, we sought to develop a systematic and scalable categorization system for circumstantial drivers of deaths⁴. We previously devised an approach within VA tools called Circumstances of Mortality Categories (COMCAT)⁵. The system is designed for large-scale population assessment of the burden of disease, inclusive of the needs and behaviours of individuals and the responsiveness of the health system towards these⁶. For example, a woman whose cause of death is assigned as obstetric haemorrhage might have died at home, while another woman with the same cause of death might have been inadequately managed despite reaching a facility. Measuring these scenarios at population level will provide important information for health services and for reducing avoidable mortality.

The development of the COMCAT model began with the supplementation of existing interview questions on medical causes of death, to include input questions on care seeking and utilisation at and around the time of death, which were taken up in the 2012 WHO VA standard⁷. From this, models were developed within existing automated VA data interpretation tools to assign likelihoods to circumstantial categories for each death on: emergencies, recognition of illness severity, use of traditional medicine, accessing care, and perceptions of poor quality of care⁵.

This paper describes the collation and formatting of a mortality dataset from Health and Demographic Surveillance Sites (HDSS) in South Africa for use in refining the COMCAT system. HDSS are geographically defined populations that undergo continuous demographic monitoring. All vital events, such as births and deaths, are regularly recorded to track population change and highlight health and social care priorities⁸. The dataset harmonises and links routinely collected VA data from the South African Population Research Infrastructure Network (SAPRIN). SAPRIN is a national research infrastructure funded by the National Department of Science and Innovation that aims to harmonise and integrate South Africa’s HDSSs.

Methods

Each HDSS uses its own VA questionnaire which, since 2012, has been broadly based on the WHO-2012 or WHO-2016 standard. VA data are collected electronically at household level by trained fieldworkers, who select responses from a specified set of answers, with logical skips and validation rules consistent with the WHO standard. HDSS team supervisors carry out data quality control on all captured questionnaires using either REDCap or Survey Solutions. We obtained all VA data collected on deaths that occurred from 2012 onwards from the three HDSSs included in the SAPRIN network. This period was chosen to increase the likelihood that the COMCAT input data, which have been part of the WHO standard since 2012, would be included.

Because each HDSS has a unique VA questionnaire, we aligned each site's questions and potential responses to the WHO-2016 standard. As the VA interpretation tools are based on the WHO standard, this alignment ensured that the required indicators were available both for a VA data formatting package (pyCrossVA) and for one of the automated VA interpretation tools used to generate probable cause of death. We developed a common data specification that retained as much information as possible while still allowing us to use one of the VA interpretation tools. VA interpretation tools use mathematical formulae, such as Bayes' theorem, to calculate the probability of each cause of death from a prior set of probabilities relating to the input indicators from the VA questionnaire⁹.
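As an illustration of the Bayesian reasoning described above, the following minimal Python sketch updates a set of prior cause probabilities using the indicators reported in an interview. It is not the InterVA-5 algorithm: the function name, the cause and indicator labels, the probabilities, and the simplifying assumption that indicators are conditionally independent are all hypothetical and chosen only to show the arithmetic.

```python
def posterior_causes(priors, cond_probs, reported):
    """Apply Bayes' theorem to obtain P(cause | reported indicators).

    priors:     {cause: P(cause)}
    cond_probs: {cause: {indicator: P(indicator | cause)}}
    reported:   list of indicators answered "yes" in the interview

    Assumption (for illustration only): indicators are treated as
    conditionally independent given the cause.
    """
    unnormalised = {}
    for cause, prior in priors.items():
        likelihood = 1.0
        for indicator in reported:
            likelihood *= cond_probs[cause][indicator]
        unnormalised[cause] = prior * likelihood
    total = sum(unnormalised.values())
    # Normalise so that the posterior probabilities sum to 1.
    return {cause: value / total for cause, value in unnormalised.items()}


# Hypothetical priors and conditional probabilities, not taken from any VA tool.
priors = {"cause_a": 0.5, "cause_b": 0.5}
cond_probs = {
    "cause_a": {"fever": 0.8, "cough": 0.5},
    "cause_b": {"fever": 0.2, "cough": 0.5},
}
posterior = posterior_causes(priors, cond_probs, ["fever", "cough"])
print(posterior)
```

With these made-up inputs, "fever" is four times more likely under cause_a than under cause_b while "cough" does not discriminate between them, so the posterior shifts towards cause_a.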

After formation of the data specification, data were examined, as detailed above, to ensure the dataset included the indicators that a VA interpretation tool requires to output both a reliable probable cause of death and COMCATs. The different sites' questionnaires included a variety of indicators additional to the WHO standard. These indicators were not included in the consolidated dataset as they are not required by the automated VA tool. However, the individual case ID remained consistent throughout, so these additional indicators could be linked back from the original datasets, if of interest, after the data had been processed by the VA interpretation tool. At this stage, we excluded one of the HDSSs, DIMAMO, as it did not have relevant data on the COMCAT input indicators. Data were then recoded and renamed in line with the newly developed data specification. This was done in pyCrossVA, a Python package (Python Programming Language, RRID:SCR_008394) developed to convert VA data from the WHO standard into the format required by the chosen VA interpretation tool. We then processed the data using the InterVA-5.1 interpretation tool in R 3.61 (R Project for Statistical Computing, RRID:SCR_001905). InterVA-5 was selected because it is currently the only tool that outputs COMCATs, and refining these was the objective for the use of the data.
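The recoding and renaming step can be sketched in plain Python as follows. This is not the pyCrossVA API and not the authors' data specification: the site labels, column names, response codes and the `harmonise` helper are hypothetical, and serve only to show how site-specific questionnaires might be mapped onto one common specification while the case ID is carried through for later linkage.

```python
# Hypothetical mapping from each site's column names to common names.
COLUMN_MAP = {
    "site_a": {"CaseID": "case_id", "Sex": "sex", "TradMed": "used_traditional_medicine"},
    "site_b": {"id": "case_id", "gender": "sex", "trad_healer": "used_traditional_medicine"},
}

# Hypothetical recoding of raw responses to common categories, per variable.
VALUE_MAP = {
    "sex": {"1": "male", "2": "female", "m": "male", "f": "female"},
    "used_traditional_medicine": {"1": "yes", "0": "no", "y": "yes", "n": "no"},
}


def harmonise(record, site):
    """Rename and recode one raw record to the common specification.

    Columns that are not in the site's mapping (for example, site-specific
    additional indicators) are dropped; the case ID passes through unchanged
    so that dropped indicators can be linked back later.
    """
    out = {}
    for source_name, common_name in COLUMN_MAP[site].items():
        recode = VALUE_MAP.get(common_name)
        if recode is None:
            out[common_name] = record.get(source_name)
        else:
            raw = str(record.get(source_name, "")).strip().lower()
            out[common_name] = recode.get(raw, "missing")
    return out


row_a = harmonise({"CaseID": "A1", "Sex": "M", "TradMed": "1", "extra": "x"}, "site_a")
row_b = harmonise({"id": "B7", "gender": "2", "trad_healer": "n"}, "site_b")
print(row_a)
print(row_b)
```

Keeping the mappings as data rather than code makes it straightforward to review them against the WHO-2016 standard and to add a further site without changing the function.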

At all stages, data were processed separately for each HDSS. After the data had been processed through InterVA-5.1, we added a variable for HDSS name so that records could be differentiated by location, and then appended the two datasets. The final dataset included records of 7980 deaths for the period 2012–19 (5924 from Agincourt and 2056 from AHRI HDSS) and consisted of 25 variables detailing basic demographics, probable cause of death, COMCAT and COMCAT input indicators.
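A minimal sketch of the tag-and-append step is shown below. The record contents are placeholders and the field name `hdss` is an assumption; only the site names and the record counts (5924 and 2056) come from the text above.

```python
from collections import Counter


def tag_and_append(datasets):
    """Add the HDSS name to every record, then append all sites into one list."""
    combined = []
    for hdss_name, records in datasets.items():
        for record in records:
            # Copy the record and add the site label (hypothetical field name).
            combined.append({**record, "hdss": hdss_name})
    return combined


# Placeholder records standing in for the per-site InterVA-5.1 output.
datasets = {
    "Agincourt": [{"case_id": f"AG{i}"} for i in range(5924)],
    "AHRI": [{"case_id": f"AH{i}"} for i in range(2056)],
}
combined = tag_and_append(datasets)
site_counts = dict(Counter(record["hdss"] for record in combined))
print(len(combined), site_counts)
```

Tagging before appending means the site of origin survives in the consolidated file even if case IDs from different sites happen to look alike.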

The data were subject to consistency checks in InterVA-5.1. These checks are carried out for each individual death before probable causes of death are determined, and InterVA-5.1 adjusts possible errors using the answers to other questions. The checks generate warning messages that researchers can interpret. For example, a record in which the deceased is identified as both male and pregnant will generate a warning message and, depending on the other information available, one of these inputs (i.e. male or pregnant) will be deemed an error and corrected by InterVA-5.1. In addition, we excluded records of those aged over 100 years because such data are unreliable given the average life expectancy in the region.
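The kind of screening described above can be sketched as follows. This does not reproduce the InterVA-5.1 consistency checks, which are internal to that tool and also correct the inputs; the field names, the `screen_records` helper and the warning text are hypothetical. The sketch only flags the male-and-pregnant inconsistency and applies the over-100-years exclusion.

```python
MAX_AGE_YEARS = 100  # records of those aged over 100 years are excluded


def screen_records(records):
    """Drop records aged over 100 years and flag male-and-pregnant records.

    Returns (kept_records, warnings). Flagged records are kept; deciding
    which input is the error is left to the interpretation tool.
    """
    kept, warnings = [], []
    for record in records:
        if record["age_years"] > MAX_AGE_YEARS:
            continue  # excluded as unreliable
        if record["sex"] == "male" and record.get("pregnant") == "yes":
            warnings.append(f"{record['case_id']}: male recorded as pregnant")
        kept.append(record)
    return kept, warnings


# Hypothetical records for illustration.
records = [
    {"case_id": "B1", "age_years": 101, "sex": "female"},
    {"case_id": "B2", "age_years": 34, "sex": "male", "pregnant": "yes"},
    {"case_id": "B3", "age_years": 100, "sex": "female", "pregnant": "no"},
]
kept, warnings = screen_records(records)
print([r["case_id"] for r in kept], warnings)
```

Returning warnings alongside the kept records, rather than silently correcting them, keeps the decision about which input is wrong visible to the researcher.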

Software availability

The software packages used to format and process the VA data are all open source and are available from https://github.com/verbal-autopsy-software. These packages also contain functions to analyse VA data.

Data availability statement

The data described in this study cannot be made available to the public in an open repository due to the sensitive nature of the data. However, the data are available on request from SAPRIN or the respective HDSS repositories. Requests for the data can be made at the following link: https://saprindata.samrc.ac.za/index.php/catalog/33.

Acknowledgements

The authors acknowledge the South African Population Research Infrastructure Network (SAPRIN), the Africa Health Research Institute (AHRI) and the MRC/Wits Rural Public Health and Health Transitions Research Unit (Agincourt) for their support. The authors gratefully acknowledge Chodwizadziwa Kabudula, Daniel Mahlangu, Dickman Gareta, Siyabonga Nxumalo and Joseph Tlouyamma from the Agincourt, AHRI and DIMAMO HDSSs, who provided support with data, and the individuals who supported the development and maintenance of the OpenVA software.

References

1. Setel PW, Macfarlane SB, Szreter S, et al.: A scandal of invisibility: making everyone count by counting everyone. Lancet. 2007; 370(9598): 1569–1577.
2. Marinda E, Simbayi L, Zuma K, et al.: Towards achieving the 90-90-90 HIV targets: Results from the south African 2017 national HIV survey. BMC Public Health. 2020; 20(1): 1375.
3. Basera TJ, Schmitz K, Price J, et al.: Community surveillance and response to maternal and child deaths in low- and middle-income countries: A scoping review. PLoS One. 2021; 16(3): e0248143.
4. D’Ambruoso L, Byass P, Qomariyah SN, et al.: A lost cause? Extending verbal autopsy to investigate biomedical and socio-cultural causes of maternal death in Burkina Faso and Indonesia. Soc Sci Med. 2010; 71(10): 1728–38.
5. Hussain-Alkhateeb L, D'Ambruoso L, Tollman S, et al.: Enhancing the value of mortality data for health systems: adding Circumstances Of Mortality CATegories (COMCATs) to deaths investigated by verbal autopsy. Glob Health Action. 2019; 12(1): 1680068.
6. D’Ambruoso L: Care in obstetric emergencies: quality of care, access to care and participation in health in rural Indonesia. PhD Thesis. University of Aberdeen, Aberdeen, 2011.
7. D’Ambruoso L, Kahn K, Wagner RG, et al.: Moving from medical to health systems classifications of deaths: extending verbal autopsy to collect information on the circumstances of mortality. Glob Health Res Policy. 2016; 1(1): 2.
8. Kahn K, Tollman SM, Collinson MA, et al.: Research into health, population and social transitions in rural South Africa: Data and methods of the Agincourt health and demographic surveillance system. Scand J Public Health Suppl. 2007; 69: 8–20.
9. InterVA - software for verbal autopsy. [Accessed: 09-Jul-2021].

Open Peer Review

Reviewer Report 13 Sep 2024

Tom Smith, Swiss Tropical and Public Health Institute, Basel, Switzerland

Approved with Reservations

https://doi.org/10.5256/f1000research.58945.r174907

The article provides a well written description of the rationale for the dataset. If anything, the importance of this is understated, since analysis of such dataset is crucial for understanding how to reduce mortality across the world.

Unfortunately the authors do not actually provide the data, stating instead that "the data are available to be requested from SAPRIN or the respective HDSS repositories", a statement which the reader must take on trust.

I noted a few minor typos:

(HDSS should read (HDSS).

"The purpose of the data set was ... " should be "... is... " I think.

Citations should be provided for the software used (e.g. Red Cap)

Is the rationale for creating the dataset(s) clearly described? Yes
Are the protocols appropriate and is the work technically sound? Yes
Are sufficient details of methods and materials provided to allow replication by others? Partly
Are the datasets clearly presented in a useable and accessible format? No

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Epidemiology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Reviewer Report 06 Feb 2024

Bruno Masquelier, Université Catholique de Louvain, Louvain, Belgium

Approved with Reservations

https://doi.org/10.5256/f1000research.58945.r237763

I read this short article with interest, as it deals with an important subject. I agree with the authors on the importance of complementing traditional verbal autopsies with additional, standardized information on the circumstances surrounding death, regarding the social determinants of health and care-seeking. The COMCAT tool is a significant contribution in this respect. It is also important to acknowledge the heavy work done on standardizing and pooling together datasets from different HDSS. However, I don't quite see the point of this article specifically, as it doesn't provide results, it doesn't document a database in the public domain, and it doesn't specify how the consolidated database can be acquired, and under what conditions. The article looks more like a deliverable for a project, but the contribution to scientific literature is not obvious to me at this stage. I would invite the authors to provide a sample anonymized dataset, as suggested by another reviewer, or to detail the procedures required to access the dataset. More info on the variations in data quality or response rates across HDSS sites concerning the COMCAT questions would also strengthen the paper.

Minor comments
- In the introduction, the authors mention that the cause of death of 30 million deaths is not formally recorded in a CRVS system. But they base this assertion on a 2007 article by Setel and colleagues. I imagine the situation has changed over the past 15 years, and more recent estimates are available.
- In the "methods" section, the authors mention additional indicators and indicate that they could be associated with this database based on individual IDs. Could you give some examples?
- Small typo: "al captured questionnaires"

Is the rationale for creating the dataset(s) clearly described? Yes
Are the protocols appropriate and is the work technically sound? Yes
Are sufficient details of methods and materials provided to allow replication by others? Partly
Are the datasets clearly presented in a useable and accessible format? No

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Demography, child and adult mortality estimation

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Reviewer Report 21 Jun 2023

Tathagata Bhattacharjee, Department of Population Health, London School of Hygiene and Tropical Medicine, London, UK

Approved

https://doi.org/10.5256/f1000research.58945.r174908

The paper brings out an important aspect for the need to prepare datasets for VA analysis. The method is crisply explained. However, sharing a sample anonymized dataset would have been more appreciated along with some sample code implementations for more clarity on the implementation which would help researchers to replicate or derive guidance on the processes in preparation of such datasets.

Is the rationale for creating the dataset(s) clearly described? Yes
Are the protocols appropriate and is the work technically sound? Yes
Are sufficient details of methods and materials provided to allow replication by others? Yes
Are the datasets clearly presented in a useable and accessible format? Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: data integration, ETL, record linkage, data standardization, OMOP /OHDSI

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
