Skip to content
ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Article
Revised

Limitations to current methods to estimate cause of death: a validation study of a verbal autopsy model

[version 3; peer review: 2 approved, 1 approved with reservations]
PUBLISHED 05 May 2021
Author details Author details

Abstract

Background: Accurate information on causes of death (CoD) is essential to estimate burden of disease, track global progress, prioritize cost-effective interventions, and inform policies to reduce mortality. In low-income settings, where a significant proportion of deaths take place at home or in poorly-resourced peripheral health facilities, data on CoD often relies on verbal autopsies (VAs). Validations of VAs have been performed against clinical diagnosis, but never before against an acceptable gold standard: the complete diagnostic autopsy (CDA).
Methods: We have validated a computer-coded verbal autopsy method –the InterVA- using individual and population metrics to determine CoD against the CDA, in 316 deceased patients of different age groups who died in a tertiary-level hospital in Maputo, Mozambique between 2013 and 2015.  
Results: We found a low agreement of the model across all age groups at the individual (kappa statistic ranging from -0.030 to 0.232, lowest in stillbirths and highest in adults) and population levels (chance-corrected cause-specific mortality fraction accuracy ranging from -1.00 to 0.62, lowest in stillbirths, highest in children). The sensitivity in identifying infectious diseases was low (0% for tuberculosis, diarrhea, and disseminated infections, 32% for HIV-related infections, 33% for malaria and 36% for pneumonia). Of maternal deaths, 26 were assigned to eclampsia but only four patients actually died of eclampsia.
Conclusions: These findings do not lead to building confidence in current estimates of CoD. They also call to the need to implement autopsy methods where they may be feasible, and to improve the quality and performance of current VA techniques.

Keywords

Validation, verbal autopsy, cause of death, complete diagnostic autopsy, Mozambique

Revised Amendments from Version 2

The affiliation of Dr. Pedro L Alonso has been changed due to internal regulations of the institution that appeared in the previous version.

To read any peer review reports and author responses for this article, follow the "read" links in the Open Peer Review table.

Introduction

Global and disease-specific health statistics are regularly published and constitute an essential tool to define priorities and goals, identify inequalities, and track progress, including the achievement of global targets such as the health-related Sustainable Development Goals (SDGs). Insufficient confidence in the accuracy of estimates, particularly in those related to cause of death (CoD) has been indicated as a constraint to reduce mortality globally1,2. This lack of precise information on CoD in many low-income settings is largely explained by the significant number of deaths that occur either at home or at poorly resourced health facilities, with significant limitations of both qualified personnel as well as accurate diagnostic methods; but also to the very limited number of diagnostic autopsies performed partly due to the massive shortfall of trained pathologists1,2. As such, CoD in low-income settings, continues to rely on estimates based on clinical records and verbal autopsies (VAs).

Clinical errors, which are common even in well-equipped hospitals, are more frequent in resource-restricted settings35. On the other hand, VAs remain the most practical and commonly used approach to estimate CoD at the population level in low-income settings6. A verbal autopsy consists of a structured interview to witnesses of the death subsequently interpreted and coded by physicians or using computerized methods. The method has shown to provide inconsistent results over time and place7. In addition, its diagnostic accuracy depends on the CoD, being high when the disease has a characteristic and well defined set of signs and symptoms, but much lower for conditions with unspecific symptoms, notably, malaria and acute respiratory infection in children, or meningitis in all age groups6. This results in frequent misclassifications of the CoD, which in turn leads to inaccurate cause-specific mortality rates6. Computerized methods of interpretation of the VA questionnaire have been developed to overcome some of the limitations of the VA technique. These methods are based either on algorithms derived from deaths with a medically confirmed CoD, or on probabilistic analyses8.

Computerized VA methods have been validated against physician-certified VA and clinical records911. However, neither computer-coded VA, nor physician-certified VA techniques have been validated against the complete diagnostic autopsy (CDA), the true gold standard for CoD determination. We present herein the results of a validation study of a commonly used computer-coded VA method, the InterVA (Interpreting Verbal Autopsy) model against the CDA in a series of deaths occurring in Maputo, Mozambique.

Methods

Study design and setting

The study included 316 CDA performed to patients who died between 2013 and 2015 at the Maputo Central Hospital, a 1500-bed institution that serves as the referral center for other hospitals in Mozambique. All the patients included in this analysis fulfilled the following criteria: (1) a CDA requested by the clinician as part of the medical evaluation of the patient and (2) informed consent to perform the autopsy given by the relatives. The following exclusion criterion was established: death of traumatic origin. In order to select only two cases per day from among the daily CDA requests received at the department of pathology (between 5 and 12 per day) without introducing selection biases, the two patients with death recorded before and closest to the time of 8:00 A.M. were included in the study. All maternal deaths that occurred in the study period were included.

From the 316 cases, 18 (6%) were stillbirths, 41 (13%) were neonates, 54 (17%) were children 1 month-15 years of age, 91 (29%) were maternal deaths and 112 (35%) were other adults. Written informed consent to perform the autopsy was obtained from the relatives of the deceased patients. In Maputo province malaria transmission is reported to be low (3%) and HIV prevalence is high (22%)12,13.

This study received approval by the Clinical Research Ethics Committee of the Hospital Clinic of Barcelona, Spain (File 2013/8677) and the National Bioethics Committee of Mozambique (Ref. 342/CNBS/13).

Determination of the cause of death by the complete diagnostic autopsy

The methodology for CoD determination by the CDA has been described in detail elsewhere1418. Briefly, a panel of experts evaluated the CDA macroscopic, microscopic and microbiologic data, as well as the clinical information and assign the CoD. All morbid conditions directly leading to death, any underlying and any other significant conditions possibly contributing to death were codified according to the international classification of diseases, tenth revision (ICD-10, ICD-10 MM for maternal deaths)19. When more than one severe diagnosis was identified, the disease most likely causing the death was considered the final diagnosis1417.

Cause of death assignment by the Verbal Autopsy model

We used the InterVA probabilistic model because it is one of the most commonly implemented VA tools20 and has shown a generally good level of agreement with the physician-coded verbal autopsy approach; it has also the advantage of being a completely reproducible method, reliable and standardized to interpretation21,22 reducing subjectivity. The InterVA method is based on the Bayes’ theorem and calculates the probability of a set of CoD given the presence of indicators reported in VA interviews23,24. We used version 4.04 of the model (InterVA-4) since the most recent version (InterVA-5) had not been released yet. In this analysis, the information feeding the model was extracted by the attending physician at the hospital from the clinical record of the deceased individual and from the obstetric record in perinatal deaths (Extended data25: Clinical and epidemiological data collection questionnaire), unified into the WHO 2012 VA standard format7, converted into the 245 input indicators of the VA model, and processed with malaria prevalence set to “low”, and HIV prevalence set to “high” using the InterVA4 package version 1.7.5 implemented in R version 3.5.0 software26. Of the 245 input indicators of the model, 43 could not be extracted from the medical records; 24 (56%) of them were secondary questions, which are not pertinent if certain events did not occur.

Validation of the model

To validate the VA model across a variety of CoD distributions, 500 cause compositions based on uninformative Dirichlet sampling were generated for each study group27. The performance of the model at the individual level was estimated comparing the CoD established by the CDA with the most probable CoD provided by the model. The Kappa statistic and the chance corrected concordance (CCC) were used as measures of the overall performance of the model (Extended data25: Table S1 and Figure S1)2830.

At the population level, cause-specific mortality fractions (CSMFs) were calculated for each CoD and method within each study group. Since the model estimates up to three CoD with associated likelihoods for each cause, all identified CoD were considered as proportional to their partial likelihoods in the rate calculations for the model. In contrast, only one CoD was considered for the CDA and consequently, the associated likelihood was assumed to be 1. The CSMF accuracy (CSMFA) and the chance-corrected CSMFA (CCCSMFA) were calculated to compare the CSMFs determined by the InterVA model with those determined by the CDA (Extended data25: Table S1 and Figure S1)28. All analyses were done in Stata version 15 (Stata Corp., College Station, TX, USA) and R version 3.5.0 (R Core Team, 2017) statistical packages.

Results

The VA model assigned one CoD in 267 (84%) cases and two CoD in 33 (10%) cases. In 16 (5%) cases the model resulted in a non-conclusive diagnosis. The average likelihood of the model in estimating the first CoD was 90% (range 89% to 99%), and for the second CoD it was 38% (range 35% to 46%) (Extended data25: Table S2). Three of the 316 cases (1%) had a non-conclusive diagnosis in the CDA.

Assignment of the CoD at the individual level compared with the CDA

In 168/316 cases (53%) the two methods agreed in the CoD. Most of the agreement was in the first CoD, while only in 8 cases the agreement was in the second CoD with a mean likelihood of 38% [95%CI: (33–43)] (Extended data25: Table S3). In 148/316 cases (47%), there was no agreement in the CoD between the two methods.

Overall, the performance of the VA method in assigning a CoD to individual deaths was low (Table 1). In stillbirths, the sensitivity of the model in identifying infections, fetal growth restriction, and intrapartum and intrauterine hypoxia was 0%. In neonates, the sensitivity was 93% for infectious CoD, while it was 0% and 25% for preterm complications and congenital malformations, respectively. In children, the sensitivity of the model in identifying an infectious disease as CoD was 83%, while it was 0% for the congenital malformations, tumors and other diseases. The sensitivity of the model in identifying maternal mortality causes was low for all conditions except for eclampsia (75%) and obstetric hemorrhage (75%). In other adults, the sensitivity of the model was highest for infectious diseases (68%) and lowest for malignant neoplasms (19%).

Table 1. Performance of the InterVA (Interpreting Verbal Autopsy) model compared to the complete diagnostic autopsy at individual-level prediction by study group and cause of death.

Cause of death (CDA)nClassification of
cases
Sensitivity
(%)
Specificity
(%)
PPV
(%)
NPV
(%)
TPTNFPFN
Stillbirths
(N=18)
Infections4014040 100 N/A78
Fetal growth
restriction
7010170 91 0 59
Intrapartum hypoxia3015030 100 N/A83
Intrauterine hypoxia2016020 100 N/A89
Congenital
malformations
001170N/A61 0 100
Non-conclusive2179150 44 10 88
Neonates
(N=41)
Infections2725212293146850
Congenital
malformations
4137032510010092
Preterm
complications
5036050100N/A88
Intrapartum
complication
31362233953395
Other diseases2039020100N/A95
Non-conclusive004100N/A100N/A100
Children
(N=54)
Infections423539783258030
Congenital
malformations
2052020100N/A96
Malignant neoplasms7047070100N/A87
Other diseases304473086094
Non-conclusive005130N/A940100
Maternal
deaths
(N=91)
Infections*391139132828754658
Abortion9082090100N/A90
Eclampsia436423175 74 12 98
Obstetric
hemorrhage
16125916475 7943 94
Other obstetric
complications
6085060100N/A93
Non-obstetric
diseases**
1646781225 89 33 85
Non-conclusive1089110 99 099
Other
adults
(N=112)
Infections805420122668628243
Malignant neoplasms1639511319997588
Other diseases16106630662692592
Non-conclusive0011020N/A980100

CDA: complete diagnostic autopsy; n: number of cases; TP: true positives; TN: true negatives; FP: false positives; FN: false negatives; PPV: positive predictive value; NPV: negative predictive value; N/A: not applicable.

* Includes all infections, both obstetric and non-obstetric.

** Non-obstetric diseases do not include infections.

Table 2 shows the measures of overall concordance between the two methods corrected for chance by study group for all CoDs. The CCC ranged between -0.093 and 0.246, and Kappa statistic ranged from -0.030 to 0.232 (lowest in stillbirths and highest in other adults). Figure 1 presents the alluvial diagrams showing the differences in the assignment of individual CoD established in the two methods by study group.

Table 2. Measures of performance of the InterVA (Interpreting Verbal Autopsy) model compared to the complete diagnostic autopsy at individual-level prediction for all causes of death by study group.

Study groupCCCKappa
Stillbirths-0.093-0.030(poor)
Neonates0.1190.142(slight)
Children0.0200.020(slight)
Maternal deaths0.1790.159(slight)
Other adults0.2460.232(fair)

CCC: Chance-corrected concordance calculated from 500 Dirichlet draws

6e451b97-2b93-4f4b-b09c-a089d424a227_figure1.gif

Figure 1. Alluvial diagrams of the differences in assignment of individual causes of death established by the Complete Diagnostic Autopsy (CDA) and InterVA (Interpreting Verbal Autopsy) model by study group.

The stacked blocks represent the causes of death (CoDs) determined by the CDA (left) and by the InterVA model (right), and their size as proportional to the cause-specific mortality fractions (CSMFs). The branches between blocks represent differences in the composition of the CoDs between the CDA and the InterVA model, being their thickness proportional to the number of cases contained in both blocks connected by the branch. Each CoD is represented by a different color, which is the same in both diagnostic methods. The color of the branches is determined by the cause of actual death (CDA). The concordant cases between the CDA and the InterVA model are represented by branches connected to blocks of the same color. In contrast, misclassified cases are shown as branches connected to blocks of different color.

Cause of death assignment of the model at the individual level among patients dying of infectious diseases

Table 3 shows the performance of the VA model in assigning CoD among cases who died of an infectious disease according to the CDA in all study groups by infection category. The sensitivity of the model in identifying an infectious disease as CoD was low for all infectious categories, being 0% for tuberculosis, diarrhea, disseminated and other infections. Figure 2 shows the alluvial diagram of the comparison of CoD assigned by both methods.

Table 3. Performance of the InterVA (Interpreting Verbal Autopsy) model at individual-level prediction among all cases who died of infectious diseases according to the complete diagnostic autopsy.

Cause of death (CDA)nClassification of casesSensitivity
(%)
Specificity
(%)
PPV
(%)
NPV
(%)
TPTNFPFN
Disseminated infections510139251099073
Pneumonia3613102542336651982
Meningitis15317341220984394
Tuberculosis7018327099096
Diarrhoea2018642098099
HIV/AIDS related5718120153932895575
Malaria621815433972998
Other infections180171318098090

CDA: complete diagnostic autopsy; n: number of cases; TP: true positives; TN: true negatives; FP: false positives; FN: false negatives; PPV: positive predictive value; NPV: negative predictive value; N/A: not applicable

Disseminated infections: bacterial sepsis of newborn (n=21), puerperal sepsis (n=6), streptococcal sepsis (n=5) and other sepsis (n=19)

HIV/AIDS related infections: candidiasis (n=1), congenital viral diseases (n=1), cryptococcosis (n=11), cytomegaloviral disease (n=7), herpes simplex infection (n=1), miliary tuberculosis (n=20), salmonella infections (n=1), pneumocystosis (n=5), pulmonary tuberculosis (n=2), toxoplasmosis (n=7) and tuberculous meningitis (n=1)

Other infections: acute pericarditis (n=2), pyelonephritis (n=2), congenital viral diseases (n=2), chorioamnionitis (n=2), GBS infection (n=2), tetanus (n=1), peritonitis (n=3), rabies (n=3) and zygomycosis (n=1)

6e451b97-2b93-4f4b-b09c-a089d424a227_figure2.gif

Figure 2. Alluvial diagrams of the differences in the individual cause of death as established by the complete diagnostic autopsy (CDA) and the InterVA (Interpreting Verbal Autopsy) model among patients who died of infectious diseases.

The stacked blocks represent the causes of death (CoDs) determined by the CDA (left) and by the InterVA model (right), and their size as proportional to the cause-specific mortality fractions (CSMFs). The branches between blocks represent differences in the composition of the CoDs between the CDA and the InterVA model, being their thickness proportional to the number of cases contained in both blocks connected by the branch. Each CoD is represented by a different color, which is the same in both diagnostic methods. The color of the branches is determined by the cause of actual death (CDA). The concordant cases between the CDA and the InterVA model are represented by branches connected to blocks of the same color. In contrast, misclassification cases are shown as branches connected to blocks of different color. Diseminated infections: bacterial sepsis of the newborn (n=21), puerperal sepsis (n=6), streptococcal sepsis (n=5) and other sepsis (n=19) HIV/AIDS related infections: candidiasis (n=1), congenital viral diseases (n=1), cryptococcosis (n=11), cytomegaloviral disease (n=7), herpes simplex infection (n=1), miliary tuberculosis (n=20), salmonella infection (n=1), pneumocystosis (n=5), respiratory tuberculosis bacteriologically and histologically confirmed (n=2), toxoplasmosis (n=7) and tuberculous meningitis (n=1) Other infections: acute pericarditis (n=2), pyelonephritis (n=2), congenital viral diseases (n=2), chorioamnionitis (n=2), GBS infection (n=2), tetanus (n=1), peritonitis (n=3), rabies (n=3) and zygomycosis (n=1) Non-infectious diseases (by the InterVA model): congenital malformations (n=1), intrapartum complication (n=2), eclampsia (n=12), obstetric haemorrhage (n=10), non-obstetric diseases (n=5)and other diseases (n=29).

Cause of death assignment of the model at the population-level, compared with the CDA

Table 4 shows the CSMFs estimated by the VA model and the CDA aggregated into broad categories of CoD. In stillbirths, the most frequent CoD assigned by the model was congenital malformation (39%); however, no case of congenital malformation was identified by the CDA. In addition, fetal growth restriction (FGR) was the most frequent CoD in stillbirths determined by the CDA (39%), but only one case was estimated as such by the model. Infectious diseases were responsible for 22% of stillbirths by the CDA, but no stillbirth was assigned to infectious diseases by the VA model. According to the VA model, no deaths were assigned to preterm complications in neonates, while these represented 12% of the neonatal deaths by the CDA. Among children, malignant neoplasms accounted for 13% of the deaths in the CDA, but no case was assigned to this CoD in the model. The model identified eclampsia as the second most prevalent cause of maternal mortality while only in four (4%) cases eclampsia was the cause of maternal death by the CDA. Complications of abortion was diagnosed in nine (10%) cases, none of them being identified by the VA method.

Table 4. Cause-Specific Mortality Fractions and associated measures of validation of the InterVA (Interpreting Verbal Autopsy) model compared with the complete diagnostic autopsy (CDA) at the population level.

GroupCause of deathInterVACDACSMF Accuracy
n*CSMF (%)nCSMF (%)UncorrectedChance
Corrected
StillbirthsInfections00.0422.20.11-1.00
Fetal growth restriction15.0738.9
Intrapartum hypoxia00.0316.7
Intrauterine hypoxia00.0211.1
Congenital malformations738.900.0
Non conclusive1056.1211.1
Overall 18 100.0 18 100.0
NeonatesInfections3791.02765.90.730.50
Congenital malformations12.449.8
Preterm complications00.0512.2
Intrapartum complications24.537.3
Other diseases00.024.9
Non-conclusive12.100.0
Overall 41 100.0 41 100.0
ChildrenInfections4073.94277.80.770.62
Congenital malformations00.023.7
Malignant neoplasms00.0713.0
Other diseases713.735.6
Non-conclusive712.500.0
Overall 54 100.0 54 100.0
Maternal deathsInfections* 2224.13942.90.59-0.05
Abortion00.099.9
Eclampsia2527.544.4
Obstetric hemorrhage2628.11617.6
Other obstetric
complications
00.466.6
Non-obstetric diseases** 1112.01617.6
Non-conclusive77.911.1
Overall 91 100.0 91 100.0
Other adultsInfections6558.28071.40.760.49
Malignant neoplasms43.71614.3
Other diseases3329.81614.3
Non-conclusive98.400.0
Overall 112 100.0 112 100.0

CDA: Complete diagnostic autopsy; n*: sum of cases estimated by InterVA model (in cause 1 or 2) weighted by their associated likelihood. The residual likelihoods count as non-conclusive case fractions; n: sum of cases established by the CDA; CSMF: cause-specific mortality fractions; CSMF Accuracy:measures the quality at the population level, quantifying how closely the estimated CSMF values approximate the truth; Uncorrected: median cause-specific mortality fractions accuracy across 500 Dirichlet draws. It ranges from zero to one; Chance corrected: Median Cause-Specific Mortality Fractions Accuracy for random allocation across 500 Dirichlet draws. A score of zero indicates predictive accuracy equal to random allocation.

* Includes all infections, both obstetric and non-obstetric.

** Non-obstetric diseases do not include infections.

The model was less accurate in stillbirths (CSMFA of 0.11) than in the other groups (CSMFA ranging from 0.59 to 0.77). When corrected by chance, the accuracy of the model compared to the CDA was not better than that expected by chance in stillbirths (negative CCCSMFA), close to chance in maternal deaths (close to zero CCCSMFA) and better than that expected by chance but far from perfection in the other groups (Table 4).

Discussion

It is recognized that accurate information on what is causing deaths is essential to reduce mortality. In this study, we have assessed for the first time to our knowledge, the validity of a commonly used VA method in establishing the CoD compared with the gold standard (the CDA) in different age groups of patients dying at a tertiary-level hospital in Maputo, Mozambique. The agreement of the VA model was overall poor across all age groups and conditions, both at the individual and at the population level.

The two reference standards that have been used for validating computer-coded VA, i.e. physician-coded VA methods and health facility medical records, cannot be considered true gold standards. The comparison between computer-coded and physician-coded VA methods lacks an external reference or gold standard comparator11,31. On the other hand, although health facility-derived information is considered as an appropriate reference standard for VA validation27, reports from both high and low-income countries indicate that this information frequently contains clinical errors4,5. It seems quite evident that if the main source of input to the VA tool is inaccurate, the output of the VA will not be precise either. Furthermore, if clinical errors are frequent even in well-equipped hospitals, it is expected that their frequency would be higher in VA data.

In this study, the performance of the VA model was overall poor in identifying CoDs in stillbirths. These findings disagree with those of a report from Pakistan using clinical data as reference standard, indicating that a physician-coded VA tool was valid to ascertain causes of stillbirths, specially congenital malformations32. In neonates, the sensitivity of the model in identifying preterm complications as a CoD, was also very low (0%), which may be relevant for pre-term birth prevention programs. In contrast, the performance of the model in identifying infectious diseases as a cause of neonatal death had a high sensitivity, suggesting that it may be an adequate method to identify neonatal sepsis at the community. Among children, the sensitivity of the model was only high in detecting infectious diseases as a CoD but it did not identify deaths due congenital malformations and malignant neoplasms. In maternal deaths, the sensitivity of the model was high in assigning eclampsia as a cause of maternal mortality; however, the probability that a maternal death identified as eclampsia by the model was actually eclampsia was quite low. Although there were 26 maternal deaths assigned to eclampsia as the most probable CoD according to the VA model, only four were actually due to eclampsia (most misdiagnosed cases died of infectious diseases), suggesting a significant overestimation of eclampsia as a cause of maternal mortality by this method. This is in agreement with a previous report where a high frequency of false positive clinical diagnosis of eclampsia compared to the CDA was also found, being most of them deaths from infectious diseases4. These findings are of relevance to eclampsia prevention programs, which may fail in reducing maternal mortality due to misdiagnosis. In adults, the sensitivity of the VA model was higher for infectious diseases compared to other CoD, but low in identifying malignant neoplasms as cause of mortality. According to these results, the model would underestimate malignant neoplasms as CoD in adults, which may be important for prevention programs of this condition in high mortality settings.

The performance of the model in identifying the specific infection CoD among patients who died of an infectious disease was overall low. The sensitivity of the model in identifying tuberculosis as a CoD was very low, which may be of public health relevance in high burden countries. Regarding malaria infection, the VA model and the CDA only agreed in two cases, while in the other four cases established by the CDA, the model assigned three of them to a non-infectious disease CoD and one as non-conclusive (Figure 2). Lack of precision at the individual level in assigning malaria infection as a CoD may be important to target malaria control efforts in the community and increasing programme’s effectiveness.

The main use of the VA information is to determine cause-specific mortality and distribution of CoD at the population level33; for this reason we also estimated the CSMF accuracy between the two methods. Both methods differed in the distribution of the proportion of the deaths assigned to several disease categories. When corrected by chance, the accuracy of the model in predicting in the population the CoD was poor, especially in stillbirths and maternal deaths and imperfect in the other groups.

A possible limitation of our study that might have influenced the predictions of the model, is that the indicators used to estimate the CoD by the VA model were extracted from medical records, since VAs were not done, which relates to the absence in the clinical records of some indicators of the model (43 indicators, 18% of the total). Nevertheless, most of these indicators (n=24) were secondary questions related to the duration of the event and therefore, not pertinent if the event did not occur. On the other hand, the most likely explanation for the lack of registration in the clinical record of the other 19 indicators (8%) is that they were not identified. The fact that the study is based in a large hospital might be seen as a limitation to extrapolate findings to deaths occurring in rural health-facilities or at home, since cause-composition of deaths in the community may be different to that of those occurring in a hospital. However, it is important to remember that this is a validation study and therefore, the objective was not that the deaths included were representative of those occurring in the community, but rather that the comparator of the VA was as true gold standard as possible. Thus, we needed a set of deaths, whose causes were established by the CDA, and therefore they had to occur in a hospital setting with autopsy facilities. On the other hand, to avoid that the cause-composition of deaths in that particular hospital and/or time-period affected the accuracy of the estimates of the VA, we created multiple test datasets with widely varying cause-compositions as it has been suggested28.

As explained in the methods section, we used InterVA version 4 because version 5 was not available at the time of this analysis. Even if the estimated CoD might differ between the two InterVA versions, a change in the group of CoD would not be expected. Otherwise it would mean that the two versions provide different results, requiring a revision of all published information using the previous InterVA model.

The post-2015 Development Agenda expects that high burden countries should have reliable information on number and CoD to reduce their main health problems34. However, this goal cannot continue to rely on imprecise measurement tools. The main shortcoming to achieve the SDGs is the imprecision of the currently used methods to establish CoD. These findings highlight the need of improving the quality and performance of current VA techniques by developing more precise tools for CoD ascertainment.

In conclusion, the “data revolution” of the post-2015 Development Agenda expects that high burden countries should have reliable information on number and causes of death in order to reduce main health problems through evidence-based decision-making, and target and monitor health programs29. However, this goal cannot continue to rely on imprecise measurement tools. The main shortcoming to achieve the SDGs, is not the scarce availability of physicians to carry out death certificates or VA codification, nor the solution is the available automated methods created to overcome some of the physician-coded VA limitations, but rather the imprecision of these methods to reliably establish causes of death. The findings of this study should serve to highlight the need to implement autopsy methods where they may be feasible, but even more importantly to improve the quality and performance of current VA techniques and to develop more precise CoD ascertainment tools.

Consent

Written informed consent for publication of the patients’ details and/or their images was obtained from the parents/guardian/relative of the patient.

Data availability

Underlying data

Study data cannot be shared in a public domain due to their sensitive nature and, being such as small sample, especially for some age-specific causes of death, it would be relatively easy to identify study individuals even if anonymized. However, deidentified data will be made available from the corresponding author on reasonable request. Requesters will be required to sign a letter of agreement detailing the mechanisms by which the data will be kept secure and access restricted to their study team. The agreements will also state that the recipient will not share the data with anyone outside of their research team.

Extended data

Open Science Framework: Limitations to current methods to estimate cause of death: a validation study of a verbal autopsy model.

https://doi.org/10.17605/OSF.IO/UMJV225

This project contains the following extended data:

  • -     Clinical_questionnaire.pdf (Clinical and epidemiological data collection questionnaire)

  • -     VA_validation_extended_data.pdf (PDF containing supplementary figures and tables)

    •       Figure S1. Outline of statistical methods

    •       Table S1. Description of metrics

    •       Table S2. Study group and number of causes of death and their associated likelihoods as established by the InterVA method

    •       Table S3. Number of cases and mean likelihood agreement between the InterVA’s predicted cause of death and that established by the CDAby study group

Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication).

Comments on this article Comments (2)

Version 3
VERSION 3 PUBLISHED 05 May 2021
Revised
Version 1
VERSION 1 PUBLISHED 28 May 2020
Discussion is closed on this version, please comment on the latest version above.
  • Author Response 22 Jun 2020
    Llorenç Quintó, Barcelona Institute for Global Health (ISGlobal), Hospital Clinic of Barcelona, Universitat de Barcelona, Barcelona, Spain
    22 Jun 2020
    Author Response
    We really appreciate the comments on this paper made by Prof Byass, whose opinions we always considered very highly.

    In fact, since the beginning of this analysis (more than 2 years ... Continue reading
  • Reader Comment 01 Jul 2020
    Peter Byass, InterVA, Sweden
    01 Jul 2020
    Reader Comment
    Although this unreviewed pre-print is potentially interesting, there are some aspects that make its findings questionable.

    1. This pre-print was published on 28 May 2020, including the statement "We ... Continue reading
  • Discussion is closed on this version, please comment on the latest version above.
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
Gates Open Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Menéndez C, Quintó L, Castillo P et al. Limitations to current methods to estimate cause of death: a validation study of a verbal autopsy model [version 3; peer review: 2 approved, 1 approved with reservations]. Gates Open Res 2021, 4:55 (https://doi.org/10.12688/gatesopenres.13132.3)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Comments on this article Comments (2)

Version 3
VERSION 3 PUBLISHED 05 May 2021
Revised
Version 1
VERSION 1 PUBLISHED 28 May 2020
Discussion is closed on this version, please comment on the latest version above.
  • Author Response 22 Jun 2020
    Llorenç Quintó, Barcelona Institute for Global Health (ISGlobal), Hospital Clinic of Barcelona, Universitat de Barcelona, Barcelona, Spain
    22 Jun 2020
    Author Response
    We really appreciate the comments on this paper made by Prof Byass, whose opinions we always considered very highly.

    In fact, since the beginning of this analysis (more than 2 years ... Continue reading
  • Reader Comment 01 Jul 2020
    Peter Byass, InterVA, Sweden
    01 Jul 2020
    Reader Comment
    Although this unreviewed pre-print is potentially interesting, there are some aspects that make its findings questionable.

    1. This pre-print was published on 28 May 2020, including the statement "We ... Continue reading
  • Discussion is closed on this version, please comment on the latest version above.
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

Are you a Gates-funded researcher?

If you are a previous or current Gates grant holder, sign up for information about developments, publishing and publications from Gates Open Research.

You must provide your first name
You must provide your last name
You must provide a valid email address
You must provide an institution.

Thank you!

We'll keep you updated on any major new updates to Gates Open Research

Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.