Role of survey response rates on valid inference: An application to HIV prevalence estimates

Miguel Marino, Marcello Pagano

Research output: Contribution to journalReview article

Abstract

Background: Nationally-representative surveys suggest that females have a higher prevalence of HIV than males in most African countries. Unfortunately, these results are made on the basis of surveys with non-ignorable missing data. This study evaluates the impact that differential survey nonresponse rates between males and females can have on the point estimate of the HIV prevalence ratio of these two classifiers. Methods: We study 29 Demographic and Health Surveys (DHS) from 2001 to 2010. Instead of employing often used multiple imputation models with a Missing at Random assumption that may not hold in this setting, we assess the effect of ignoring the information contained in the missing HIV information for males and females through three proposed statistical measures. These measures can be used in settings where the interest is comparing the prevalence of a disease between two groups. The proposed measures do not utilize parametric models and can be implemented by researchers of any level. They are: (1) an upper bound on the potential bias of the usual practise of using reported HIV prevalence estimates that ignore subjects who have missing HIV outcomes. (2) Plausible range intervals to account for nonresponses, without any additional parametric modeling assumptions. (3) Prevalence ratio inflation factors to correct the point estimate of the HIV prevalence ratio, if estimates of nonresponders' HIV prevalences were known. Results: In 86% of countries, males have higher upper bounds of HIV prevalence than females, this is consonant with males possibly having higher infection rates than females. Additionally, 74% of surveys have a plausible range that crosses 1.0, suggesting a plausible equivalence between male and female HIV prevalences. Conclusions: It is quite reasonable to conclude that there is so much DHS nonresponse in evaluating the HIV status question, that existing data is plausibly generated by the situation where the virus is equally distributed between the sexes.

Original languageEnglish (US)
Article number6
JournalEmerging Themes in Epidemiology
Volume15
Issue number1
DOIs
StatePublished - Mar 5 2018

Fingerprint

HIV
Demography
Surveys and Questionnaires
Economic Inflation
Research Personnel
Viruses
Infection

Keywords

  • HIV reporting
  • HIV testing
  • Missing at random
  • Nonresponse
  • Survey bias

ASJC Scopus subject areas

  • Epidemiology

Cite this

Role of survey response rates on valid inference : An application to HIV prevalence estimates. / Marino, Miguel; Pagano, Marcello.

In: Emerging Themes in Epidemiology, Vol. 15, No. 1, 6, 05.03.2018.

Research output: Contribution to journalReview article

@article{4dcea449ebf141de9f44dd740876b90e,
title = "Role of survey response rates on valid inference: An application to HIV prevalence estimates",
abstract = "Background: Nationally-representative surveys suggest that females have a higher prevalence of HIV than males in most African countries. Unfortunately, these results are made on the basis of surveys with non-ignorable missing data. This study evaluates the impact that differential survey nonresponse rates between males and females can have on the point estimate of the HIV prevalence ratio of these two classifiers. Methods: We study 29 Demographic and Health Surveys (DHS) from 2001 to 2010. Instead of employing often used multiple imputation models with a Missing at Random assumption that may not hold in this setting, we assess the effect of ignoring the information contained in the missing HIV information for males and females through three proposed statistical measures. These measures can be used in settings where the interest is comparing the prevalence of a disease between two groups. The proposed measures do not utilize parametric models and can be implemented by researchers of any level. They are: (1) an upper bound on the potential bias of the usual practise of using reported HIV prevalence estimates that ignore subjects who have missing HIV outcomes. (2) Plausible range intervals to account for nonresponses, without any additional parametric modeling assumptions. (3) Prevalence ratio inflation factors to correct the point estimate of the HIV prevalence ratio, if estimates of nonresponders' HIV prevalences were known. Results: In 86{\%} of countries, males have higher upper bounds of HIV prevalence than females, this is consonant with males possibly having higher infection rates than females. Additionally, 74{\%} of surveys have a plausible range that crosses 1.0, suggesting a plausible equivalence between male and female HIV prevalences. Conclusions: It is quite reasonable to conclude that there is so much DHS nonresponse in evaluating the HIV status question, that existing data is plausibly generated by the situation where the virus is equally distributed between the sexes.",
keywords = "HIV reporting, HIV testing, Missing at random, Nonresponse, Survey bias",
author = "Miguel Marino and Marcello Pagano",
year = "2018",
month = "3",
day = "5",
doi = "10.1186/s12982-018-0074-x",
language = "English (US)",
volume = "15",
journal = "Emerging Themes in Epidemiology",
issn = "1742-7622",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - Role of survey response rates on valid inference

T2 - An application to HIV prevalence estimates

AU - Marino, Miguel

AU - Pagano, Marcello

PY - 2018/3/5

Y1 - 2018/3/5

N2 - Background: Nationally-representative surveys suggest that females have a higher prevalence of HIV than males in most African countries. Unfortunately, these results are made on the basis of surveys with non-ignorable missing data. This study evaluates the impact that differential survey nonresponse rates between males and females can have on the point estimate of the HIV prevalence ratio of these two classifiers. Methods: We study 29 Demographic and Health Surveys (DHS) from 2001 to 2010. Instead of employing often used multiple imputation models with a Missing at Random assumption that may not hold in this setting, we assess the effect of ignoring the information contained in the missing HIV information for males and females through three proposed statistical measures. These measures can be used in settings where the interest is comparing the prevalence of a disease between two groups. The proposed measures do not utilize parametric models and can be implemented by researchers of any level. They are: (1) an upper bound on the potential bias of the usual practise of using reported HIV prevalence estimates that ignore subjects who have missing HIV outcomes. (2) Plausible range intervals to account for nonresponses, without any additional parametric modeling assumptions. (3) Prevalence ratio inflation factors to correct the point estimate of the HIV prevalence ratio, if estimates of nonresponders' HIV prevalences were known. Results: In 86% of countries, males have higher upper bounds of HIV prevalence than females, this is consonant with males possibly having higher infection rates than females. Additionally, 74% of surveys have a plausible range that crosses 1.0, suggesting a plausible equivalence between male and female HIV prevalences. Conclusions: It is quite reasonable to conclude that there is so much DHS nonresponse in evaluating the HIV status question, that existing data is plausibly generated by the situation where the virus is equally distributed between the sexes.

AB - Background: Nationally-representative surveys suggest that females have a higher prevalence of HIV than males in most African countries. Unfortunately, these results are made on the basis of surveys with non-ignorable missing data. This study evaluates the impact that differential survey nonresponse rates between males and females can have on the point estimate of the HIV prevalence ratio of these two classifiers. Methods: We study 29 Demographic and Health Surveys (DHS) from 2001 to 2010. Instead of employing often used multiple imputation models with a Missing at Random assumption that may not hold in this setting, we assess the effect of ignoring the information contained in the missing HIV information for males and females through three proposed statistical measures. These measures can be used in settings where the interest is comparing the prevalence of a disease between two groups. The proposed measures do not utilize parametric models and can be implemented by researchers of any level. They are: (1) an upper bound on the potential bias of the usual practise of using reported HIV prevalence estimates that ignore subjects who have missing HIV outcomes. (2) Plausible range intervals to account for nonresponses, without any additional parametric modeling assumptions. (3) Prevalence ratio inflation factors to correct the point estimate of the HIV prevalence ratio, if estimates of nonresponders' HIV prevalences were known. Results: In 86% of countries, males have higher upper bounds of HIV prevalence than females, this is consonant with males possibly having higher infection rates than females. Additionally, 74% of surveys have a plausible range that crosses 1.0, suggesting a plausible equivalence between male and female HIV prevalences. Conclusions: It is quite reasonable to conclude that there is so much DHS nonresponse in evaluating the HIV status question, that existing data is plausibly generated by the situation where the virus is equally distributed between the sexes.

KW - HIV reporting

KW - HIV testing

KW - Missing at random

KW - Nonresponse

KW - Survey bias

UR - http://www.scopus.com/inward/record.url?scp=85042944798&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85042944798&partnerID=8YFLogxK

U2 - 10.1186/s12982-018-0074-x

DO - 10.1186/s12982-018-0074-x

M3 - Review article

AN - SCOPUS:85042944798

VL - 15

JO - Emerging Themes in Epidemiology

JF - Emerging Themes in Epidemiology

SN - 1742-7622

IS - 1

M1 - 6

ER -