Unknown

Dataset Information

0

Improved prediction accuracy for disease risk mapping using Gaussian process stacked generalization.


ABSTRACT: Maps of infectious disease-charting spatial variations in the force of infection, degree of endemicity and the burden on human health-provide an essential evidence base to support planning towards global health targets. Contemporary disease mapping efforts have embraced statistical modelling approaches to properly acknowledge uncertainties in both the available measurements and their spatial interpolation. The most common such approach is Gaussian process regression, a mathematical framework composed of two components: a mean function harnessing the predictive power of multiple independent variables, and a covariance function yielding spatio-temporal shrinkage against residual variation from the mean. Though many techniques have been developed to improve the flexibility and fitting of the covariance function, models for the mean function have typically been restricted to simple linear terms. For infectious diseases, known to be driven by complex interactions between environmental and socio-economic factors, improved modelling of the mean function can greatly boost predictive power. Here, we present an ensemble approach based on stacked generalization that allows for multiple nonlinear algorithmic mean functions to be jointly embedded within the Gaussian process framework. We apply this method to mapping Plasmodium falciparum prevalence data in sub-Saharan Africa and show that the generalized ensemble approach markedly outperforms any individual method.

SUBMITTER: Bhatt S 

PROVIDER: S-EPMC5636278 | biostudies-literature | 2017 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improved prediction accuracy for disease risk mapping using Gaussian process stacked generalization.

Bhatt Samir S   Cameron Ewan E   Flaxman Seth R SR   Weiss Daniel J DJ   Smith David L DL   Gething Peter W PW  

Journal of the Royal Society, Interface 20170901 134


Maps of infectious disease-charting spatial variations in the force of infection, degree of endemicity and the burden on human health-provide an essential evidence base to support planning towards global health targets. Contemporary disease mapping efforts have embraced statistical modelling approaches to properly acknowledge uncertainties in both the available measurements and their spatial interpolation. The most common such approach is Gaussian process regression, a mathematical framework com  ...[more]

Similar Datasets

| S-EPMC6414317 | biostudies-literature
| S-EPMC5995745 | biostudies-literature
| S-EPMC8633453 | biostudies-literature
| S-EPMC3681788 | biostudies-literature
2017-10-08 | GSE104714 | GEO
| S-EPMC7433704 | biostudies-literature
| S-EPMC6195267 | biostudies-literature
| S-EPMC6936821 | biostudies-literature
| S-EPMC2777180 | biostudies-literature
| S-EPMC7846191 | biostudies-literature