1st Edition

Model-based Geostatistics for Global Public Health Methods and Applications

By Peter J. Diggle, Emanuele Giorgi Copyright 2019
    274 Pages
    by Chapman & Hall

    274 Pages
    by Chapman & Hall

    Model-based Geostatistics for Global Public Health: Methods and Applications provides an introductory account of model-based geostatistics, its implementation in open-source software and its application in public health research. In the public health problems that are the focus of this book, the authors describe and explain the pattern of spatial variation in a health outcome or exposure measurement of interest. Model-based geostatistics uses explicit probability models and established principles of statistical inference to address questions of this kind.





    Features:









    • Presents state-of-the-art methods in model-based geostatistics.






    • Discusses the application these methods some of the most challenging global public health problems including disease mapping, exposure mapping and environmental epidemiology.






    • Describes exploratory methods for analysing geostatistical data, including: diagnostic checking of residuals standard linear and generalized linear models; variogram analysis; Gaussian process models and geostatistical design issues.






    • Includes a range of more complex geostatistical problems where research is ongoing.






    • All of the results in the book are reproducible using publicly available R code and data-sets, as well as a dedicated R package.






    This book has been written to be accessible not only to statisticians but also to students and researchers in the public health sciences.





    The Authors





    Peter Diggle is Distinguished University Professor of Statistics in the Faculty of Health and Medicine, Lancaster University. He also holds honorary positions at the Johns Hopkins University School of Public Health, Columbia University International Research Institute for Climate and Society, and Yale University School of Public Health. His research involves the development of statistical methods for analyzing spatial and longitudinal data and their applications in the biomedical and health sciences.





    Dr Emanuele Giorgi is a Lecturer in Biostatistics and member of the CHICAS research group at Lancaster University, where he formerly obtained a PhD in Statistics and Epidemiology in 2015. His research interests involve the development of novel geostatistical methods for disease mapping, with a special focus on malaria and other tropical diseases. In 2018, Dr Giorgi was awarded the Royal Statistical Society Research Prize "for outstanding published contribution at the interface of statistics and epidemiology." He is also the lead developer of PrevMap, an R package where all the methodology found in this book has been implemented.



    1 Introduction

    Motivating example: mapping river-blindness in Africa

    Empirical or mechanistic models

    What is in this book?

    2 Regression modelling for spatially referenced data

    Linear regression models

    Malnutrition in Ghana

    Generalized linear models

    Logistic Binomial regression: river-blindness in Liberia

    Log-linear Poisson regression: abundance of Anopheles

    Gambia mosquitoes in Southern Cameroon

    Questioning the assumption of independence

    Testing for residual spatial correlation: the empirical variogram

    3 Theory

    Gaussian processes

    Families of spatial correlation functions

    The exponential family

    The Matter family

    The spherical family

    The theoretical variogram and the nugget variance

    Statistical inference

    Likelihood-based inference

    Bayesian Inference

    Predictive inference

    Approximations to Gaussian processes

    Low-rank approximations

    Gaussian Markov random held approximations via stochastic partial differential equations

    Contents

    4 The linear geostatistical model

    Model formulation

    Inference

    Likelihood-based inference

    Maximum likelihood estimation

    Bayesian inference

    Trans-Gaussian models

    Model validation

    Scenario 1: omission of the nugget effect

    Scenario 2: miss-specification of the smoothness parameter

    Scenario 3: non-Gaussian data

    Spatial prediction

    Applications

    Heavy metal monitoring in Galicia

    Malnutrition in Ghana (continued)

    Spatial predictions for the target population

    5 Generalized linear geostatistical models 85

    Model formulation

    Binomial sampling

    Poisson sampling

    Negative binomial sampling?

    Inference

    Likelihood-based inference

    Laplace approximation

    Monte Carlo maximum likelihood

    Bayesian inference

    Model validation

    Spatial prediction

    Applications

    River-blindness in Liberia (continued)

    Abundance of Anopheles Gambia mosquitoes in Southern

    Cameroon (continued)

    A link between geostatistical models and point processes

    A link between geostatistical models and spatially discrete processes

    6 Geostatistical design

    Introduction

    Definitions

    Non-adaptive designs

    Two extremes: completely random and completely regular designs

    Inhibitory designs

    Contents

    Inhibitory-plus-close-pairs designs

    Comparing designs: a simple example

    Modified regular lattice designs

    Application: rolling malaria indicator survey sampling in the Manjeet perimeter, southern Malawi

    Adaptive designs

    An adaptive design algorithm

    Application: sampling for malaria prevalence in the Manjeet perimeter (continued)

    Discussion

    7 Preferential sampling

    Definitions

    Preferential sampling methodology

    Non-uniform designs need not be preferential

    Adaptive designs need not be strongly preferential

    The Diggle, Menezes and Su model

    The Patti, Reich and Dunson model

    Monte Carlo maximum likelihood using stochastic partial differential equations

    Lead pollution in Galicia

    Mapping ozone concentration in Eastern United States

    Discussion

    8 Zero-inaction

    Models with zero-inaction

    Inference

    Spatial prediction

    Applications

    River blindness mapping in Sudan and South Sudan

    Loa loa: mapping prevalence and intensity of infection

    9 Spatio-temporal geostatistical analysis

    Setting the context

    Is the sampling design preferential?

    Geostatistical methods for spatio-temporal analysis

    Exploratory analysis: the spatio-temporal variogram

    Diagnostics and novel extensions

    Example: a model for disease prevalence with

    temporally varying variance

    Defining targets for prediction

    Accounting for parameter uncertainty using classical

    methods of inference

    Visualization

    Contents

    Historical mapping of malaria prevalence in Senegal from 1905 to 2014

    Discussion

    10 Further topics in model-based geostatistics

    Combining data from multiple surveys

    Using school and community surveys to estimate

    malaria prevalence in Nyanza province, Kenya

    Combining multiple instruments

    Case I: Predicting prevalence for a gold-standard diagnostic

    Case II: Joint prediction of prevalence from two complementary

    diagnostics

    Incomplete data

    Positional error

    Missing locations

    Modelling of the sampling design

    Appendices

    A Background statistical theory

    Probability distributions

    The Binomial distribution

    The Poisson distribution

    The Normal distribution

    Independent and dependent random variables

    Statistical models: responses, covariates, parameters and random

    effects

    Statistical inference

    The likelihood and log-likelihood functions

    Estimation, testing and prediction

    Classical inference

    Bayesian inference

    Prediction

    Monte Carlo methods

    Direct simulation

    Markov chain Monte Carlo

    Monte Carlo maximum likelihood

    B Spatial data handling 225

    Handling shape-_les in R

    Handling raster-_les in R

    Creating spatial covariates

    Maps and animations

    References

    Biography

    Peter Diggle is Distinguished University Professor of Statistics in the Faculty of Health and Medicine, Lancaster University. He also holds honorary positions at the Johns Hopkins University School of Public Health, Columbia University International Research Institute for Climate and Society, and Yale University School of Public Health. His research involves the development of statistical methods for analyzing spatial and longitudinal data and their applications in the biomedical and health sciences.



    Dr Emanuele Giorgi is a Lecturer in Biostatistics and member of the CHICAS research group at Lancaster University, where he formerly obtained a PhD in Statistics and Epidemiology in 2015. His research interests involve the development of novel geostatistical methods for disease mapping, with a special focus on malaria and other tropical diseases. In 2018, Dr Giorgi was awarded the Royal Statistical Society Research Prize "for outstanding published contribution at the interface of statistics and epidemiology." He is also the lead developer of PrevMap, an R package where all the methodology found in this book has been implemented.

    "This is an excellent source for public health professionals so far as the needed state-of-the-art concepts and methods that are needed to analyse and interpret geostatistical data. Basic knowledge of mathematical statistics is necessary to read through this well-written book... Focus has been made on disease mapping, environmental epidemiology, generalized linear models, variogram, and R-codes. The references are thorough and up-to-date. The examples are real-life oriented and interesting... Some unique features of this well-written book are the illustrations and they include river blindness in Liberia, heavy metal monitoring in Galicia, malnutrition in Ghana, rolling malaria in Malawi, ozone concentration in Eastern United States, prevalence and intensity of infection among others.This book is quite suitable to be a textbook for a graduate level course in global public health or geo-statistics. Researchers and doctoral graduate students seeking thesis topic ought to read this book. I enjoyed reading this book. I recommend this book to statistics and computing professionals."
    - Ramalingam Shanmugam, in the Journal of Statistical Computation and Simulation, April 2020

    "This book was written primarily to introduce geostatistics to public health researchers...The text goes beyond introductory descriptions and provides a fairly comprehensive guide to geostatistics, ranging from the design of geostatistical experiments to the analysis of complicated datasets. While the book’s target audience is mainly public health researchers, the material is also helpful to PhD students and even statistics faculty that want an introduction to geostatistics. Each chapter can be read as an independent guide or read jointly to gain a more complete understanding of geostatistical research from data collection to analysis... The text is well-written and genuinely enjoyable to read. One of the main attractions of the book is that the authors offer tidbits of advice from their own expert experience analyzing geostatistical data...While other texts can lose the readers in the seemingly endless modeling choices, Diggle and Giorgi guide their audience to make informed decisions from the first design stages to the final visualizations."
    - Ian Laga and Xiaoyue Niu, JASA 2020

    "The book provides an integrated mix of statistical theory and applications, working up from linear regression through to generalised geostatistical models and on to specialised topics, such as zero-inflation in geostatistical models, spatiotemporal models and approaches to combining data from multiple sources...The relevant case studies developed throughout the course of the book provide an excellent demonstration of the methods and potential insight available from using geostatistical approaches. Furthermore, the emphasis on the communication of model results is a beneficial addition for any statistician working in a collaborative environment. Model-based Geostatistics for Global Public Health provides a good grounding in geostatistical modelling with excellent worked case studies in the global public health domain. It offers particular value to applied statisticians with its technical detail and thorough case studies. The book is supported by an open-source R package, PrevMap."
    - Kirsty L. Hassall, Rothamsted Research, Harpenden, UK