Bias in mobility datasets drives divergence in modeled outbreak dynamics
Publication
NetSI authors
Research area
Resources
Abstract
Background
Digital data sources such as mobile phone call detail records (CDRs) are increasingly being used to estimate population mobility fluxes and to predict the spatiotemporal dynamics of infectious disease outbreaks. Differences in mobile phone operators’ geographic coverage, however, may result in biased mobility estimates.
Methods
We leverage a unique dataset consisting of CDRs from three mobile phone operators in Bangladesh and digital trace data from Meta’s Data for Good program to compare mobility patterns across these sources. We use a metapopulation model to compare the sources’ effects on simulated outbreak trajectories, and compare results with a benchmark model with data from all three operators, representing around 100 million subscribers across the country.
Results
We show that mobility sources can vary significantly in their coverage of travel routes and geographic mobility patterns. Differences in projected outbreak dynamics are more pronounced at finer spatial scales, especially if the outbreak is seeded in smaller and/or geographically isolated regions. In some instances, a simple diffusion (gravity) model was better able to capture the timing and spatial spread of the outbreak compared to the sparser mobility sources.
Conclusions
Our results highlight the potential biases in predicted outbreak dynamics from a metapopulation model parameterized with non-population representative data, and the limits to the generalizability of models built on these types of novel human behavioral data.