Bias in mobility datasets drives divergence in modeled outbreak dynamics

Taylor Chin, Michael A. Johansson, Anir Chowdhury, Shayan Chowdhury, Kawsar Hosan, Md Tanvir Quader, Caroline O. Buckee & Ayesha S. Mahmud
Communications Medicine
5, Article number: 8
January 7, 2025

Abstract

Background

Digital data sources such as mobile phone call detail records (CDRs) are increasingly being used to estimate population mobility fluxes and to predict the spatiotemporal dynamics of infectious disease outbreaks. Differences in mobile phone operators’ geographic coverage, however, may result in biased mobility estimates.

Methods

We leverage a unique dataset consisting of CDRs from three mobile phone operators in Bangladesh and digital trace data from Meta’s Data for Good program to compare mobility patterns across these sources. We use a metapopulation model to compare the sources’ effects on simulated outbreak trajectories, and compare results with a benchmark model with data from all three operators, representing around 100 million subscribers across the country.

Results

We show that mobility sources can vary significantly in their coverage of travel routes and geographic mobility patterns. Differences in projected outbreak dynamics are more pronounced at finer spatial scales, especially if the outbreak is seeded in smaller and/or geographically isolated regions. In some instances, a simple diffusion (gravity) model was better able to capture the timing and spatial spread of the outbreak compared to the sparser mobility sources.

Conclusions

Our results highlight the potential biases in predicted outbreak dynamics from a metapopulation model parameterized with non-population representative data, and the limits to the generalizability of models built on these types of novel human behavioral data.

Related publications