Digital Trace Data and Demographic Forecasting: How Well Did Google Predict the US COVID‐19 Baby Bust?
Wilde J., Chen W., Lohmann S., Abdel Ghany J.
AbstractAt the onset of the first wave of COVID‐19 in the United States, the pandemic's effect on future birthrates was unknown. In this paper, we assess whether digital trace data—often touted as a panacea for traditional data scarcity—held the potential to accurately predict fertility change caused by the COVID‐19 pandemic in the United States. Specifically, we produced state‐level, dynamic future predictions of the pandemic's effect on birthrates in the United States using pregnancy‐related Google search data. Importantly, these predictions were made in October 2020 (and revised in February 2021), well before the birth effect of the pandemic could have possibly been known. Our analysis predicted that between November 2020 and February 2021, monthly United States births would drop sharply by approximately 12 percent, then begin to rebound while remaining depressed through August 2021. While these predictions were generally accurate in terms of the magnitude and timing of the trough, there were important misses regarding the speed at which these reductions materialized and rebounded. This ex post evaluation of an ex ante prediction serves as a powerful demonstration of the “promise and pitfalls” of digital trace data in demographic research.