Can Wikipedia forecast the flu?

Researchers at Los Alamos National Laboratory say Wikipedia article access logs are highly correlated with historical influenza-like illness records.

Wikimedia image: Wikipedia logo.

Wikipedia can not only tell you obscure things like where Bala Cynwyd is, it can also tell you if the Pennsylvania town is a current hotspot for the flu bug -- if you know how to ask.

Researchers at Los Alamos National Laboratory say they have learned how to glean information that can be used to forecast the upcoming flu season and other infectious diseases by analyzing views of Wikipedia articles.

Research teams at the laboratory recently published “Forecasting the 2013-2014 Influenza Season Using Wikipedia,” in the Public Library of Science.

Understanding influenza or other infectious disease dynamics and forecasting their impact is fundamental for developing prevention and mitigation strategies. To do that, Los Alamos researchers combined modern data assimilation methods with Wikipedia access logs and Center for Disease Control influenza-like illness (ILI) reports to create a weekly forecast for seasonal influenza.

The research taps into the angst people have when they come down with the flu bug and are searching for more information online. Los Alamos said its researchers found Wikipedia article access logs are highly correlated with historical ILI records and allow for accurate prediction of ILI data several weeks before it becomes available. The researchers' results showed that prior to the peak of the flu season, their forecasting method projected the actual outcome with a high probability.

“The ability to more accurately forecast the flu season and other infectious diseases will transform the way health departments prepare for and respond to epidemics, ultimately saving lives,” scientist Sara Del Valle said.

“We used techniques often seen in weather forecasting to iteratively tune a model of influenza dynamics based on Wikipedia observations so that our forecast agrees with the most current ILI data,” said researcher Kyle Hickmann, the lead author of the paper and a member of Del Valle’s team.

Los Alamos said the methods were applied to the 2013-2014 influenza season but are standard enough to forecast any disease outbreak, given incidence or case count data.

“Disease forecasting is still in its infancy and there is much more to learn in this field,” Del Valle said. “We are continuing to refine our approach so our forecasts can be used for actionable decision-making.”