An Automated Approach for Finding Spatio-Temporal Patterns of Seasonal Influenza in the United States: Algorithm Validation Study

Background Agencies such as the Centers for Disease Control and Prevention (CDC) currently release influenza-like illness incidence data, along with descriptive summaries of simple spatio-temporal patterns and trends. However, public health researchers, government agencies, as well as the general public, are often interested in deeper patterns and insights into how the disease is spreading, with additional context. Analysis by domain experts is needed for deriving such insights from incidence data. Objective Our goal was to develop an automated approach for finding interesting spatio-temporal patterns in the spread of a disease over a large region, such as regions which have specific characteristics (eg, high incidence in a particular week, those which showed a sudden change in incidence) or regions which have significantly different incidence compared to earlier seasons. Methods We developed techniques from the area of transactional data mining for characterizing and finding interesting spatio-temporal patterns in disease spread in an automated manner. A key part of our approach involved using the principle of minimum description length for representing a given target set in terms of combinations of attributes (referred to as clauses); we considered both positive and negative clauses, relaxed descriptions which approximately represent the set, and used integer programming to find such descriptions. Finally, we designed an automated approach, which examines a large space of sets corresponding to different spatio-temporal patterns, and ranks them based on the ratio of their size to their description length (referred to as the compression ratio). Results We applied our methods using minimum description length to find spatio-temporal patterns in the spread of seasonal influenza in the United States using state level influenza-like illness activity indicator data from the CDC. We observed that the compression ratios were over 2.5 for 50% of the chosen sets, when approximate descriptions and negative clauses were allowed. Sets with high compression ratios (eg, over 2.5) corresponded to interesting patterns in the spatio-temporal dynamics of influenza-like illness. Our approach also outperformed description by solution in terms of the compression ratio. Conclusions Our approach, which is an unsupervised machine learning method, can provide new insights into patterns and trends in the disease spread in an automated manner. Our results show that the description complexity is an effective approach for characterizing sets of interest, which can be easily extended to other diseases and regions beyond influenza in the US. Our approach can also be easily adapted for automated generation of narratives.

Download Full-text

An Automated Approach for Finding Spatio-Temporal Patterns of Seasonal Influenza in the United States: Algorithm Validation Study (Preprint)

10.2196/preprints.12842 ◽

2018 ◽

Author(s):

Prathyush Sambaturu ◽

Parantapa Bhattacharya ◽

Jiangzhuo Chen ◽

Bryan Lewis ◽

Madhav Marathe ◽

...

Keyword(s):

United States ◽

Compression Ratio ◽

Seasonal Influenza ◽

Minimum Description Length ◽

Temporal Patterns ◽

The United States ◽

Disease Spread ◽

Influenza Like Illness ◽

Incidence Data ◽

Spatio Temporal

BACKGROUND Agencies such as the Centers for Disease Control and Prevention (CDC) currently release influenza-like illness incidence data, along with descriptive summaries of simple spatio-temporal patterns and trends. However, public health researchers, government agencies, as well as the general public, are often interested in deeper patterns and insights into how the disease is spreading, with additional context. Analysis by domain experts is needed for deriving such insights from incidence data. OBJECTIVE Our goal was to develop an automated approach for finding interesting spatio-temporal patterns in the spread of a disease over a large region, such as regions which have specific characteristics (eg, high incidence in a particular week, those which showed a sudden change in incidence) or regions which have significantly different incidence compared to earlier seasons. METHODS We developed techniques from the area of transactional data mining for characterizing and finding interesting spatio-temporal patterns in disease spread in an automated manner. A key part of our approach involved using the principle of minimum description length for representing a given target set in terms of combinations of attributes (referred to as clauses); we considered both positive and negative clauses, relaxed descriptions which approximately represent the set, and used integer programming to find such descriptions. Finally, we designed an automated approach, which examines a large space of sets corresponding to different spatio-temporal patterns, and ranks them based on the ratio of their size to their description length (referred to as the compression ratio). RESULTS We applied our methods using minimum description length to find spatio-temporal patterns in the spread of seasonal influenza in the United States using state level influenza-like illness activity indicator data from the CDC. We observed that the compression ratios were over 2.5 for 50% of the chosen sets, when approximate descriptions and negative clauses were allowed. Sets with high compression ratios (eg, over 2.5) corresponded to interesting patterns in the spatio-temporal dynamics of influenza-like illness. Our approach also outperformed description by solution in terms of the compression ratio. CONCLUSIONS Our approach, which is an unsupervised machine learning method, can provide new insights into patterns and trends in the disease spread in an automated manner. Our results show that the description complexity is an effective approach for characterizing sets of interest, which can be easily extended to other diseases and regions beyond influenza in the US. Our approach can also be easily adapted for automated generation of narratives.

Download Full-text

Spatio-temporal analysis of influenza-like illness and prediction of incidence in high-risk regions, in the United States from 2011 to 2020

10.21203/rs.3.rs-220805/v1 ◽

2021 ◽

Author(s):

Zhijuan Song ◽

Xiaocan Jia ◽

Junzhe Bao ◽

Yongli Yang ◽

Huili Zhu ◽

...

Keyword(s):

United States ◽

Statistical Significance ◽

Temporal Analysis ◽

The United States ◽

Incidence Rates ◽

Influenza Like Illness ◽

Temporal Cluster ◽

Control And Prevention ◽

Spatio Temporal ◽

And Control

Abstract Introduction: About 8% of Americans get influenza during an average season from the Centers for Disease Control and Prevention in the United States. It is necessary to strengthen the early warning of influenza and the prediction of public health. Methods In this study, we analyzed the characteristics of Influenza-like Illness (ILI) by Geographic Information System and SARIMA model, respectively. Spatio-temporal cluster analysis detected 23 clusters of ILI during the study period. Results The highest incidence of ILI was mainly concentrated in the states of Louisiana, District of Columbia and Virginia. The Local spatial autocorrelation analysis revealed the High-High cluster was mainly located in Louisiana and Mississippi. This means that if the influenza incidence is high in Louisiana and Mississippi, the neighboring states will also have higher influenza incidence rates. The regression model SARIMA(1, 0, 0)(1, 1, 0)52 with statistical significance was obtained to forecast the ILI incidence of Mississippi. Conclusions The study showed, the ILI incidence will begin to increase in the 45th week 2020 and peak in the 6th week 2021. To conclude, notable epidemiological differences were observed across states, indicating that some states should pay more attention to prevent and control respiratory infectious diseases.

Download Full-text

Landscape determinants of spatio-temporal patterns of aerosol optical depth in the two most polluted metropolitans in the United States

The Science of The Total Environment ◽

10.1016/j.scitotenv.2017.07.273 ◽

2017 ◽

Vol 609 ◽

pp. 1556-1565 ◽

Cited By ~ 15

Author(s):

Chenghao Wang ◽

Chuyuan Wang ◽

Soe W. Myint ◽

Zhi-Hua Wang

Keyword(s):

United States ◽

Aerosol Optical Depth ◽

Optical Depth ◽

Temporal Patterns ◽

The United States ◽

Spatio Temporal

Download Full-text

Spatio-Temporal Analysis of Influenza-Like Illness and Prediction of Incidence in High-Risk Regions in the United States from 2011 to 2020

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph18137120 ◽

2021 ◽

Vol 18 (13) ◽

pp. 7120

Author(s):

Zhijuan Song ◽

Xiaocan Jia ◽

Junzhe Bao ◽

Yongli Yang ◽

Huili Zhu ◽

...

Keyword(s):

United States ◽

High Risk ◽

Moving Average ◽

Temporal Analysis ◽

The United States ◽

Influenza Like Illness ◽

Important Health ◽

Control And Prevention ◽

Spatio Temporal ◽

Predicted Values

About 8% of the Americans contract influenza during an average season according to the Centers for Disease Control and Prevention in the United States. It is necessary to strengthen the early warning for influenza and the prediction of public health. In this study, Spatial autocorrelation analysis and spatial scanning analysis were used to identify the spatiotemporal patterns of influenza-like illness (ILI) prevalence in the United States, during the 2011–2020 transmission seasons. A seasonal autoregressive integrated moving average (SARIMA) model was constructed to predict the influenza incidence of high-risk states. We found the highest incidence of ILI was mainly concentrated in the states of Louisiana, District of Columbia and Virginia. Mississippi was a high-risk state with a higher influenza incidence, and exhibited a high-high cluster with neighboring states. A SARIMA (1, 0, 0) (1, 1, 0)52 model was suitable for forecasting the ILI incidence of Mississippi. The relative errors between actual values and predicted values indicated that the predicted values matched the actual values well. Influenza is still an important health problem in the United States. The spread of ILI varies by season and geographical region. The peak season of influenza was the winter and spring, and the states with higher influenza rates are concentrated in the southeast. Increased surveillance in high-risk states could help control the spread of the influenza.

Download Full-text

Corrigendum to “Landscape determinants of spatio-temporal patterns of aerosol optical depth in the two most polluted metropolitans in the United States” [Sci. Total Environ. 609 (2017) 1556–1565]

The Science of The Total Environment ◽

10.1016/j.scitotenv.2018.01.039 ◽

2018 ◽

Vol 626 ◽

pp. 1502-1504

Author(s):

Chenghao Wang ◽

Chuyuan Wang ◽

Soe W. Myint ◽

Zhi-Hua Wang

Keyword(s):

United States ◽

Aerosol Optical Depth ◽

Optical Depth ◽

Temporal Patterns ◽

The United States ◽

Spatio Temporal

Download Full-text

Response of crop yield to different time-scales of drought in the United States: Spatio-temporal patterns and climatic and environmental drivers

Agricultural and Forest Meteorology ◽

10.1016/j.agrformet.2018.09.019 ◽

2019 ◽

Vol 264 ◽

pp. 40-55 ◽

Cited By ~ 26

Author(s):

Marina Peña-Gallardo ◽

Sergio M. Vicente-Serrano ◽

Steven Quiring ◽

Marc Svoboda ◽

Jamie Hannaford ◽

...

Keyword(s):

United States ◽

Time Scales ◽

Crop Yield ◽

Temporal Patterns ◽

The United States ◽

Environmental Drivers ◽

Spatio Temporal ◽

Different Time Scales

Download Full-text

Synthesis of public water supply use in the United States: Spatio‐temporal patterns and socio‐economic controls

Earth s Future ◽

10.1002/2016ef000511 ◽

2017 ◽

Vol 5 (7) ◽

pp. 771-788 ◽

Cited By ~ 12

Author(s):

A. Sankarasubramanian ◽

J. L. Sabo ◽

K. L. Larson ◽

S. B. Seo ◽

T. Sinha ◽

...

Keyword(s):

United States ◽

Water Supply ◽

Temporal Patterns ◽

The United States ◽

Public Water Supply ◽

Spatio Temporal

Download Full-text

Retinopathy of Prematurity: An Estimate of Vision Loss in the United States—1979

PEDIATRICS ◽

10.1542/peds.67.6.924 ◽

1981 ◽

Vol 67 (6) ◽

pp. 924-926

Author(s):

Dale L. Phelps

Keyword(s):

United States ◽

Birth Weight ◽

Retinopathy Of Prematurity ◽

Simple Formula ◽

Vision Loss ◽

The United States ◽

Regional Data ◽

Incidence Data ◽

Prevention And Treatment

The number of infants blinded from retinopathy of prematurity in the United States in 1979 is estimated to be 546, based on birth-weight-specific published survival statistics and ROP incidence data. Approximately 2,100 infants will be affected by cicatricial disease annually. A simple formula is presented that permits estimation of incidence data based on other regional data. It is suggested that increased attention be focused on this old enemy in order to document its incidence worldwide and to learn more about its prevention and treatment.

Download Full-text

The Effects of Mask Wearing, Mobility Change and SARS-CoV-2 Interference on Seasonal Influenza in Northern China, Southern China, England, and the United States

SSRN Electronic Journal ◽

10.2139/ssrn.3943137 ◽

2021 ◽

Author(s):

Shasha Han ◽

Ting Zhang ◽

Yan Lyu ◽

Shengjie Lai ◽

Peixi Dai ◽

...

Keyword(s):

United States ◽

Seasonal Influenza ◽

Southern China ◽

Northern China ◽

The United States

Download Full-text

Fine-grained, spatio-temporal datasets measuring 200 years of land development in the United States

10.5194/essd-2020-217 ◽

2020 ◽

Cited By ~ 2

Author(s):

Johannes H. Uhl ◽

Stefan Leyk ◽

Caitlin M. McShane ◽

Anna E. Braswell ◽

Dylan S. Connor ◽

...

Keyword(s):

United States ◽

Temporal Resolution ◽

Data Extraction ◽

Historical Analysis ◽

Land Development ◽

Remote Sensing Data ◽

The United States ◽

Building Stock ◽

Fine Grained ◽

Spatio Temporal

Abstract. The collection, processing and analysis of remote sensing data since the early 1970s has rapidly improved our understanding of change on the Earth’s surface. While satellite-based earth observation has proven to be of vast scientific value, these data are typically confined to recent decades of observation and often lack important thematic detail. Here, we advance in this arena by constructing new spatially-explicit settlement data for the United States that extend back to the early nineteenth century, and is consistently enumerated at fine spatial and temporal granularity (i.e., 250 m spatial, and 5 a temporal resolution). We create these time series using a large, novel building stock database to extract and map retrospective, fine-grained spatial distributions of built-up properties in the conterminous United States from 1810 to 2015. From our data extraction, we analyse and publish a series of gridded geospatial datasets that enable novel retrospective historical analysis of the built environment at unprecedented spatial and temporal resolution. The datasets are available at https://dataverse.harvard.edu/dataverse/hisdacus (Uhl and Leyk, 2020a, b, c, d).

Download Full-text