scholarly journals Application of functional data analysis to classify communities according to the pattern of malaria incidence, to guide targeted control strategies.

2020 ◽  
Author(s):  
Sokhna DIENG ◽  
Pierre Michel ◽  
Abdoulaye Guindo ◽  
Kankoe Sallah ◽  
El-hadj Ba ◽  
...  

Abstract Background Effective targeting of malaria control in low transmission areas requires identification of transmission foci or hotspots. We investigated the use of functional data analysis to identify and describe spatio-temporal pattern of malaria incidence in an area with seasonal transmission in west-central Senegal. Method Malaria surveillance was maintained over 5 years from 2008 to 2012 at health facilities serving a population of 500,000 in 575 villages in two health districts in Senegal. Smooth functions were fitted from the time series of malaria incidence for each village, using cubic B-spline basis functions. The resulting smooth functions for each village were classified using hierarchical clustering (Ward’s method), using several different dissimilarity measures. The optimal number of clusters was then determined based on four cluster validity indices, to determine the main types of distinct temporal pattern of malaria incidence. Epidemiological indicators characterizing the resulting malaria incidence pattern in terms of the timing of seasonal outbreaks, were calculated based on the slope (velocity) and rate of change of the slope (acceleration) of the incidence over time. Results Three distinct patterns of malaria incidence were identified. A pattern characterized by high incidence, in 12/575 (2%) villages, with average incidence of 114 cases/1000 person-years over the 5 year study period; a pattern with intermediate incidence in 97 villages (17%), with average incidence of 13 cases/1000 person-years; and a pattern with low incidence in 466 (81%) villages, with average incidence 2.6 cases/1000 person-years. Epidemiological indicators characterizing the fluctuations in malaria incidence showed that seasonal outbreaks started later, and ended earlier, in the low incidence pattern. Conclusion Functional data analysis can be used to classify communities based on time series of malaria incidence, and to identify high incidence communities. Indicators can be derived from the fitted functions which characterize the timing of outbreaks. These tools may help to better target control measures.

Author(s):  
Sokhna Dieng ◽  
Pierre Michel ◽  
Abdoulaye Guindo ◽  
Kankoe Sallah ◽  
El-Hadj Ba ◽  
...  

We introduce an approach based on functional data analysis to identify patterns of malaria incidence to guide effective targeting of malaria control in a seasonal transmission area. Using functional data method, a smooth function (functional data or curve) was fitted from the time series of observed malaria incidence for each of 575 villages in west-central Senegal from 2008 to 2012. These 575 smooth functions were classified using hierarchical clustering (Ward’s method), and several different dissimilarity measures. Validity indices were used to determine the number of distinct temporal patterns of malaria incidence. Epidemiological indicators characterizing the resulting malaria incidence patterns were determined from the velocity and acceleration of their incidences over time. We identified three distinct patterns of malaria incidence: high-, intermediate-, and low-incidence patterns in respectively 2% (12/575), 17% (97/575), and 81% (466/575) of villages. Epidemiological indicators characterizing the fluctuations in malaria incidence showed that seasonal outbreaks started later, and ended earlier, in the low-incidence pattern. Functional data analysis can be used to identify patterns of malaria incidence, by considering their temporal dynamics. Epidemiological indicators derived from their velocities and accelerations, may guide to target control measures according to patterns.


2019 ◽  
Vol 109 (1) ◽  
pp. 96-110 ◽  
Author(s):  
D. A. Shah ◽  
E. D. De Wolf ◽  
P. A. Paul ◽  
L. V. Madden

In past efforts, input weather variables for Fusarium head blight (FHB) prediction models in the United States were identified after following some version of the window-pane algorithm, which discretizes a continuous weather time series into fixed-length windows before searching for summary variables associated with FHB risk. Functional data analysis, on the other hand, reconstructs the assumed continuous process (represented by a series of recorded weather data) by using smoothing functions, and is an alternative way of working with time series data with respect to FHB risk. Our objective was to functionally model weather-based time series data linked to 865 observations of FHB (covering 16 states and 31 years in total), classified as epidemics (FHB disease index ≥ 10%) and nonepidemics (FHB disease index < 10%). Altogether, 94 different time series variables were modeled by penalized cubic B-splines for the smoothing function, from 120 days pre-anthesis to 20 days post-anthesis. Functional mean curves, standard deviations, and first derivatives were plotted for FHB epidemics relative to nonepidemics. Function-on-scalar regressions assessed the temporal trends of the magnitude and significance of the mean difference between functionally represented weather time series associated with FHB epidemics and nonepidemics. The mean functional weather-variable curve for epidemics started to deviate, in general, from that for nonepidemics as early as 40 days pre-anthesis for several weather variables. The greatest deviations were often near anthesis, the period of maximum susceptibility of wheat to FHB-causing fungi. The most consistent separations between the mean functional curves were seen with the daily averages of moisture-related variables (such as average relative humidity) and with variables summarizing the daily variation in temperature (as opposed to the daily mean). Functional data analysis was useful for extending our knowledge of relationships between weather variables and FHB epidemics.


2015 ◽  
Vol 2015 ◽  
pp. 1-11 ◽  
Author(s):  
Keshav P. Pokhrel ◽  
Chris P. Tsokos

Incidence and mortality rates are considered as a guideline for planning public health strategies and allocating resources. We apply functional data analysis techniques to model age-specific brain cancer mortality trend and forecast entire age-specific functions using exponential smoothing state-space models. The age-specific mortality curves are decomposed using principal component analysis and fit functional time series model with basis functions. Nonparametric smoothing methods are used to mitigate the existing randomness in the observed data. We use functional time series model on age-specific brain cancer mortality rates and forecast mortality curves with prediction intervals using exponential smoothing state-space model. We also present a disparity of brain cancer mortality rates among the age groups together with the rate of change of mortality rates. The data were obtained from the Surveillance, Epidemiology and End Results (SEER) program of the United States. The brain cancer mortality rates, classified under International Classification Disease code ICD-O-3, were extracted from SEER*Stat software.


This handbook presents the state-of-the-art of the statistics dealing with functional data analysis. With contributions from international experts in the field, it discusses a wide range of the most important statistical topics (classification, inference, factor-based analysis, regression modeling, resampling methods, time series, random processes) while also taking into account practical, methodological, and theoretical aspects of the problems. The book is organised into three sections. Part I deals with regression modeling and covers various statistical methods for functional data such as linear/nonparametric functional regression, varying coefficient models, and linear/nonparametric functional processes (i.e. functional time series). Part II considers related benchmark methods/tools for functional data analysis, including curve registration methods for preprocessing functional data, functional principal component analysis, and resampling/bootstrap methods. Finally, Part III examines some of the fundamental mathematical aspects of the infinite-dimensional setting, with a focus on the stochastic background and operatorial statistics: vector-valued function integration, spectral and random measures linked to stationary processes, operator geometry, vector integration and stochastic integration in Banach spaces, and operatorial statistics linked to quantum statistics.


2019 ◽  
Vol 12 (2) ◽  
pp. 735-747 ◽  
Author(s):  
Eva Holtanová ◽  
Thomas Mendlik ◽  
Jan Koláček ◽  
Ivanka Horová ◽  
Jiří Mikšovský

Abstract. Despite the abundance of available global climate model (GCM) and regional climate model (RCM) outputs, their use for evaluation of past and future climate change is often complicated by substantial differences between individual simulations and the resulting uncertainties. In this study, we present a methodological framework for the analysis of multi-model ensembles based on a functional data analysis approach. A set of two metrics that generalize the concept of similarity based on the behavior of entire simulated climatic time series, encompassing both past and future periods, is introduced. To our knowledge, our method is the first to quantitatively assess similarities between model simulations based on the temporal evolution of simulated values. To evaluate mutual distances of the time series, we used two semimetrics based on Euclidean distances between the simulated trajectories and based on differences in their first derivatives. Further, we introduce an innovative way of visualizing climate model similarities based on a network spatialization algorithm. Using the layout graphs, the data are ordered on a two-dimensional plane which enables an unambiguous interpretation of the results. The method is demonstrated using two illustrative cases of air temperature over the British Isles (BI) and precipitation in central Europe, simulated by an ensemble of EURO-CORDEX RCMs and their driving GCMs over the 1971–2098 period. In addition to the sample results, interpretational aspects of the applied methodology and its possible extensions are also discussed.


2016 ◽  
Vol 11 (1) ◽  
Author(s):  
Caroline Bayr ◽  
Heinz Gallaun ◽  
Ulrike Kleb ◽  
Birgit Kornberger ◽  
Martin Steinegger ◽  
...  

For detecting anomalies or interventions in the field of forest monitoring we propose an approach based on the spatial and temporal forecast of satellite time series data. For each pixel of the satellite image three different types of forecasts are provided, namely spatial, temporal and combined spatio-temporal forecast. Spatial forecast means that a clustering algorithm is used to group the time series data based on the features normalised difference vegetation index (NDVI) and the short-wave infrared band (SWIR). For estimation of the typical temporal trajectory of the NDVI and SWIR during the vegetation period of each spatial cluster, we apply several methods of functional data analysis including functional principal component analysis, and a novel form of random regression forests with online learning (streaming) capability. The temporal forecast is carried out by means of functional time series analysis and an autoregressive integrated moving average model. The combination of the temporal forecasts, which is based on the past of the considered pixel, and spatial forecasts, which is based on highly correlated pixels within one cluster and their past, is performed by functional data analysis, and a variant of random regression forests adapted to online learning capabilities. For evaluation of the methods, the approaches are applied to a study area in Germany for monitoring forest damages caused by wind-storm, and to a study area in Spain for monitoring forest fires.


2012 ◽  
Vol 2012 ◽  
pp. 1-30 ◽  
Author(s):  
Piotr Kokoszka

This paper reviews recent research on dependent functional data. After providing an introduction to functional data analysis, we focus on two types of dependent functional data structures: time series of curves and spatially distributed curves. We review statistical models, inferential methodology, and possible extensions. The paper is intended to provide a concise introduction to the subject with plentiful references.


Sign in / Sign up

Export Citation Format

Share Document