Applying Task-Aggregating Wrapper to CUDA-Based Method of Query Selectivity Calculation Using Multidimensional Kernel Estimator

The Nadaraya-Watson kernel estimator is among the most popular nonparameteric regression technique thanks to its simplicity. Its asymptotic bias has been studied by Rosenblatt in 1969 and has been reported in several related literature. However, given its asymptotic nature, it gives no access to a hard bound. The increasing popularity of predictive tools for automated decision-making surges the need for hard (non-probabilistic) guarantees. To alleviate this issue, we propose an upper bound of the bias which holds for finite bandwidths using Lipschitz assumptions and mitigating some of the prerequisites of Rosenblatt’s analysis. Our bound has potential applications in fields like surgical robots or self-driving cars, where some hard guarantees on the prediction-error are needed.

Download Full-text

On the Convergence Rate for a Kernel Estimate of the Regression Function

International Journal of Statistics and Probability ◽

10.5539/ijsp.v5n2p29 ◽

2016 ◽

Vol 5 (2) ◽

pp. 29

Author(s):

Mounir ARFI

Keyword(s):

Convergence Rate ◽

Uniform Convergence ◽

Kernel Estimator ◽

Regression Function ◽

Kernel Estimate ◽

Compact Sets

We give the rate of the uniform convergence for the kernel estimate of the regression function over a sequence of compact sets which increases to $\mathbb{R}^{d}$ when $n$ approaches the infinity and when the observed process is $\varphi$-mixing. The used estimator for the regression function is the kernel estimator proposed by Nadaraya, Watson (1964).

Download Full-text

Predicting frequency distribution and influence of sociodemographic and behavioral risk factors of Schistosoma mansoni infection and analysis of co-infection with intestinal parasites

Geospatial health ◽

10.4081/gh.2015.303 ◽

2015 ◽

Vol 10 (1) ◽

Cited By ~ 5

Author(s):

Carla V.V. Rollemberg ◽

Marília M.B.L. Silva ◽

Karla C. Rollemberg ◽

Fábio R. Amorim ◽

Nayanna M.N. Lessa ◽

...

Keyword(s):

Schistosoma Mansoni ◽

Low Income ◽

Intestinal Parasites ◽

Kernel Estimation ◽

Kernel Estimator ◽

Integrated Approach ◽

Parasitic Infections ◽

Behavioral Risk ◽

Water Contact ◽

Untreated Water

Geospatial analysis was used to study the epidemiology of Schistosoma mansoni, intestinal parasites and co-infections in an area (Ilha das Flores) in Sergipe, Brazil. We collected individually georeferenced sociodemographic, behavioral and parasitological data from 500 subjects, analyzed them by conventional statistics, and produced risk maps by Kernel estimation. The prevalence rates found were: S. mansoni (24.0%), Trichuris trichiura (54.8%), Ascaris lumbricoides (49.2%), Hookworm (17.6%) and Entamoeba histolytica (7.0%). Only 59/500 (11.8%) individuals did not present any of these infections, whereas 279/500 (55.8%) were simultaneously infected by three or more parasites. We observed associations between S. mansoni infection and various variables such as male gender, being rice farmer or fisherman, low educational level, low income, water contact and drinking untreated water. The Kernel estimator indicated that high-risk areas coincide with the poorest regions of the villages as well as with the part of the villages without an adequate sewage system. We also noted associations between both A. lumbricoides and hookworm infections with low education and low income. A. lumbricoides infection and T. trichiura infection were both associated with drinking untreated water and residential open-air sewage. These findings call for an integrated approach to effectively control multiple parasitic infections.

Download Full-text

A Kernel Estimator for Stochastic Subsurface Characterization

Ground Water ◽

10.1111/j.1745-6584.1996.tb02052.x ◽

1996 ◽

Vol 34 (4) ◽

pp. 647-658 ◽

Cited By ~ 2

Author(s):

A. I. All ◽

U. Lall

Keyword(s):

Kernel Estimator ◽

Subsurface Characterization

Download Full-text

Spatial distribution of schistosomiasis and geohelminthiasis cases in the rural areas of Pernambuco, Brazil

Revista da Sociedade Brasileira de Medicina Tropical ◽

10.1590/s0037-86822012000500017 ◽

2012 ◽

Vol 45 (5) ◽

pp. 633-638 ◽

Cited By ~ 5

Author(s):

Verônica Santos Barbosa ◽

Karina Conceição Araújo ◽

Onicio Batista Leal Neto ◽

Constança Simões Barbosa

Keyword(s):

Spatial Distribution ◽

Rural Areas ◽

Public Health Problem ◽

Secondary Data ◽

Kernel Estimator ◽

Distribution Patterns ◽

Number Of Children ◽

Density Maps ◽

Significant Public Health Problem ◽

Significant Public Health

INTRODUCTION: The prevalence and intensity of geohelminth infections and schistosomiasis remain high in the rural areas of Zona da Mata, Pernambuco (ZMP), Brazil, where these parasites still represent a significant public health problem. The present study aimed to spatially assess the occurrences of schistosomiasis and geohelminthiasis in the ZMP. METHODS: The ZMP has a population of 1,132,544 inhabitants, formed by 43 municipalities. An ecological study was conducted, using secondary data relating to positive human cases and parasite loads of schistosomiasis and positive human cases of geohelminthiasis that were worked up in Excel 2007. We used the coordinates of the municipal headquarters to represent the cities which served as the unit of analysis of this study. The Kernel estimator was used to spatially analyze the data and identify distribution patterns and case densities, with analysis done in ArcGIS software. RESULTS: Spatial analysis from the Kernel intensity estimator made it possible to construct density maps showing that the northern ZMP was the region with the greatest number of children infected with parasites and the populations most intensely infected by Schistosoma mansoni. In relation to geohelminths, there was higher spatial distribution of cases of Ascaris lumbricoides and Trichuris trichiura in the southern ZMP, and greater occurrence of hookworms in the northern/central ZMP. CONCLUSIONS: Despite several surveys and studies showing occurrences of schistosomiasis and geohelminthiasis in the ZMP, no preventive measures that are known to have been effective in decreasing these health hazards have yet been implemented in the endemic area.

Download Full-text

Estimating a Distribution Function at the Boundary

Austrian Journal of Statistics ◽

10.17713/ajs.v49i1.801 ◽

2020 ◽

Vol 49 (1) ◽

pp. 1-23

Author(s):

Shunpu Zhang ◽

Zhong Li ◽

Zhiying Zhang

Keyword(s):

Distribution Function ◽

Kernel Estimation ◽

Kernel Estimator ◽

Distribution Functions ◽

Practical Application ◽

Real World Applications ◽

Distribution Kernel ◽

Estimation Of Distribution ◽

Simulation Results ◽

Boundary Correction

Estimation of distribution functions has many real-world applications. We study kernel estimation of a distribution function when the density function has compact support. We show that, for densities taking value zero at the endpoints of the support, the kernel distribution estimator does not need boundary correction. Otherwise, boundary correction is necessary. In this paper, we propose a boundary distribution kernel estimator which is free of boundary problem and provides non-negative and non-decreasing distribution estimates between zero and one. Extensive simulation results show that boundary distribution kernel estimator provides better distribution estimates than the existing boundary correction methods. For practical application of the proposed methods, a data-dependent method for choosing the bandwidth is also proposed.

Download Full-text

Discriminating between Long Memory and Volatility Shifts

Austrian Journal of Statistics ◽

10.17713/ajs.v36i4.338 ◽

2016 ◽

Vol 36 (4) ◽

Cited By ~ 8

Author(s):

Aonan Zhang ◽

Robertas Gabrys ◽

Piotr Kokoszka

Keyword(s):

Long Memory ◽

Change Point ◽

Kernel Estimator ◽

Practical Implementation ◽

Point Model ◽

Variance Estimators ◽

Long Run ◽

The Us ◽

Change Point Model ◽

Stock Indexes

We develop a practical implementation of the test proposed in Berkes, Horv´ath, Kokoszka, and Shao (2006) designed to distinguish between a change-point model and a long memory model. Our implementation is calibrated to distinguish between a shift in volatility of returns and long memory in squared returns. It uses a kernel estimator of the long-run variance of squared returns with the maximal lag selected by a data driven procedure which depends on the sample size, the location of the estimated change point and the direction of the apparent volatility shift (increase versus decrease). In a simulations study, we also consider other long-run variance estimators, including the VARHAC estimator, but we find that they lead to tests with inferior performance. Applied to returns on indexes and individual stocks, our test indicates that even for the same asset, a change-point model may be preferable for a certain period of time, whereas there is evidence of long memory in another period of time. Generally there is stronger evidence for long memory in the eight years ending June 2006 than in the eight years starting January 1992. This pattern is most pronounced for US stock indexes and shares in the US financial sector.

Download Full-text

On the Adaptive Nadaraya-Watson Kernel Estimator for the Discontinuity in the Presence of Jump Size

Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi ◽

10.19113/sdufbed.70996 ◽

2018 ◽

Vol 22 (2) ◽

pp. 511 ◽

Cited By ~ 2

Author(s):

Muhammad HANIF ◽

Shabnam SHAHZADI ◽

Usman SHAHZAD ◽

Nursel KOYUNCU

Keyword(s):

Kernel Estimator ◽

Jump Size

Download Full-text