independence tests
Recently Published Documents


TOTAL DOCUMENTS

52
(FIVE YEARS 14)

H-INDEX

8
(FIVE YEARS 0)

Symmetry ◽  
2022 ◽  
Vol 14 (1) ◽  
pp. 149
Author(s):  
Waqar Khan ◽  
Lingfu Kong ◽  
Brekhna Brekhna ◽  
Ling Wang ◽  
Huigui Yan

Streaming feature selection has always been an excellent method for selecting the relevant subset of features from high-dimensional data and overcoming learning complexity. However, little attention is paid to online feature selection through the Markov Blanket (MB). Several studies based on traditional MB learning presented low prediction accuracy and used fewer datasets as the number of conditional independence tests is high and consumes more time. This paper presents a novel algorithm called Online Feature Selection Via Markov Blanket (OFSVMB) based on a statistical conditional independence test offering high accuracy and less computation time. It reduces the number of conditional independence tests and incorporates the online relevance and redundant analysis to check the relevancy between the upcoming feature and target variable T, discard the redundant features from Parents-Child (PC) and Spouses (SP) online, and find PC and SP simultaneously. The performance OFSVMB is compared with traditional MB learning algorithms including IAMB, STMB, HITON-MB, BAMB, and EEMB, and Streaming feature selection algorithms including OSFS, Alpha-investing, and SAOLA on 9 benchmark Bayesian Network (BN) datasets and 14 real-world datasets. For the performance evaluation, F1, precision, and recall measures are used with a significant level of 0.01 and 0.05 on benchmark BN and real-world datasets, including 12 classifiers keeping a significant level of 0.01. On benchmark BN datasets with 500 and 5000 sample sizes, OFSVMB achieved significant accuracy than IAMB, STMB, HITON-MB, BAMB, and EEMB in terms of F1, precision, recall, and running faster. It finds more accurate MB regardless of the size of the features set. In contrast, OFSVMB offers substantial improvements based on mean prediction accuracy regarding 12 classifiers with small and large sample sizes on real-world datasets than OSFS, Alpha-investing, and SAOLA but slower than OSFS, Alpha-investing, and SAOLA because these algorithms only find the PC set but not SP. Furthermore, the sensitivity analysis shows that OFSVMB is more accurate in selecting the optimal features.


Entropy ◽  
2021 ◽  
Vol 23 (11) ◽  
pp. 1450
Author(s):  
Ádám Zlatniczki ◽  
Marcell Stippinger ◽  
Zsigmond Benkő ◽  
Zoltán Somogyvári ◽  
András Telcs

This work is about observational causal discovery for deterministic and stochastic dynamic systems. We explore what additional knowledge can be gained by the usage of standard conditional independence tests and if the interacting systems are located in a geodesic space.


2021 ◽  
Vol 5 (1) ◽  
pp. 31
Author(s):  
Felix Laumann ◽  
Julius von Kügelgen ◽  
Mauricio Barahona

Two-sample and independence tests with the kernel-based mmd and hsic have shown remarkable results on i.i.d. data and stationary random processes. However, these statistics are not directly applicable to nonstationary random processes, a prevalent form of data in many scientific disciplines. In this work, we extend the application of mmd and hsic to nonstationary settings by assuming access to independent realisations of the underlying random process. These realisations—in the form of nonstationary time-series measured on the same temporal grid—can then be viewed as i.i.d. samples from a multivariate probability distribution, to which mmd and hsic can be applied. We further show how to choose suitable kernels over these high-dimensional spaces by maximising the estimated test power with respect to the kernel hyperparameters. In experiments on synthetic data, we demonstrate superior performance of our proposed approaches in terms of test power when compared to current state-of-the-art functional or multivariate two-sample and independence tests. Finally, we employ our methods on a real socioeconomic dataset as an example application.


2020 ◽  
Vol 15 (46) ◽  

The aim of this work was to compare students’ physical activity and sedentary levels between two high schools, one with a natural playground and the other with a traditional one, according to the recess periods (mid-morning and lunchtime) and the sex of students. The sample consisted of all the students attending their school playground at the time of the measurement. The System for Observing Play and Leisure Activity in Youth (SOPLAY) was used to determine students’ activity levels. A total of 36 scans were conducted and three categories of activity emerged: very active, walker and sedentary. To perform intra- and inter-recess comparisons, Chi-square independence tests were carried out. Results revealed that students were more active and less sedentary in a natural playground than in a traditional one. In addition, the natural playground encouraged the same levels of PA regardless of the recess period. Moreover, boys were more active than girls during both periods in the traditional playground and during lunchtime in the natural playground. In view of the results, further research is needed to understand whether natural playgrounds can contribute to increasing physical activity levels, especially in adolescence, when building active identities becomes more important.


Symmetry ◽  
2020 ◽  
Vol 12 (9) ◽  
pp. 1415
Author(s):  
Jesús E. García ◽  
Verónica A. González-López

In this paper, we show how the longest non-decreasing subsequence, identified in the graph of the paired marginal ranks of the observations, allows the construction of a statistic for the development of an independence test in bivariate vectors. The test works in the case of discrete and continuous data. Since the present procedure does not require the continuity of the variables, it expands the proposal introduced in Independence tests for continuous random variables based on the longest increasing subsequence (2014). We show the efficiency of the procedure in detecting dependence in real cases and through simulations.


Sign in / Sign up

Export Citation Format

Share Document