class variable
Recently Published Documents


TOTAL DOCUMENTS

38
(FIVE YEARS 15)

H-INDEX

6
(FIVE YEARS 3)

2022 ◽  
Vol 16 (4) ◽  
pp. 1-24
Author(s):  
Kui Yu ◽  
Yajing Yang ◽  
Wei Ding

Causal feature selection aims at learning the Markov blanket (MB) of a class variable for feature selection. The MB of a class variable implies the local causal structure among the class variable and its MB and all other features are probabilistically independent of the class variable conditioning on its MB, this enables causal feature selection to identify potential causal features for feature selection for building robust and physically meaningful prediction models. Missing data, ubiquitous in many real-world applications, remain an open research problem in causal feature selection due to its technical complexity. In this article, we discuss a novel multiple imputation MB (MimMB) framework for causal feature selection with missing data. MimMB integrates Data Imputation with MB Learning in a unified framework to enable the two key components to engage with each other. MB Learning enables Data Imputation in a potentially causal feature space for achieving accurate data imputation, while accurate Data Imputation helps MB Learning identify a reliable MB of the class variable in turn. Then, we further design an enhanced kNN estimator for imputing missing values and instantiate the MimMB. In our comprehensively experimental evaluation, our new approach can effectively learn the MB of a given variable in a Bayesian network and outperforms other rival algorithms using synthetic and real-world datasets.


Entropy ◽  
2021 ◽  
Vol 23 (12) ◽  
pp. 1703
Author(s):  
Shouta Sugahara ◽  
Maomi Ueno

Earlier studies have shown that classification accuracies of Bayesian networks (BNs) obtained by maximizing the conditional log likelihood (CLL) of a class variable, given the feature variables, were higher than those obtained by maximizing the marginal likelihood (ML). However, differences between the performances of the two scores in the earlier studies may be attributed to the fact that they used approximate learning algorithms, not exact ones. This paper compares the classification accuracies of BNs with approximate learning using CLL to those with exact learning using ML. The results demonstrate that the classification accuracies of BNs obtained by maximizing the ML are higher than those obtained by maximizing the CLL for large data. However, the results also demonstrate that the classification accuracies of exact learning BNs using the ML are much worse than those of other methods when the sample size is small and the class variable has numerous parents. To resolve the problem, we propose an exact learning augmented naive Bayes classifier (ANB), which ensures a class variable with no parents. The proposed method is guaranteed to asymptotically estimate the identical class posterior to that of the exactly learned BN. Comparison experiments demonstrated the superior performance of the proposed method.


Mathematics ◽  
2021 ◽  
Vol 9 (20) ◽  
pp. 2564
Author(s):  
Shenglei Chen ◽  
Zhonghui Zhang ◽  
Linyuan Liu

As an important improvement to naive Bayes, Tree-Augmented Naive Bayes (TAN) exhibits excellent classification performance and efficiency since it allows that every attribute depends on at most one other attribute in addition to the class variable. However, its performance might be lowered as some attributes might be redundant. In this paper, we propose an attribute Selective Tree-Augmented Naive Bayes (STAN) algorithm which builds a sequence of approximate models each involving only the top certain attributes and searches the model to minimize the cross validation risk. Five different approaches to ranking the attributes have been explored. As the models can be evaluated simultaneously in one pass learning through the data, it is efficient and can avoid local optima in the model space. The extensive experiments on 70 UCI data sets demonstrated that STAN achieves superior performance while maintaining the efficiency and simplicity.


2021 ◽  
Vol 03 (04) ◽  
pp. 1-11
Author(s):  
Jafar Muhammad Aref JARADAT ◽  
Fawqia Muhammad Aref JARADAT

This study aims at identifying the level of anxiety from online examination in the blended learning for middle stage studants in Palestine " Hitten primary school as a study case" . This sample consists of 272 students from this school chosen simply and randomly based on ascale of online examination anxiety prepared by Mr.Atiyya Abu Al-sheikh . Some significant results came out from this study: First, there is a big level of anxiety when sitting for online exams during the process of blended learning. Second, there is statistical significance when considering the class variable . Thirdly, it is noticed that there is also statistical significance when considering the device used. Some necessary recommendations are given and the most important one is to adopt a councelling supporting blended program to reduce the level of stress and anxiety.


2021 ◽  
Vol 73 (1) ◽  
pp. 114-119
Author(s):  
Z.G. Ualiyev ◽  
◽  
G. Ualiyev ◽  

This paper presents the model of a high class variable structure mechanism. These mechanisms have not gained widespread acceptance in practice, despite obvious improvements in the ability to transmit motion and power. The article presents a synthesized high class mechanism. The block diagram of the formation of the mechanism is shown, the results of synthesis are presented. It is shown that with a relatively compact kinematic scheme, it is possible, with one drive, to create a system with several working links providing various technological operations, that is, on the basis of one mechanism, it becomes possible to create automatic mechanisms. In these studies, the development of a methodology for the dynamic design of the variable structure mechanism based on the analysis of dynamics and assessment of the influence of nonlinear factors on the quality of the machine as a whole was carried out. To study the motion of such variable structure mechanism, mathematical modeling of dynamics with discontinuous coefficients and nonlinear external forces is used.


Author(s):  
Md Amir Khusru Akhtar ◽  
Mohit Kumar

Naive Bayes classifiers are a set of categorization techniques based on Bayes' theorem. It is a collection of algorithms where all these algorithms share a common principle. This chapter presents the detection of DDos attack using scoreboard dataset. The dataset is separated into two parts, that is, feature vector and the reaction vector. Feature vector contains all the rows of dataset in which each vector consists of the value of dependent features such as IP address, port, counter, flag, syncnt, no. of packets, etc. The reaction vector contains the value of class variable (prediction or output) for each row. Result shows the effectiveness of the model in preventing DDoS attack by classifying request.


2020 ◽  
pp. 001316442092512
Author(s):  
Yan Wang ◽  
Eunsook Kim ◽  
John M. Ferron ◽  
Robert F. Dedrick ◽  
Tony X. Tan ◽  
...  

Factor mixture modeling (FMM) has been increasingly used to investigate unobserved population heterogeneity. This study examined the issue of covariate effects with FMM in the context of measurement invariance testing. Specifically, the impact of excluding and misspecifying covariate effects on measurement invariance testing and class enumeration was investigated via Monte Carlo simulations. Data were generated based on FMM models with (1) a zero covariate effect, (2) a covariate effect on the latent class variable, and (3) covariate effects on both the latent class variable and the factor. For each population model, different analysis models that excluded or misspecified covariate effects were fitted. Results highlighted the importance of including proper covariates in measurement invariance testing and evidenced the utility of a model comparison approach in searching for the correct specification of covariate effects and the level of measurement invariance. This approach was demonstrated using an empirical data set. Implications for methodological and applied research are discussed.


2020 ◽  
Vol 34 (04) ◽  
pp. 4312-4319 ◽  
Author(s):  
Bin-Bin Jia ◽  
Min-Ling Zhang

Multi-dimensional classification (MDC) assumes heterogenous class spaces for each example, where class variables from different class spaces characterize semantics of the example along different dimensions. Due to the heterogeneity of class spaces, the major difficulty in designing margin-based MDC techniques lies in that the modeling outputs from different class spaces are not comparable to each other. In this paper, a first attempt towards maximum margin multi-dimensional classification is investigated. Following the one-vs-one decomposition within each class space, the resulting models are optimized by leveraging classification margin maximization on individual class variable and model relationship regularization across class variables. We derive convex formulation for the maximum margin MDC problem, which can be tackled with alternating optimization admitting QP or closed-form solution in either alternating step. Experimental studies over real-world MDC data sets clearly validate effectiveness of the proposed maximum margin MDC techniques.


2020 ◽  
Vol 33 (5) ◽  
pp. 1821-1844 ◽  
Author(s):  
Sanaa Hobeichi ◽  
Gab Abramowitz ◽  
Jason Evans

AbstractAccurate estimates of terrestrial water and energy cycle components are needed to better understand climate processes and improve models’ ability to simulate future change. Various observational estimates are available for the individual budget terms; however, these typically show inconsistencies when combined in a budget. In this work, a Conserving Land–Atmosphere Synthesis Suite (CLASS) of estimates of simultaneously balanced surface water and energy budget components is developed. Individual CLASS variable datasets, where possible, 1) combine a range of existing variable product estimates, and hence overcome the limitations of estimates from a single source; 2) are observationally constrained with in situ measurements; 3) have uncertainty estimates that are consistent with their agreement with in situ observations; and 4) are consistent with each other by being able to solve the water and energy budgets simultaneously. First, available datasets of a budget variable are merged by implementing a weighting method that accounts both for the ability of datasets to match in situ measurements and the error covariance between datasets. Then, the budget terms are adjusted by applying an objective variational data assimilation technique (DAT) that enforces the simultaneous closure of the surface water and energy budgets linked through the equivalence of evapotranspiration and latent heat. Comparing component estimates before and after applying the DAT against in situ measurements of energy fluxes and streamflow showed that modified estimates agree better with in situ observations across various metrics, but also revealed some inconsistencies between water budget terms in June over the higher latitudes. CLASS variable estimates are freely available via https://doi.org/10.25914/5c872258dc183.


Entropy ◽  
2020 ◽  
Vol 22 (1) ◽  
pp. 123 ◽  
Author(s):  
Ana D. Maldonado ◽  
María Morales ◽  
Pedro A. Aguilera ◽  
Antonio Salmerón

Socio-ecological systems are recognized as complex adaptive systems whose multiple interactions might change as a response to external or internal changes. Due to its complexity, the behavior of the system is often uncertain. Bayesian networks provide a sound approach for handling complex domains endowed with uncertainty. The aim of this paper is to analyze the impact of the Bayesian network structure on the uncertainty of the model, expressed as the Shannon entropy. In particular, three strategies for model structure have been followed: naive Bayes (NB), tree augmented network (TAN) and network with unrestricted structure (GSS). Using these network structures, two experiments are carried out: (1) the impact of the Bayesian network structure on the entropy of the model is assessed and (2) the entropy of the posterior distribution of the class variable obtained from the different structures is compared. The results show that GSS constantly outperforms both NB and TAN when it comes to evaluating the uncertainty of the entire model. On the other hand, NB and TAN yielded lower entropy values of the posterior distribution of the class variable, which makes them preferable when the goal is to carry out predictions.


Sign in / Sign up

Export Citation Format

Share Document