Ensemble Methods and Rule Generation

Author(s):  
Yongdai Kim ◽  
Jinseog Kim ◽  
Jongwoo Jeon
Author(s):  
Tengyue Li ◽  
Simon Fong

To compare with two datasets based on attributes by using classification algorithms, for the attributes, the authors need to select them by rules and the system is known as rule-based reasoning system which classifies a given test instance into a particular outcome from the learned rules. The test instance carries multiple attributes, which are usually the values of diagnostic tests. In this article, the authors propose a classifier ensemble-based method for comparison of two breast cancer datasets. The ensemble data mining learning methods are applied to rule generation, and a multi-criterion evaluation approach is used for selecting reliable rules over the results of the ensemble methods. The efficacy of the proposed methodology is illustrated via an example of two breast cancer datasets. This article introduces a novel fuzzy rule-based classification method called FURIA, to obtain a relationship between two breast cancer datasets. Hence, it can find the similarity between these two datasets. The new method is compared vis-à-vis with other classical statistical approaches such as correlation and mutual information gain.


2009 ◽  
Vol 20 (10) ◽  
pp. 2655-2666 ◽  
Author(s):  
Dong LIU ◽  
Xiang-Wu MENG ◽  
Jun-Liang CHEN ◽  
Ya-Mei XIA

2021 ◽  
Vol 11 (8) ◽  
pp. 3347
Author(s):  
Siqi Ma ◽  
Xin Wang ◽  
Xiaochen Wang ◽  
Hanyu Liu ◽  
Runtong Zhang

Although urban rail transit provides significant daily assistance to users, traffic risk remains. Turn-back faults are a common cause of traffic accidents. To address turn-back faults, machines are able to learn the complicated and detailed rules of the train’s internal communication codes, and engineers must understand simple external features for quick judgment. Focusing on turn-back faults in urban rail, in this study we took advantage of related accumulated data to improve algorithmic and human diagnosis of this kind of fault. In detail, we first designed a novel framework combining rules and algorithms to help humans and machines understand the fault characteristics and collaborate in fault diagnosis, including determining the category to which the turn-back fault belongs, and identifying the simple and complicated judgment rules involved. Then, we established a dataset including tabular and text data for real application scenarios and carried out corresponding analysis of fault rule generation, diagnostic classification, and topic modeling. Finally, we present the fault characteristics under the proposed framework. Qualitative and quantitative experiments were performed to evaluate the proposed method, and the experimental results show that (1) the framework is helpful in understanding the faults of trains that occur in three types of turn-back: automatic turn-back (ATB), automatic end change (AEC), and point mode end change (PEC); (2) our proposed framework can assist in diagnosing turn-back faults.


2021 ◽  
Vol 13 (2) ◽  
pp. 238
Author(s):  
Zhice Fang ◽  
Yi Wang ◽  
Gonghao Duan ◽  
Ling Peng

This study presents a new ensemble framework to predict landslide susceptibility by integrating decision trees (DTs) with the rotation forest (RF) ensemble technique. The proposed framework mainly includes four steps. First, training and validation sets are randomly selected according to historical landslide locations. Then, landslide conditioning factors are selected and screened by the gain ratio method. Next, several training subsets are produced from the training set and a series of trained DTs are obtained by using a DT as a base classifier couple with different training subsets. Finally, the resultant landslide susceptibility map is produced by combining all the DT classification results using the RF ensemble technique. Experimental results demonstrate that the performance of all the DTs can be effectively improved by integrating them with the RF ensemble technique. Specifically, the proposed ensemble methods achieved the predictive values of 0.012–0.121 higher than the DTs in terms of area under the curve (AUC). Furthermore, the proposed ensemble methods are better than the most popular ensemble methods with the predictive values of 0.005–0.083 in terms of AUC. Therefore, the proposed ensemble framework is effective to further improve the spatial prediction of landslides.


Author(s):  
Hamid Reza Pourghasemi ◽  
Fatemeh Honarmandnejad ◽  
Mahrooz Rezaei ◽  
Mohammad Hassan Tarazkar ◽  
Nitheshnirmal Sadhasivam

Entropy ◽  
2021 ◽  
Vol 23 (2) ◽  
pp. 216 ◽  
Author(s):  
Jianjia Wang ◽  
Xichen Wu ◽  
Mingrui Li ◽  
Hui Wu ◽  
Edwin Hancock

This paper seeks to advance the state-of-the-art in analysing fMRI data to detect onset of Alzheimer’s disease and identify stages in the disease progression. We employ methods of network neuroscience to represent correlation across fMRI data arrays, and introduce novel techniques for network construction and analysis. In network construction, we vary thresholds in establishing BOLD time series correlation between nodes, yielding variations in topological and other network characteristics. For network analysis, we employ methods developed for modelling statistical ensembles of virtual particles in thermal systems. The microcanonical ensemble and the canonical ensemble are analogous to two different fMRI network representations. In the former case, there is zero variance in the number of edges in each network, while in the latter case the set of networks have a variance in the number of edges. Ensemble methods describe the macroscopic properties of a network by considering the underlying microscopic characterisations which are in turn closely related to the degree configuration and network entropy. When applied to fMRI data in populations of Alzheimer’s patients and controls, our methods demonstrated levels of sensitivity adequate for clinical purposes in both identifying brain regions undergoing pathological changes and in revealing the dynamics of such changes.


Sign in / Sign up

Export Citation Format

Share Document