An Explainable Bayesian Decision Tree Algorithm

Bayesian Decision Trees provide a probabilistic framework that reduces the instability of Decision Trees while maintaining their explainability. While Markov Chain Monte Carlo methods are typically used to construct Bayesian Decision Trees, here we provide a deterministic Bayesian Decision Tree algorithm that eliminates the sampling and does not require a pruning step. This algorithm generates the greedy-modal tree (GMT), which is applicable to both regression and classification problems. We tested the algorithm on various benchmark classification data sets and obtained accuracies similar to those of other known techniques. Furthermore, we show that we can statistically analyze how the GMT was derived from the data, and we demonstrate this analysis with a financial example. Notably, the GMT allows for a technique that provides simpler, explainable models, which is often a prerequisite for applications in finance or the medical industry.
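To make the idea of a deterministic, pruning-free Bayesian split concrete, the following is a minimal sketch and not the authors' GMT algorithm. It assumes binary labels, a Beta(1, 1) prior on each leaf's class probability, and equal prior weight on "split" versus "no split". The names `log_marginal` and `best_split` are hypothetical. A node is split only if the split's marginal likelihood exceeds that of leaving the node as a leaf, so growth stops without a separate pruning pass.

```python
from math import lgamma


def log_beta(a, b):
    # Logarithm of the Beta function B(a, b).
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def log_marginal(pos, neg, alpha=1.0, beta=1.0):
    # Log marginal likelihood of a label sequence containing `pos` ones
    # and `neg` zeros under a Beta(alpha, beta) prior (assumed prior).
    return log_beta(alpha + pos, beta + neg) - log_beta(alpha, beta)


def best_split(xs, ys, alpha=1.0, beta=1.0):
    # Return (threshold, log_evidence_gain) for the best split on one
    # feature, or (None, 0.0) when no split beats keeping a single leaf.
    pairs = sorted(zip(xs, ys))
    total_pos = sum(ys)
    total_neg = len(ys) - total_pos
    leaf = log_marginal(total_pos, total_neg, alpha, beta)
    best_thr, best_gain = None, 0.0
    left_pos = left_neg = 0
    for i in range(len(pairs) - 1):
        x, y = pairs[i]
        left_pos += y
        left_neg += 1 - y
        next_x = pairs[i + 1][0]
        if next_x == x:
            continue  # cannot place a threshold between equal values
        split = (log_marginal(left_pos, left_neg, alpha, beta)
                 + log_marginal(total_pos - left_pos,
                                total_neg - left_neg, alpha, beta))
        gain = split - leaf
        if gain > best_gain:
            best_thr, best_gain = (x + next_x) / 2, gain
    return best_thr, best_gain


# Cleanly separable labels: a split is preferred over a single leaf.
separable = best_split([1, 2, 3, 4, 5, 6, 7, 8], [0, 0, 0, 0, 1, 1, 1, 1])
# Pure node: every candidate split has lower evidence, so none is made.
pure = best_split([1, 2, 3, 4], [1, 1, 1, 1])
```

A fuller treatment would also place a prior over candidate split locations and tree depth; the sketch omits both for brevity.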

Booster in High Dimensional Data Classification using CNN and Decision Tree Algorithm

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b1031.0782s519 ◽

2019 ◽

Vol 8 (2S5) ◽

pp. 148-152

Keyword(s):

Decision Tree ◽

High Dimensional Data ◽

High Dimensional ◽

Decision Tree Algorithm ◽

Classification Problems ◽

Medical Field ◽

Sensitive Data ◽

Tree Algorithm ◽

Evaluation Measure ◽

The Stability

Classification problems involving high-dimensional data with a small number of observations are becoming more common, especially in microarray data. Prediction accuracy is essential when handling sensitive data, particularly in the medical field, and this in turn requires evaluating the stability of the selected features. This paper therefore proposes a new evaluation measure that incorporates both the stability of the selected feature subsets and the accuracy of the prediction. A Booster applied to the feature selection algorithm helps to achieve this. The proposed work handles both structured and unstructured data, using convolutional neural network based multimodal disease prediction and a decision tree algorithm, respectively. The algorithm is tested on a heart disease dataset retrieved from the UCI repository, and the analysis shows improved prediction accuracy.
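The abstract does not define its evaluation measure, so the sketch below only illustrates the general idea of scoring feature-selection stability alongside accuracy. It assumes stability is the mean pairwise Jaccard similarity of the feature subsets selected on different resamples, and that the two quantities are combined with a simple weight. The function names and the `weight` parameter are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def jaccard(a, b):
    # Jaccard similarity of two feature subsets.
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def selection_stability(subsets):
    # Mean pairwise Jaccard similarity across the subsets selected
    # on different resamples of the data.
    pairs = list(combinations(subsets, 2))
    if not pairs:
        return 1.0
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


def combined_score(accuracy, stability, weight=0.5):
    # Hypothetical weighted combination of accuracy and stability.
    return weight * accuracy + (1 - weight) * stability


# Feature subsets chosen on three resamples (illustrative names only).
runs = [{"age", "chol"}, {"age", "chol"}, {"age", "bp"}]
stability = selection_stability(runs)
score = combined_score(accuracy=0.9, stability=stability)
```

With few observations and many features, two runs can reach similar accuracy with very different feature subsets; a stability term penalizes that.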

A class skew-insensitive ACO-based decision tree algorithm for imbalanced data sets

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v21.i1.pp412-419 ◽

2021 ◽

Vol 21 (1) ◽

pp. 412

Author(s):

Muhamad Hasbullah Bin Mohd Razali ◽

Rizauddin Bin Saian ◽

Yap Bee Wah ◽

Ku Ruhana Ku-Mahamud

Keyword(s):

Decision Tree ◽

Statistical Significance ◽

Imbalanced Data ◽

Predictive Ability ◽

Significance Test ◽

Data Sets ◽

Decision Tree Algorithm ◽

Tree Algorithm ◽

Imbalanced Data Sets ◽

Class Skew

Ant-tree-miner (ATM) has an advantage over the conventional decision tree algorithm in terms of feature selection. However, real-world applications commonly involve the imbalanced class problem, where the classes have different importance. This condition prevents the entropy-based heuristic of the existing ATM algorithm from developing effective decision boundaries because of its bias towards the dominant class. Consequently, the induced decision trees are dominated by the majority class and lack predictive ability on the rare class. This study proposes an enhanced algorithm called hellinger-ant-tree-miner (HATM), inspired by the ant colony optimization (ACO) metaheuristic, for imbalanced learning using a decision tree classification algorithm. The proposed algorithm was compared to the existing ATM algorithm on nine (9) publicly available imbalanced data sets. A simulation study reveals the superiority of HATM when the sample size increases with a skewed class (Imbalanced Ratio < 50%). Experimental results demonstrate that the performance of the existing algorithm, measured by BACC, is improved by the class skew-insensitiveness of the Hellinger distance. The statistical significance test shows that HATM has a higher mean BACC score than ATM.
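The skew-insensitivity mentioned above can be seen in a small sketch of the Hellinger distance used as a binary-class split criterion. This is the standard formulation for a two-class split, not the HATM implementation; the function name and the `(n_pos, n_neg)` input layout are assumptions made for illustration. Each class's counts are normalized by that class's own total, so changing the class ratio alone leaves the value unchanged.

```python
from math import sqrt


def hellinger_split_value(branches):
    # `branches` is a list of (n_pos, n_neg) counts, one pair per branch
    # of a candidate split. Each class is normalized by its own total,
    # which is what makes the value insensitive to class skew.
    total_pos = sum(p for p, _ in branches)
    total_neg = sum(n for _, n in branches)
    if total_pos == 0 or total_neg == 0:
        return 0.0  # only one class present, nothing to separate
    return sqrt(sum((sqrt(p / total_pos) - sqrt(n / total_neg)) ** 2
                    for p, n in branches))


# A split that separates the classes perfectly scores sqrt(2).
perfect = hellinger_split_value([(10, 0), (0, 10)])
# A split that leaves both classes evenly mixed scores 0.
useless = hellinger_split_value([(5, 5), (5, 5)])
# Multiplying the negative class by 10 does not change the score.
mild_skew = hellinger_split_value([(8, 20), (2, 80)])
heavy_skew = hellinger_split_value([(8, 200), (2, 800)])
```

An entropy-based criterion evaluated on the same two skewed examples would generally give different values, because it depends on the class proportions within each branch.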

Outsourcing Privacy Preserving ID3 Decision Tree Algorithm over Encrypted Data-sets for Two-Parties

2017 IEEE Trustcom/BigDataSE/ICESS ◽

10.1109/trustcom/bigdatase/icess.2017.354 ◽

2017 ◽

Cited By ~ 3

Author(s):

Ye Li ◽

Zoe L. Jiang ◽

Xuan Wang ◽

S.M. Yiu ◽

Peng Zhang

Keyword(s):

Decision Tree ◽

Privacy Preserving ◽

Data Sets ◽

Decision Tree Algorithm ◽

Tree Algorithm ◽

Encrypted Data

A new multi-decision based on Bayesian decision tree algorithm for ship recognition

Journal of Physics Conference Series ◽

10.1088/1742-6596/1802/3/032090 ◽

2021 ◽

Vol 1802 (3) ◽

pp. 032090

Author(s):

Qi Xia ◽

Yu Wang ◽

Jian Zhou ◽

Shengqing Pei ◽

Zhiqiang Geng ◽

...

Keyword(s):

Decision Tree ◽

Bayesian Decision ◽

Decision Tree Algorithm ◽

Tree Algorithm ◽

Ship Recognition

A Robust Decision Tree Algorithm for Imbalanced Data Sets

Proceedings of the 2010 SIAM International Conference on Data Mining ◽

10.1137/1.9781611972801.67 ◽

2010 ◽

Cited By ~ 56

Author(s):

Wei Liu ◽

Sanjay Chawla ◽

David A. Cieslak ◽

Nitesh V. Chawla

Keyword(s):

Decision Tree ◽

Imbalanced Data ◽

Data Sets ◽

Decision Tree Algorithm ◽

Tree Algorithm ◽

Imbalanced Data Sets

Innovations in Financial Management: Recursive Prediction Model Based on Decision Trees

Marketing and Management of Innovations ◽

10.21272/mmi.2020.3-20 ◽

2020 ◽

pp. 276-292

Author(s):

Ivana Podhorska ◽

Jaromir Vrbka ◽

George Lazaroiu ◽

Maria Kovacova

Keyword(s):

Decision Tree ◽

Emerging Markets ◽

Prediction Model ◽

Decision Trees ◽

Financial Distress ◽

Financial Management ◽

Test Sample ◽

Decision Tree Algorithm ◽

Tree Algorithm ◽

The Creation

Enterprise financial distress is a topical, interdisciplinary issue for the economic community. Bankruptcy is one of the major externalities of modern economies and cannot be avoided even with every effort. Where there are investment opportunities, there are individuals and businesses willing to assume financial obligations and the resulting risks in order to maintain and develop their standard of living or their economic activities. The decision tree algorithm is one of the most intuitive data mining methods that can be used for financial distress prediction. A systematization of literary sources and approaches shows that decision trees are part of the innovations in financial management. The main purpose of the research is to examine whether a decision tree algorithm can be applied to create a prediction model that can be used in economic practice. The paper's main aim is to create a comprehensive prediction model of enterprise financial distress based on decision trees under the conditions of emerging markets. The methods are based on decision trees, with emphasis on the CART algorithm. The emerging markets comprised 17 countries: Slovak Republic, Czech Republic, Poland, Hungary, Romania, Bulgaria, Lithuania, Latvia, Estonia, Slovenia, Croatia, Serbia, Russia, Ukraine, Belarus, Montenegro, and Macedonia. The data covered 2,359,731 enterprises from emerging markets (30% of the total amount), divided into prosperous enterprises (1,802,027) and non-prosperous enterprises (557,704), and were obtained from the Amadeus database. The input variables for the model were 24 financial indicators, 3 dummy variables, and the countries' GDP data for the years 2015 and 2016. For model creation, 80% of enterprises formed the training sample and 20% the test sample. The model correctly classified 93.2% of enterprises in both the training and the test sample. Correct classification of non-prosperous enterprises was 83.5% in both samples. The research yields a new model for the identification of bankrupt enterprises. The created prediction model can be considered sufficiently suitable for classifying enterprises in emerging markets.
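The reported figures allow a rough consistency check. The arithmetic below is a sketch that assumes the training and test samples have the same class mix as the full data set, which the abstract does not state. Under that assumption, the overall accuracy and the accuracy on non-prosperous enterprises together imply the accuracy on prosperous enterprises, which the abstract does not report. All variable names are illustrative.

```python
# Counts reported in the abstract.
prosperous = 1_802_027
non_prosperous = 557_704
total = prosperous + non_prosperous

# Rates reported in the abstract.
overall_accuracy = 0.932
non_prosperous_rate = 0.835

# Share of non-prosperous enterprises (the minority class).
minority_share = non_prosperous / total

# Correctly classified enterprises implied by the reported rates,
# ASSUMING the class mix of each sample matches the full data set.
correct_total = overall_accuracy * total
correct_non_prosperous = non_prosperous_rate * non_prosperous

# Implied rate for prosperous enterprises (not reported in the abstract).
implied_prosperous_rate = (correct_total - correct_non_prosperous) / prosperous

# Mean of the two per-class rates, a skew-robust summary.
mean_class_rate = (implied_prosperous_rate + non_prosperous_rate) / 2

print(f"minority share:          {minority_share:.1%}")
print(f"implied prosperous rate: {implied_prosperous_rate:.1%}")
print(f"mean per-class rate:     {mean_class_rate:.1%}")
```

Under this assumption the implied rate for prosperous enterprises is about 96.2%, so the headline 93.2% is weighted towards the majority class, which makes up roughly three quarters of the data.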

Application of Bayesian Decision Tree Algorithm in Breast Cancer Prediction

Journal of Vibration Testing and System Dynamics ◽

10.5890/jvtsd.2020.03.002 ◽

2020 ◽

Vol 4 (1) ◽

pp. 43-49

Author(s):

Yang Xiang ◽

Lin-Lu Dong ◽

Hong Song ◽

Kun-jian Yu

Keyword(s):

Breast Cancer ◽

Decision Tree ◽

Bayesian Decision ◽

Decision Tree Algorithm ◽

Tree Algorithm ◽

Cancer Prediction

Classification Based on Decision Tree Algorithm for Machine Learning

Journal of Applied Science and Technology Trends ◽

10.38094/jastt20165 ◽

2021 ◽

Vol 2 (01) ◽

pp. 20-28

Author(s):

Bahzad Charbuty ◽

Adnan Abdulazeez

Keyword(s):

Machine Learning ◽

Pattern Recognition ◽

Decision Tree ◽

Decision Trees ◽

Text Classification ◽

Decision Tree Algorithm ◽

Tree Algorithm ◽

Different Types ◽

Classification Images ◽

Disease Analysis

Decision tree classifiers are regarded as one of the best-known methods for representing classifiers in data classification. Researchers from various fields and backgrounds, such as machine learning, pattern recognition, and statistics, have considered the problem of extending a decision tree from available data. The use of decision tree classifiers has been proposed in many ways in fields such as medical disease analysis, text classification, user smartphone classification, images, and many more. This paper provides a detailed review of decision trees. Furthermore, the specifics of the reviewed papers, such as the algorithms or approaches used, the datasets, and the outcomes achieved, are evaluated and outlined comprehensively. In addition, all of the approaches analyzed are discussed to illustrate the themes of the authors and to identify the most accurate classifiers. As a result, the uses of different types of datasets are discussed and their findings are analyzed.

Distributed Communication Decision Tree Algorithm for Disseminated and Heterogeneous Environment

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.403-408.1002 ◽

2011 ◽

Vol 403-408 ◽

pp. 1002-1007

Author(s):

Chandra Chandra ◽

P. Ajitha

Keyword(s):

Decision Tree ◽

Decision Trees ◽

Communication Cost ◽

Classification Algorithms ◽

Heterogeneous Environment ◽

Decision Tree Algorithm ◽

Distributed Environment ◽

Tree Algorithm ◽

Current Classification ◽

Distributed Communication

Current classification algorithms require large amounts of data to be held in memory for long periods of time. Diverse classification techniques have already been proposed in the literature for both conventional and distributed environments. Mining decision trees in a distributed environment can handle large amounts of data, but at a high communication cost. A new distributed communication decision tree algorithm is proposed here that reduces the communication cost of transmitting data in a distributed and heterogeneous environment.

Artificial Intelligence and Improve the Accuracy of the Decision Tree Algorithm in Classification Problems

38. mednarodna konferenca o razvoju organizacijskih znanosti: Ekosistem organizacij v dobi digitalizacije: konferenčni zbornik ◽

10.18690/978-961-286-250-3.55 ◽

2019 ◽

Author(s):

Jasmina Đ. Novakovic ◽

Suzana Markovic

Keyword(s):

Artificial Intelligence ◽

Decision Tree ◽

Decision Tree Algorithm ◽

Classification Problems ◽

Tree Algorithm
