Automated Development of Clinical Strategies Using Multistage Decision Analysis

SummaryThe development of investigative strategies by decision analysis has been achieved by explicitly drawing the decision tree, either by hand or on computer. This paper discusses the feasibility of automatically generating and analysing decision trees from a description of the investigations and the treatment problem. The investigation of cholestatic jaundice is used to illustrate the technique.Methods to decrease the number of calculations required are presented. It is shown that this method makes practical the simultaneous study of at least half a dozen investigations. However, some new problems arise due to the possible complexity of the resulting optimal strategy. If protocol errors and delays due to testing are considered, simpler strategies become desirable. Generation and assessment of these simpler strategies are discussed with examples.

Download Full-text

Perbandingan Decision Tree J48, REPTREE, dan Random Tree dalam Menentukan Prediksi Produksi Minyak Kelapa Sawit Menggunakan Fuzzy Tsukamoto

Jurnal Teknologi Informasi dan Ilmu Komputer ◽

10.25126/jtiik.2021833108 ◽

2021 ◽

Vol 8 (3) ◽

pp. 473

Author(s):

Tundo Tundo ◽

Shofwatul 'Uyun

Keyword(s):

Decision Tree ◽

Decision Analysis ◽

Decision Trees ◽

Palm Oil ◽

Error Rate ◽

Oil Production ◽

Random Tree ◽

Actual Data ◽

Fuzzy Method ◽

Forecasting Error

<h2 align="center"> </h2>Penelitian ini menerangkan analisis decision tree J48, REPTree dan Random Tree dengan menggunakan metode fuzzy Tsukamoto dalam penentuan jumlah produksi minyak kelapa sawit di perusahaan PT Tapiana Nadenggan dengan tujuan untuk mengetahui decision tree mana yang hasilnya mendekati dari data sesungguhnya. Digunakannya decision tree J48, REPTree, dan Random Tree yaitu untuk mempercepat dalam pembuatan rule yang digunakan tanpa harus berkonsultasi dengan para pakar dalam menentukan rule yang digunakan. Berdasarkan data yang digunakan akurasi pembentukan rule dari decision tree J48 adalah 95,2381%, REPTree adalah 90,4762%, dan Random Tree adalah 95,2381%. Hasil dari penelitian yang telah dihitung bahwa metode fuzzy Tsukamoto dengan menggunakan REPTree mempunyai error Average Forecasting Error Rate (AFER) yang lebih kecil sebesar 23,17 % dibandingkan dengan menggunakan J48 sebesar 24,96 % dan Random Tree sebesar 36,51 % pada prediksi jumlah produksi minyak kelapa sawit. Oleh sebab itu ditemukan sebuah gagasan bahwa akurasi pohon keputusan yang terbentuk menggunakan tools WEKA tidak menjamin akurasi yang terbesar adalah yang terbaik, buktinya dari kasus ini REPTree memiliki akurasi rule paling kecil, akan tetapi hasil prediksi memiliki tingkat error paling kecil, dibandingkan dengan J48 dan Random Tree. Abstract<div>This study explains the J48, REPTree and Tree Random tree decision analysis using Tsukamoto's fuzzy method in determining the amount of palm oil production in PT Tapiana Nadenggan's company with the aim of finding out which decision tree results are close to the actual data. The decision tree J48, REPTree, and Random Tree is used to accelerate the making of rules that are used without having to consult with experts in determining the rules used. Based on the data used the accuracy of the rule formation of the J48 decision tree is 95.2381%, REPTree is 90.4762%, and the Random Tree is 95.2381%. The results of the study have calculated that the Tsukamoto fuzzy method using REPTree has a smaller Average Forecasting Error Rate (AFER) rate of 23.17% compared to using J48 of 24.96% and Tree Random of 36.51% in the prediction of the amount of palm oil production. Therefore an idea was found that the accuracy of decision trees formed using WEKA tools does not guarantee the greatest accuracy is the best, the proof of this case REPTree has the smallest rule accuracy, but the predicted results have the smallest error rate, compared to J48 and Tree Random.</div>

Download Full-text

Performance Improvement of Decision Tree: A Robust Classifier Using Tabu Search Algorithm

Applied Sciences ◽

10.3390/app11156728 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6728

Author(s):

Muhammad Asfand Hafeez ◽

Muhammad Rashid ◽

Hassan Tariq ◽

Zain Ul Abideen ◽

Saud S. Alotaibi ◽

...

Keyword(s):

Machine Learning ◽

Tabu Search ◽

Decision Tree ◽

Decision Trees ◽

Search Algorithm ◽

Learning Algorithms ◽

Performance Comparison ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Tabu Search Algorithm

Classification and regression are the major applications of machine learning algorithms which are widely used to solve problems in numerous domains of engineering and computer science. Different classifiers based on the optimization of the decision tree have been proposed, however, it is still evolving over time. This paper presents a novel and robust classifier based on a decision tree and tabu search algorithms, respectively. In the aim of improving performance, our proposed algorithm constructs multiple decision trees while employing a tabu search algorithm to consistently monitor the leaf and decision nodes in the corresponding decision trees. Additionally, the used tabu search algorithm is responsible to balance the entropy of the corresponding decision trees. For training the model, we used the clinical data of COVID-19 patients to predict whether a patient is suffering. The experimental results were obtained using our proposed classifier based on the built-in sci-kit learn library in Python. The extensive analysis for the performance comparison was presented using Big O and statistical analysis for conventional supervised machine learning algorithms. Moreover, the performance comparison to optimized state-of-the-art classifiers is also presented. The achieved accuracy of 98%, the required execution time of 55.6 ms and the area under receiver operating characteristic (AUROC) for proposed method of 0.95 reveals that the proposed classifier algorithm is convenient for large datasets.

Download Full-text

A Practical Tutorial for Decision Tree Induction

ACM Computing Surveys ◽

10.1145/3429739 ◽

2021 ◽

Vol 54 (1) ◽

pp. 1-38

Author(s):

Víctor Adrián Sosa Hernández ◽

Raúl Monroy ◽

Miguel Angel Medina-Pérez ◽

Octavio Loyola-González ◽

Francisco Herrera

Keyword(s):

Decision Tree ◽

Decision Trees ◽

Machine Learning Techniques ◽

Evaluation Measures ◽

Decision Tree Induction ◽

Learning Techniques ◽

Tree Models ◽

Evaluation Measure ◽

Main Components ◽

Support Decision Making

Experts from different domains have resorted to machine learning techniques to produce explainable models that support decision-making. Among existing techniques, decision trees have been useful in many application domains for classification. Decision trees can make decisions in a language that is closer to that of the experts. Many researchers have attempted to create better decision tree models by improving the components of the induction algorithm. One of the main components that have been studied and improved is the evaluation measure for candidate splits. In this article, we introduce a tutorial that explains decision tree induction. Then, we present an experimental framework to assess the performance of 21 evaluation measures that produce different C4.5 variants considering 110 databases, two performance measures, and 10× 10-fold cross-validation. Furthermore, we compare and rank the evaluation measures by using a Bayesian statistical analysis. From our experimental results, we present the first two performance rankings in the literature of C4.5 variants. Moreover, we organize the evaluation measures into two groups according to their performance. Finally, we introduce meta-models that automatically determine the group of evaluation measures to produce a C4.5 variant for a new database and some further opportunities for decision tree models.

Download Full-text

Evolutionary Algorithm for Improving Decision Tree with Global Discretization in Manufacturing

Sensors ◽

10.3390/s21082849 ◽

2021 ◽

Vol 21 (8) ◽

pp. 2849

Author(s):

Sungbum Jun

Keyword(s):

Decision Tree ◽

Evolutionary Algorithm ◽

Decision Trees ◽

Manufacturing Systems ◽

Ensemble Methods ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Industrial Internet ◽

Tree Models ◽

Real World Datasets

Due to the recent advance in the industrial Internet of Things (IoT) in manufacturing, the vast amount of data from sensors has triggered the need for leveraging such big data for fault detection. In particular, interpretable machine learning techniques, such as tree-based algorithms, have drawn attention to the need to implement reliable manufacturing systems, and identify the root causes of faults. However, despite the high interpretability of decision trees, tree-based models make a trade-off between accuracy and interpretability. In order to improve the tree’s performance while maintaining its interpretability, an evolutionary algorithm for discretization of multiple attributes, called Decision tree Improved by Multiple sPLits with Evolutionary algorithm for Discretization (DIMPLED), is proposed. The experimental results with two real-world datasets from sensors showed that the decision tree improved by DIMPLED outperformed the performances of single-decision-tree models (C4.5 and CART) that are widely used in practice, and it proved competitive compared to the ensemble methods, which have multiple decision trees. Even though the ensemble methods could produce slightly better performances, the proposed DIMPLED has a more interpretable structure, while maintaining an appropriate performance level.

Download Full-text

Induction of fuzzy decision trees and its refinement using gradient projected-neuro-fuzzy decision tree

International Journal of Advanced Intelligence Paradigms ◽

10.1504/ijaip.2014.066983 ◽

2014 ◽

Vol 6 (4) ◽

pp. 346 ◽

Cited By ~ 6

Author(s):

Swathi Jamjala Narayanan ◽

Rajen B. Bhatt ◽

Ilango Paramasivam ◽

M. Khalid ◽

B.K. Tripathy

Keyword(s):

Decision Tree ◽

Decision Trees ◽

Fuzzy Decision ◽

Fuzzy Decision Tree ◽

Neuro Fuzzy ◽

Fuzzy Decision Trees

Download Full-text

Forest Pruning Based on Branch Importance

Computational Intelligence and Neuroscience ◽

10.1155/2017/3162571 ◽

2017 ◽

Vol 2017 ◽

pp. 1-11 ◽

Cited By ~ 2

Author(s):

Xiangkui Jiang ◽

Chang-an Wu ◽

Huaping Guo

Keyword(s):

Decision Tree ◽

Decision Trees ◽

Selection Algorithm ◽

Ensemble Size ◽

Ensemble Pruning ◽

Generalization Ability ◽

Ensemble Selection ◽

Novel Strategy

A forest is an ensemble with decision trees as members. This paper proposes a novel strategy to pruning forest to enhance ensemble generalization ability and reduce ensemble size. Unlike conventional ensemble pruning approaches, the proposed method tries to evaluate the importance of branches of trees with respect to the whole ensemble using a novel proposed metric called importance gain. The importance of a branch is designed by considering ensemble accuracy and the diversity of ensemble members, and thus the metric reasonably evaluates how much improvement of the ensemble accuracy can be achieved when a branch is pruned. Our experiments show that the proposed method can significantly reduce ensemble size and improve ensemble accuracy, no matter whether ensembles are constructed by a certain algorithm such as bagging or obtained by an ensemble selection algorithm, no matter whether each decision tree is pruned or unpruned.

Download Full-text

Methods of decision analysis: Protocols, decision trees, and algorithms in medicine

World Journal of Surgery ◽

10.1007/bf01659028 ◽

1989 ◽

Vol 13 (3) ◽

pp. 240-244 ◽

Cited By ~ 4

Author(s):

J. M. Greep ◽

L. M. L. C. Siezenis

Keyword(s):

Decision Analysis ◽

Decision Trees

Download Full-text

Optimal Decision Trees on Simplicial Complexes

The Electronic Journal of Combinatorics ◽

10.37236/1900 ◽

2005 ◽

Vol 12 (1) ◽

Cited By ~ 1

Author(s):

Jakob Jonsson

Keyword(s):

Decision Tree ◽

Decision Trees ◽

Simplicial Complex ◽

Elementary Theory ◽

Simplicial Complexes ◽

Optimal Decision ◽

Property A ◽

Recursive Definition ◽

Topological Combinatorics ◽

Definition Of

We consider topological aspects of decision trees on simplicial complexes, concentrating on how to use decision trees as a tool in topological combinatorics. By Robin Forman's discrete Morse theory, the number of evasive faces of a given dimension $i$ with respect to a decision tree on a simplicial complex is greater than or equal to the $i$th reduced Betti number (over any field) of the complex. Under certain favorable circumstances, a simplicial complex admits an "optimal" decision tree such that equality holds for each $i$; we may hence read off the homology directly from the tree. We provide a recursive definition of the class of semi-nonevasive simplicial complexes with this property. A certain generalization turns out to yield the class of semi-collapsible simplicial complexes that admit an optimal discrete Morse function in the analogous sense. In addition, we develop some elementary theory about semi-nonevasive and semi-collapsible complexes. Finally, we provide explicit optimal decision trees for several well-known simplicial complexes.

Download Full-text

Analisis Rekam Medis Pasien Diabetes Mellitus Melalui Implementasi Teknik Data Mining di RSUP Dr. Sardjito Yogyakarta

Jurnal Kesehatan Vokasional ◽

10.22146/jkesvo.30331 ◽

2018 ◽

Vol 2 (2) ◽

pp. 167

Author(s):

Marko Ferdian Salim ◽

Sugeng Sugeng

Keyword(s):

Diabetes Mellitus ◽

Data Mining ◽

Decision Tree ◽

Decision Trees ◽

Cross Sectional ◽

Diabetes Melitus

Latar Belakang: Diabetes mellitus adalah penyakit kronis yang mempengaruhi beban ekonomi dan sosial secara luas. Data pasien dicatat melalui sistem rekam medis pasien yang tersimpan dalam database sistem informasi rumah sakit, data yang tercatat belum dianalisis secara efektif untuk menghasilkan informasi yang berharga. Teknik data mining bisa digunakan untuk menghasilkan informasi yang berharga tersebut.Tujuan: Mengidentifikasi karakteristik pasien Diabetes mellitus, kecenderungan dan tipe Diabetes melitus melalui penerapan teknik data mining di RSUP Dr. Sardjito Yogyakarta.Metode: Penelitian ini merupakan penelitian deskriptif observasional dengan rancangan cross sectional. Teknik pengumpulan data dilakukan secara retrospektif melalui observasi dan studi dokumentasi rekam medis elektronik di RSUP Dr. Sardjito Yogyakarta. Data yang terkumpul kemudian dilakukan analisis dengan menggunakan aplikasi Weka.Hasil: Pasien Diabetes mellitus di RSUP Dr. Sardjito tahun 2011-2016 berjumlah 1.554 orang dengan tren yang cenderung menurun. Pasien paling banyak berusia 56 - 63 tahun (27,86%). Kejadian Diabetes mellitus didominasi oleh Diabetes mellitus tipe 2 dengan komplikasi tertinggi adalah hipertensi, nefropati, dan neuropati. Dengan menggunakan teknik data mining dengan algoritma decision tree J48 (akurasi 88.42%) untuk analisis rekam medis pasien telah menghasilkan beberapa rule.Kesimpulan: Teknik klasifikasi data mining (akurasi 88.42%) dan decision trees telah berhasil mengidentifikasi karakteristik pasien dan menemukan beberapa rules yang dapat digunakan pihak rumah sakit dalam pengambilan keputusan mengenai penyakit Diabetes mellitus.

Download Full-text

LEARNING HYPERPLANES THAT CAPTURES THE GEOMETRIC STRUCTURE OF CLASS REGIONS

Graduate Research in Engineering and Technology ◽

10.47893/gret.2013.1003 ◽

2013 ◽

pp. 7-12

Author(s):

PRAMOD PATIL ◽

ALKA LONDHE ◽

PARAG KULKARNI

Keyword(s):

Decision Tree ◽

Decision Trees ◽

Geometric Structure ◽

Gini Index ◽

Decision Tree Algorithm ◽

Top Down ◽

Angle Bisector ◽

Eigen Value ◽

Tree Algorithms ◽

Left And Right

Most of the decision tree algorithms rely on impurity measures to evaluate the goodness of hyperplanes at each node while learning a decision tree in a top-down fashion. These impurity measures are not differentiable with relation to the hyperplane parameters. Therefore the algorithms for decision tree learning using impurity measures need to use some search techniques for finding the best hyperplane at every node. These impurity measures don’t properly capture the geometric structures of the data. In this paper a Two-Class algorithm for learning oblique decision trees is proposed. Aggravated by this, the algorithm uses a strategy, to evaluate the hyperplanes in such a way that the (linear) geometric structure in the data is taken into consideration. At each node of the decision tree, algorithm finds the clustering hyperplanes for both the classes. The clustering hyperplanes are obtained by solving the generalized Eigen-value problem. Then the data is splitted based on angle bisector and recursively learn the left and right sub-trees of the node. Since, in general, there will be two angle bisectors; one is selected which is better based on an impurity measure gini index. Thus the algorithm combines the ideas of linear tendencies in data and purity of nodes to find better decision trees. This idea leads to small decision trees and better performance.

Download Full-text