A Functional Form Approach to Test Set Coverage in Tree Networks

1979 ◽  
Vol C-28 (1) ◽  
pp. 50-52 ◽  
Author(s):  
Agarwal ◽  
Masson
Author(s):  
Naila Alam ◽  
Muhammad Hanif

The Model assisted estimators are approximately design unbiased, consistent and provides robustness in the case of large sample sizes. The model assisted estimators result in reduction of the design variance if underlying model reasonably defines the regression relationship.  If the model is misspecified, then model assisted estimators might result in an increase of the design variance but remain approximately design unbiased and show robustness against model-misspecification. The well-known model assisted estimators, generalized regression estimators are members of a larger class of calibration estimators. Calibration method generates calibration weights that meet the calibration constraints and have minimum distance from the sampling design weights. By using different distance measures, classical calibration approach generates different calibration estimators but with asymptotically identical properties. The constraint of distance minimization was reduced for studying the properties of calibration estimators by proposing a simple functional form approach. The approach generates calibration weights that prove helpful to control the changes in calibration weights by using different choices of auxiliary variable’s functions.  This paper is an extended work on model assisted approach by using functional form of calibration weights. Some new model assisted estimators are considered to get efficient and stabilized regression weights by introducing a control matrix. The asymptotic un-biasedness of the proposed estimators is verified and the expressions for MSE are derived in three different cases.  A simulation study is done to compare and evaluate the efficiency of the proposed estimators with some existing model assisted estimators.


2019 ◽  
Vol 10 (11) ◽  
pp. 2020-2033
Author(s):  
Rubén Cabrera ◽  
Jhoana Díaz-Larrea ◽  
Schery Umanzor ◽  
Laura Georgina Núñez García

2018 ◽  
Vol 29 (2) ◽  
pp. 353-371 ◽  
Author(s):  
William A. Barnett ◽  
Neepa B. Gaekwad

1990 ◽  
Vol 29 (03) ◽  
pp. 167-181 ◽  
Author(s):  
G. Hripcsak

AbstractA connectionist model for decision support was constructed out of several back-propagation modules. Manifestations serve as input to the model; they may be real-valued, and the confidence in their measurement may be specified. The model produces as its output the posterior probability of disease. The model was trained on 1,000 cases taken from a simulated underlying population with three conditionally independent manifestations. The first manifestation had a linear relationship between value and posterior probability of disease, the second had a stepped relationship, and the third was normally distributed. An independent test set of 30,000 cases showed that the model was better able to estimate the posterior probability of disease (the standard deviation of residuals was 0.046, with a 95% confidence interval of 0.046-0.047) than a model constructed using logistic regression (with a standard deviation of residuals of 0.062, with a 95% confidence interval of 0.062-0.063). The model fitted the normal and stepped manifestations better than the linear one. It accommodated intermediate levels of confidence well.


Author(s):  
William Finnigan ◽  
Lorna J. Hepworth ◽  
Nicholas J. Turner ◽  
Sabine Flitsch

As the enzyme toolbox for biocatalysis has expanded, so has the potential for the construction of powerful enzymatic cascades for efficient and selective synthesis of target molecules. Additionally, recent advances in computer-aided synthesis planning (CASP) are revolutionizing synthesis design in both synthetic biology and organic chemistry. However, the potential for biocatalysis is not well captured by tools currently available in either field. Here we present RetroBioCat, an intuitive and accessible tool for computer-aided design of biocatalytic cascades, freely available at retrobiocat.com. Our approach uses a set of expertly encoded reaction rules encompassing the enzyme toolbox for biocatalysis, and a system for identifying literature precedent for enzymes with the correct substrate specificity where this is available. Applying these rules for automated biocatalytic retrosynthesis, we show our tool to be capable of identifying promising biocatalytic pathways to target molecules, validated using a test-set of recent cascades described in the literature.


2021 ◽  
Vol 12 (2) ◽  
Author(s):  
Mohammad Haekal ◽  
Henki Bayu Seta ◽  
Mayanda Mega Santoni

Untuk memprediksi kualitas air sungai Ciliwung, telah dilakukan pengolahan data-data hasil pemantauan secara Online Monitoring dengan menggunakan Metode Data Mining. Pada metode ini, pertama-tama data-data hasil pemantauan dibuat dalam bentuk tabel Microsoft Excel, kemudian diolah menjadi bentuk Pohon Keputusan yang disebut Algoritma Pohon Keputusan (Decision Tree) mengunakan aplikasi WEKA. Metode Pohon Keputusan dipilih karena lebih sederhana, mudah dipahami dan mempunyai tingkat akurasi yang sangat tinggi. Jumlah data hasil pemantauan kualitas air sungai Ciliwung yang diolah sebanyak 5.476 data. Hasil klarifikasi dengan Pohon Keputusan, dari 5.476 data ini diperoleh jumlah data yang mengindikasikan sungai Ciliwung Tidak Tercemar sebanyak 1.059 data atau sebesar 19,3242%, dan yang mengindikasikan Tercemar sebanyak 4.417 data atau 80,6758%. Selanjutnya data-data hasil pemantauan ini dievaluasi menggunakan 4 Opsi Tes (Test Option) yaitu dengan Use Training Set, Supplied Test Set, Cross-Validation folds 10, dan Percentage Split 66%. Hasil evaluasi dengan 4 opsi tes yang digunakan ini, semuanya menunjukkan tingkat akurasi yang sangat tinggi, yaitu diatas 99%. Dari data-data hasil peneltian ini dapat diprediksi bahwa sungai Ciliwung terindikasi sebagai sungai tercemar bila mereferensi kepada Peraturan Pemerintah Republik Indonesia nomor 82 tahun 2001 dan diketahui pula bahwa penggunaan aplikasi WEKA dengan Algoritma Pohon Keputusan untuk mengolah data-data hasil pemantauan dengan mengambil tiga parameter (pH, DO dan Nitrat) adalah sangat akuran dan tepat. Kata Kunci : Kualitas air sungai, Data Mining, Algoritma Pohon Keputusan, Aplikasi WEKA.


Sign in / Sign up

Export Citation Format

Share Document