scholarly journals Model Regresi Semiparametrik Spline Hasil Produksi Padi di Kabupaten Lombok Timur

2019 ◽  
Vol 1 (1) ◽  
pp. 11
Author(s):  
Bidayani Bidayani ◽  
Mustika Hadijati ◽  
Nurul Fitriyani

This study was conducted with the aim of determining the semiparametric spline regression model in the analysis of factors that influence rice production in East Lombok District in 2014 and finding out what factors influence the rice production results. The method used was semiparametric spline regression, with the selection of the optimum knot points using Generalized Cross Validation. The results obtained indicate that the variable that significantly affects rice production was the height of the area above sea level, with the determination coefficient value of 99.71% and the RMSEP value of 41.65.

Author(s):  
Syafruddin Side ◽  
Wahidah Sanusi ◽  
Mustati'atul Waidah Maksum

Abstrak. Regresi semiparametrik merupakan model regresi yang memuat komponen parametrik dan komponen nonparametrik dalam suatu model. Pada penelitian ini digunakan model regresi semiparametrik spline untuk data longitudinal dengan studi kasus penderita Demam Berdarah Dengue (DBD) di Rumah Sakit Universitas Hasanuddin Makassar periode bulan  Januari sampai bulan Maret 2018. Estimasi model regresi terbaik didapat dari pemilihan titik knot optimal dengan melihat nilai Generalized Cross Validation (GCV) dan Mean Square Error (MSE) yang minimum. Komponen parametrik pada penelitian ini adalah hemoglobin (g/dL) dan umur (tahun), suhu tubuh ( ), trombosit ( ) sebagai komponen nonparametrik dengan nilai GCV minimum sebesar 221,67745153 dicapai pada titik knot yaitu 14,552; 14,987; dan 15,096; nilai MSE sebesar 199,1032; dan nilai koefisien determinasi sebesar 75,3% yang diperoleh dari model regresi semiparametrik spline linear dengan tiga titik knot..Kata Kunci: regresi semiparametrik, spline, knot, Generalized Cross Validation, Demam Berdarah Dengue.Abstract. Semiparametric regression is a regression model that includes parametric and nonparametric components in it. The regression model in this research is spline semiparametric regression with case studies of patients with Dengue Hemorrahagic Fever (DHF) at University of Hasanuddin Makassar Hospital during the period of January to March 2018. The best regression model estimation is obtained from the selection of optimal knot which has minimum Generalized Cross Validation (GCV) and Mean Square Error (MSE). Parametric component in this research is hemoglobin (g/dL) and age (years), body temperature ( ), platelets ( ) as a nonparametric components. The minimum value of GCV is 221,67745153 achieved at the point 14,552; 14,987; and 15,096 knot; MSE value of 199,1032; and the value of coefficient determination is 75,3% obtained from semiparametric regression model linear spline with third point of knots.Keywords: semiparametric regression, spline, knot, Generalized Cross Validation, Dengue Hemorrahagic Fever.


1988 ◽  
Vol 110 (1) ◽  
pp. 37-41 ◽  
Author(s):  
C. R. Dohrmann ◽  
H. R. Busby ◽  
D. M. Trujillo

Smoothing and differentiation of noisy data using spline functions requires the selection of an unknown smoothing parameter. The method of generalized cross-validation provides an excellent estimate of the smoothing parameter from the data itself even when the amount of noise associated with the data is unknown. In the present model only a single smoothing parameter must be obtained, but in a more general context the number may be larger. In an earlier work, smoothing of the data was accomplished by solving a minimization problem using the technique of dynamic programming. This paper shows how the computations required by generalized cross-validation can be performed as a simple extension of the dynamic programming formulas. The results of numerical experiments are also included.


2011 ◽  
Vol 3 (1) ◽  
pp. 9
Author(s):  
Agustini Tripena Br. Sb.

This paper discusses aselection of smoothing parameters for the linier spline regression estimation on the data of electrical voltage differences in the wastewater. The selection methods are based on the mean square errorr (MSE) and generalized cross validation (GCV). The results show that in selection of smooting paranceus the mean square error (MSE) method gives smaller value , than that of the generalized cross validatio (GCV) method. It means that for our data case the errorr mean square (MSE) is the best selection method of smoothing parameter for the linear spline regression estimation.


2018 ◽  
Vol 7 (3) ◽  
pp. 259
Author(s):  
NI LUH SUKERNI ◽  
I KOMANG GDE SUKARSA ◽  
NI LUH PUTU SUCIPTAWATI

The study is aimed to estimate the best spline regression model for toddler’s weight growth patterns. Spline is one of the nonparametric regression estimation method which has a high flexibility and is able to handle data that change in particular subintervals so thus resulting in model which fitted the data. This study uses data of toddler’s weight growth at Posyandu Mekar Sari, Desa Suwug, Kabupaten Buleleng. The best spline regression model is chosen based on the minimum Generalized Cross Validation (GCV) value. The study shows that the best spline regression model for the data is quadratic spline regression model with six optimal knot points. The minimum GCV value is 0,900683471925 with the determination coefficient  equals to 0,954609.


2010 ◽  
Vol 25 (1) ◽  
pp. 27-47 ◽  
Author(s):  
Zoran Bosnić ◽  
Igor Kononenko

AbstractIn machine learning and its risk-sensitive applications (e.g. medicine, engineering, business), the reliability estimates for individual predictions provide more information about the individual prediction error (the difference between the true label and regression prediction) than the average accuracy of predictive model (e.g. relative mean squared error). Furthermore, they enable the users to distinguish between more and less reliable predictions. The empirical evaluations of the existing individual reliability estimates revealed that the successful estimates’ performance depends on the used regression model and on the particular problem domain. In the current paper, we focus on that problem as such and propose and empirically evaluate two approaches for automatic selection of the most appropriate estimate for a given domain and regression model: the internal cross-validation approach and the meta-learning approach. The testing results of both approaches demonstrated an advantage in the performance of dynamically chosen reliability estimates to the performance of the individual reliability estimates. The best results were achieved using the internal cross-validation procedure, where reliability estimates significantly positively correlated with the prediction error in 73% of experiments. In addition, the preliminary testing of the proposed methodology on a medical domain demonstrated the potential for its usage in practice.


2018 ◽  
Vol 7 (3) ◽  
pp. 211 ◽  
Author(s):  
NI PUTU RINA ANGGRENI ◽  
NI LUH PUTU SUCIPTAWATI ◽  
I GUSTI AYU MADE SRINADI

Tuberculosis is a contagious disease caused by Mycobacterium tuberculosis. Based on data from the health office of Bali Province, in 2015 tuberculosis cases found 0,96%, while in 2016 tuberculosis cases increase to 1,05%. This research used truncated spline nonparametric regression to model tuberculosis cases in Bali Province in 2016. This method was used because truncated spline has high flexibility compared to other polynomial models. The truncated spline function has a connecting point called knots. The best estimation of truncated spline regression model is obtained from optimal knot point selection by calculating minimum generalized cross validation. The estimated truncated model is linear with one knot point with determination coefficient equals to 70,48 %. In addition, it is also found in order to reduce tuberculosis cases the government of Bali Province should increase percentage of family who lives clean and healthy.


Author(s):  
Carlos Alberto Huaira Contreras ◽  
Carlos Cristiano Hasenclever Borges ◽  
Camila Borelli Zeller ◽  
Amanda Romanelli

The paper proposes a weighted cross-validation (WCV) algorithm  to select a linear regression model with change-point under a scale mixtures of normal (SMN) distribution that yields the best prediction results. SMN distributions are used to construct robust regression models to the influence of outliers on the parameter estimation process. Thus, we relaxed the usual assumption of normality of the regression models and considered that the random errors follow a SMN distribution, specifically the Student-t distribution. In addition, we consider the fact that the parameters of the regression model can change from a specific and unknown point, called change-point. In this context, the estimations of the model parameters, which include the change-point, are obtained via the EM-type algorithm (Expectation-Maximization). The WCV method is used in the selection of the model that presents greater robustness and that offers a smaller prediction error, considering that the weighting values come from step E of the EM-type algorithm. Finally, numerical examples considering simulated and real data (data from television audiences) are presented to illustrate the proposed methodology.


Sign in / Sign up

Export Citation Format

Share Document