Regression trees with splitting based on changes of dependencies among covariates

2021 ◽  
Vol 25 (3) ◽  
pp. 687-710
Author(s):  
Mostafa Boskabadi ◽  
Mahdi Doostparast

Regression trees are powerful tools in data mining for analyzing data sets. Observations are usually divided into homogeneous groups, and then statistical models for responses are derived in the terminal nodes. This paper proposes a new approach for regression trees that considers the dependency structures among covariates for splitting the observations. The mathematical properties of the proposed method are discussed in detail. To assess the accuracy of the proposed model, various criteria are defined. The performance of the new approach is assessed by conducting a Monte-Carlo simulation study. Two real data sets on classification and regression problems are analyzed by using the obtained results.

Author(s):  
Fiaz Ahmad Bhatti ◽  
Gauss M. Cordeiro ◽  
Mustafa Ç. Korkmaz ◽  
G.G. Hamedani

We introduce a four-parameter lifetime model with flexible hazard rate called the Burr XII gamma (BXIIG) distribution.  We derive the BXIIG distribution from (i) the T-X family technique and (ii) nexus between the exponential and gamma variables. The failure rate function for the BXIIG distribution is flexible as it can accommodate various shapes such as increasing, decreasing, decreasing-increasing, increasing-decreasing-increasing, bathtub and modified bathtub.  Its density function can take shapes such as exponential, J, reverse-J, left-skewed, right-skewed and symmetrical. To illustrate the importance of the BXIIG distribution, we establish various mathematical properties such as random number generator, ordinary moments, generating function, conditional moments, density functions of record values, reliability measures and characterizations.  We address the maximum likelihood estimation for the parameters. We estimate the adequacy of the estimators via a simulation study. We consider applications to two real data sets to prove empirically the potentiality of the proposed model.


Author(s):  
Ehab Mohamed Almetwally ◽  
Ahmed Z. Afify ◽  
G. G. Hamedani

In this paper, we introduce a new there-parameter Rayleigh distribution, called the Marshall-Olkin alpha power Rayleigh (MOAPR) distribution. Some statistical properties of the MOAPR distribution are obtained. The proposed model is characterized based on truncated moments and reverse hazard function. The maximum likelihood and bootstrap estimation methods are considered to estimate the MOPAR parameters. A Monte Carlo simulation study is performed to compare the maximum likelihood and bootstrap estimation methods. Superiority of the MOAPR distribution over some well-known distributions is illustrated by means of two real data sets.


2019 ◽  
Vol XVI (2) ◽  
pp. 1-11
Author(s):  
Farrukh Jamal ◽  
Hesham Mohammed Reyad ◽  
Soha Othman Ahmed ◽  
Muhammad Akbar Ali Shah ◽  
Emrah Altun

A new three-parameter continuous model called the exponentiated half-logistic Lomax distribution is introduced in this paper. Basic mathematical properties for the proposed model were investigated which include raw and incomplete moments, skewness, kurtosis, generating functions, Rényi entropy, Lorenz, Bonferroni and Zenga curves, probability weighted moment, stress strength model, order statistics, and record statistics. The model parameters were estimated by using the maximum likelihood criterion and the behaviours of these estimates were examined by conducting a simulation study. The applicability of the new model is illustrated by applying it on a real data set.


2020 ◽  
Vol 70 (4) ◽  
pp. 953-978
Author(s):  
Mustafa Ç. Korkmaz ◽  
G. G. Hamedani

AbstractThis paper proposes a new extended Lindley distribution, which has a more flexible density and hazard rate shapes than the Lindley and Power Lindley distributions, based on the mixture distribution structure in order to model with new distribution characteristics real data phenomena. Its some distributional properties such as the shapes, moments, quantile function, Bonferonni and Lorenz curves, mean deviations and order statistics have been obtained. Characterizations based on two truncated moments, conditional expectation as well as in terms of the hazard function are presented. Different estimation procedures have been employed to estimate the unknown parameters and their performances are compared via Monte Carlo simulations. The flexibility and importance of the proposed model are illustrated by two real data sets.


2017 ◽  
Vol 6 (3) ◽  
pp. 141 ◽  
Author(s):  
Thiago A. N. De Andrade ◽  
Luz Milena Zea Fernandez ◽  
Frank Gomes-Silva ◽  
Gauss M. Cordeiro

We study a three-parameter model named the gamma generalized Pareto distribution. This distribution extends the generalized Pareto model, which has many applications in areas such as insurance, reliability, finance and many others. We derive some of its characterizations and mathematical properties including explicit expressions for the density and quantile functions, ordinary and incomplete moments, mean deviations, Bonferroni and Lorenz curves, generating function, R\'enyi entropy and order statistics. We discuss the estimation of the model parameters by maximum likelihood. A small Monte Carlo simulation study and two applications to real data are presented. We hope that this distribution may be useful for modeling survival and reliability data.


2020 ◽  
Vol 9 (1) ◽  
pp. 61-81
Author(s):  
Lazhar BENKHELIFA

A new lifetime model, with four positive parameters, called the Weibull Birnbaum-Saunders distribution is proposed. The proposed model extends the Birnbaum-Saunders distribution and provides great flexibility in modeling data in practice. Some mathematical properties of the new distribution are obtained including expansions for the cumulative and density functions, moments, generating function, mean deviations, order statistics and reliability. Estimation of the model parameters is carried out by the maximum likelihood estimation method. A simulation study is presented to show the performance of the maximum likelihood estimates of the model parameters. The flexibility of the new model is examined by applying it to two real data sets.


2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Sandeep Kumar Maurya ◽  
Sanjay K Singh ◽  
Umesh Singh

A one parameter right skewed, upside down bathtub type, heavy-tailed distribution is derived. Various statistical properties and maximum likelihood approaches for estimation purpose are studied. Five different real data sets with four different models are considered to illustrate the suitability of the proposed model.


Geophysics ◽  
2020 ◽  
Vol 85 (2) ◽  
pp. V223-V232 ◽  
Author(s):  
Zhicheng Geng ◽  
Xinming Wu ◽  
Sergey Fomel ◽  
Yangkang Chen

The seislet transform uses the wavelet-lifting scheme and local slopes to analyze the seismic data. In its definition, the designing of prediction operators specifically for seismic images and data is an important issue. We have developed a new formulation of the seislet transform based on the relative time (RT) attribute. This method uses the RT volume to construct multiscale prediction operators. With the new prediction operators, the seislet transform gets accelerated because distant traces get predicted directly. We apply our method to synthetic and real data to demonstrate that the new approach reduces computational cost and obtains excellent sparse representation on test data sets.


2020 ◽  
Vol 2020 ◽  
pp. 1-9
Author(s):  
Saima K. Khosa ◽  
Ahmed Z. Afify ◽  
Zubair Ahmad ◽  
Mi Zichuan ◽  
Saddam Hussain ◽  
...  

In this article, a new approach is used to introduce an additional parameter to a continuous class of distributions. The new class is referred to as a new extended-F family of distributions. The new extended-Weibull distribution, as a special submodel of this family, is discussed. General expressions for some mathematical properties of the proposed family are derived, and maximum likelihood estimators of the model parameters are obtained. Furthermore, a simulation study is provided to evaluate the validity of the maximum likelihood estimators. Finally, the flexibility of the proposed method is illustrated via two applications to real data, and the comparison is made with the Weibull and some of its well-known extensions such as Marshall–Olkin Weibull, alpha power-transformed Weibull, and Kumaraswamy Weibull distributions.


Author(s):  
Muhammad Mansoor ◽  
M. H. Tahir ◽  
Aymaan Alzaatreh ◽  
Gauss M. Cordeiro

A new three-parameter compounded extended-exponential distribution “Poisson Nadarajah–Haghighi” is introduced and studied, which is quite flexible and can be used effectively in modeling survival data. It can have increasing, decreasing, upside-down bathtub and bathtub-shaped failure rate. A comprehensive account of the mathematical properties of the model is presented. We discuss maximum likelihood estimation for complete and censored data. The suitability of the maximum likelihood method to estimate its parameters is assessed by a Monte Carlo simulation study. Four empirical illustrations of the new model are presented to real data and the results are quite satisfactory.


Sign in / Sign up

Export Citation Format

Share Document