Imputing missing distances in molecular phylogenetics

Missing data are frequently encountered in molecular phylogenetics, but no accurate distance imputation method has been available for distance-based phylogenetic reconstruction. The general framework for distance imputation is to explore tree space and distance values to find an optimal combination of output tree and imputed distances. Here I develop a least-squares method coupled with multivariate optimization to impute multiple missing distances, either in a distance matrix or from a set of aligned sequences with missing genes, where some sequences share no homologous sites and their distances therefore need to be imputed. I show that phylogenetic trees can be inferred from distance matrices with about 10% of distances missing, and that the accuracy of the resulting tree is almost as good as that of the tree from full information. The new method has an advantage over a recently published one in that it does not assume a molecular clock and is more accurate (comparable to the maximum likelihood method, based on simulated sequences). I have implemented the function in the DAMBE software, which is freely available at http://dambe.bio.uottawa.ca.
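The least-squares idea in this abstract can be illustrated with a minimal Python sketch. It is not the DAMBE implementation: it assumes the tree topology is already fixed and known (the abstract's method also searches tree space), and the names `solve_linear`, `impute_on_fixed_tree`, `paths`, and `known` are hypothetical. Branch lengths are fitted by ordinary least squares to the known distances, and each missing distance is then imputed as the sum of the fitted branch lengths along its path.

```python
def solve_linear(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("known distances do not determine all branch lengths")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x


def impute_on_fixed_tree(paths, known):
    """Impute missing pairwise distances on a fixed topology (assumption).

    paths: maps every taxon pair to the set of branches on its path.
    known: maps the observed pairs to their distances.
    """
    branches = sorted({b for path in paths.values() for b in path})
    pairs = list(known)
    # Design matrix: one row per known distance, one column per branch.
    rows = [[1.0 if b in paths[p] else 0.0 for b in branches] for p in pairs]
    y = [known[p] for p in pairs]
    k = len(branches)
    # Normal equations (A^T A) x = A^T y of the least-squares problem.
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    aty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    lengths = dict(zip(branches, solve_linear(ata, aty)))
    return {p: sum(lengths[b] for b in paths[p]) for p in paths if p not in known}


# Four taxa on the unrooted tree ((A,B),(C,D)); "e" is the internal branch.
paths = {
    ("A", "B"): {"a", "b"},
    ("A", "C"): {"a", "e", "c"},
    ("A", "D"): {"a", "e", "d"},
    ("B", "C"): {"b", "e", "c"},
    ("B", "D"): {"b", "e", "d"},
    ("C", "D"): {"c", "d"},
}
# Distances generated from a=0.1, b=0.2, c=0.3, d=0.4, e=0.05; A-D is missing.
known = {("A", "B"): 0.30, ("A", "C"): 0.45, ("B", "C"): 0.55,
         ("B", "D"): 0.65, ("C", "D"): 0.70}
imputed = impute_on_fixed_tree(paths, known)
print(imputed)
```

With four taxa and one missing distance the fit is exact; with more taxa the system is usually overdetermined and the same normal equations give the least-squares compromise.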


A New Distribution for Modeling Lifetime Data with Different Methods of Estimation and Censored Regression Modeling

Statistics Optimization & Information Computing ◽

10.19139/soic-2310-5070-678 ◽

2020 ◽

Vol 8 (2) ◽

pp. 610-630 ◽

Cited By ~ 2

Author(s):

Mohamed Ibrahim ◽

Emrah Altun ◽

Haitham M. Yousof

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Method ◽

Estimation Method ◽

Least Square Method ◽

Least Square ◽

Likelihood Method ◽

Survival Times ◽

New Model ◽

Von Mises ◽

Bootstrapping Method

In this paper, after introducing a new model along with its properties, we estimate the unknown parameter of the new model using the maximum likelihood method, the Cramér-von Mises method, the bootstrapping method, the least square method and the weighted least square method. We assess the performance of all estimation methods using simulations. All methods perform well, but the bootstrapping method is the best in modeling relief times, whereas the maximum likelihood method is the best in modeling survival times. Censored data modeling with covariates is addressed, along with the index plot of the modified deviance residuals and its Q-Q plot.


Identification of the characteristic parameters for gas pipe sections using the maximum likelihood method and least square method

Engineering Reports ◽

10.1002/eng2.12471 ◽

2021 ◽

Author(s):

Huiyu Chen ◽

Da Qi ◽

Hui Wang ◽

Qiang Zhang ◽

Yaran Bu ◽

...

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Method ◽

Least Square Method ◽

Least Square ◽

Likelihood Method ◽

Characteristic Parameters


Stress-Strength Reliability for P(T

Al-Qadisiyah Journal Of Pure Science ◽

10.29350/qjps.2021.26.2.1259 ◽

2021 ◽

Vol 26 (2) ◽

Author(s):

Ali Mutair ◽

Nada Sabah Karam

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Method ◽

Pareto Distribution ◽

Least Square Method ◽

Least Square ◽

Likelihood Method ◽

Method Of Moment ◽

Important Conclusion ◽

Large Samples ◽

The Mean

In this paper, the reliability formula of the stress-strength model is derived for the probability that a component with strength X falls between two stresses T and Z, based on the New Weibull-Pareto distribution with unknown parameter and known and common parameters and . Four methods for estimating the New Weibull-Pareto parameters are discussed: Maximum Likelihood, Method of Moment, Least Square Method and Weighted Least Square Method. The estimators are compared in a simulation study by the mean square error criterion for small, medium and large samples. The most important conclusion is that the maximum likelihood estimator performs best in all experiments studied.


Estimating parameters of linear regression with an exponential power distribution of errors by using a polynomial maximization method

Eastern-European Journal of Enterprise Technologies ◽

10.15587/1729-4061.2021.225525 ◽

2021 ◽

Vol 1 (4 (109)) ◽

pp. 64-73

Author(s):

Serhii Zabolotnii ◽

Vladyslav Khotunov ◽

Anatolii Chepynoha ◽

Olexandr Tkachenko

Keyword(s):

Maximum Likelihood ◽

Linear Regression ◽

Power Distribution ◽

Maximum Likelihood Method ◽

Least Square Method ◽

Least Square ◽

Likelihood Method ◽

Random Errors ◽

Exponential Power Distribution ◽

Exponential Power

This paper considers the application of the polynomial maximization method to estimating the parameters of a multifactorial linear regression when the random errors of the regression model follow an exponential power distribution. The method is conceptually close to the maximum likelihood method because it is based on maximizing selective statistics in the neighborhood of the true values of the estimated parameters. However, in contrast to the classical parametric approach, it employs a partial probabilistic description in the form of a limited number of higher-order statistics. An adaptive statistical estimation algorithm has been synthesized that takes into account the properties of the regression residuals and makes it possible to find refined parameter estimates of a linear multifactorial regression using the numerical Newton-Raphson iterative procedure. Based on the apparatus of the quantity of extracted information, analytical expressions have been derived that make it possible to analyze the theoretical accuracy (asymptotic variances) of the polynomial maximization estimates depending on the magnitude of the exponential power distribution parameters. Statistical modeling was used to compare the variance of estimates obtained by the polynomial maximization method with the accuracy of the classical methods: least squares and maximum likelihood. Regions of greatest efficiency for each studied method have been constructed, depending on the shape parameter of the exponential power distribution and the sample size. It has been shown that estimates from the polynomial maximization method may have a much lower variance than least-squares estimates and, in some cases (for flat-topped distributions and in the absence of a priori information), may exceed the maximum likelihood estimates in accuracy.


Accelerated Lifetime Data Analysis with a Nonconstant Shape Parameter

Mathematical Problems in Engineering ◽

10.1155/2015/801465 ◽

2015 ◽

Vol 2015 ◽

pp. 1-8 ◽

Cited By ~ 3

Author(s):

Guodong Wang ◽

Zhanwen Niu ◽

Zhen He

Keyword(s):

Shape Parameter ◽

High Reliability ◽

Least Square Method ◽

Least Square ◽

Accelerated Life Test ◽

Stress Factors ◽

Likelihood Method ◽

Accelerated Life ◽

Lifetime Data Analysis ◽

Accelerated Lifetime

Accelerated life testing is commonly used in reliability estimation for high-reliability products. In this paper, we present a simple and efficient approach to estimating the coefficients of acceleration models. Assuming that both the scale and shape parameters of the Weibull lifetime distribution vary with stress factors, we estimate the parameters of the Weibull distribution using the maximum likelihood method and reduce the bias of the shape parameter estimator. To account for heteroscedasticity, we compute the estimates of the coefficients of the acceleration models by the weighted least square method. Additionally, we obtain the confidence interval of a low percentile via bootstrapping. We compare the proposed method with other methods using a real lifetime example. Finally, we study the performance of the proposed method by simulation. The simulation results show that the proposed method is effective.
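The maximum likelihood step mentioned in this abstract can be sketched for a single stress level. This is a minimal illustration under stated assumptions, not the authors' procedure: it covers only complete (uncensored) data at one stress level, omits the bias reduction and the weighted least-squares stage, and the function name `weibull_mle` and the bracketing interval are hypothetical. It solves the standard profile likelihood equation for the shape parameter by bisection and then computes the scale in closed form.

```python
import math
import random


def weibull_mle(data, lo=0.01, hi=50.0, tol=1e-10):
    """Maximum likelihood estimates (shape, scale) for complete Weibull data.

    The shape k solves
        sum(x^k ln x) / sum(x^k) - 1/k - mean(ln x) = 0,
    whose left side increases with k, so bisection on [lo, hi] is enough.
    The interval [lo, hi] is an assumption and must bracket the root.
    """
    logs = [math.log(x) for x in data]
    mean_log = sum(logs) / len(logs)

    def score(k):
        weights = [x ** k for x in data]
        weighted = sum(w * l for w, l in zip(weights, logs))
        return weighted / sum(weights) - 1.0 / k - mean_log

    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if score(mid) > 0:
            hi = mid
        else:
            lo = mid
    shape = 0.5 * (lo + hi)
    # Given the shape, the scale has a closed-form maximizer.
    scale = (sum(x ** shape for x in data) / len(data)) ** (1.0 / shape)
    return shape, scale


# Synthetic lifetimes: random.weibullvariate(alpha, beta) takes scale, then shape.
rng = random.Random(42)
lifetimes = [rng.weibullvariate(3.0, 2.0) for _ in range(2000)]
shape_hat, scale_hat = weibull_mle(lifetimes)
print(shape_hat, scale_hat)
```

In an accelerated life test one would repeat this fit at each stress level and then regress the estimated parameters on the stress factors, which is where the weighted least square step of the abstract would come in.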


Judge the Possible Areas in Eastern Part of Inner Mongolia where Strong Earthquakes Occur in the Future by Utilizing b-Value

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.518-523.5616 ◽

2012 ◽

Vol 518-523 ◽

pp. 5616-5622

Author(s):

Xiao Ming Han ◽

Ding Xue ◽

Bo Hu

Keyword(s):

Inner Mongolia ◽

Strong Earthquake ◽

Least Square Method ◽

B Value ◽

Geological Structure ◽

Least Square ◽

Space Distribution ◽

Likelihood Method ◽

Time Scanning ◽

The Future

The Zhalantun district in the eastern part of Inner Mongolia is located in the northern section of the Greater Khingan seismic belt. It has a complicated geological structure, with relatively active moderately strong earthquakes in both historical and modern times. The seismic activity in this district is selected as the research object. Based on a completeness analysis of the seismic sequence in the district, the least square method is used for time scanning of the b-value, and the maximum likelihood method is used for space scanning of the b-value. The b-value in the time scanning is the mean b-value of the research zone in each scanning window, so its variation is not large: the b-value basically stays within 0.78-1.13, and the error within 0.04-0.065. The space scanning results indicate that the b-value of the Zhalantun district basically ranges within 0.4-1.6, with an error range of 0.045-0.085. The low b-value zone is the north central section of the Alun River fault, with b-values basically within 0.5-0.7, which indicates that the crustal medium of this zone is in a state of high horizontal stress accumulation and that it is a dangerous zone where moderately strong or stronger earthquakes may occur in the future.
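The two b-value estimators named in this abstract can be sketched as follows. This is an illustration on synthetic data, not the authors' scanning procedure: the catalog, the completeness magnitude `mc`, and the function names are assumptions. The maximum likelihood estimate uses the Aki formula for continuous magnitudes, b = log10(e) / (mean(M) - Mc); the least-squares estimate fits a straight line to log10 of the cumulative count N(M >= m) against m, following the Gutenberg-Richter relation log10 N = a - b m.

```python
import math
import random


def b_value_ml(mags, mc):
    """Aki maximum likelihood b-value for continuous magnitudes >= mc."""
    sample = [m for m in mags if m >= mc]
    mean_mag = sum(sample) / len(sample)
    return math.log10(math.e) / (mean_mag - mc)


def b_value_ls(mags, mc, step=0.1, min_count=30):
    """Least-squares slope of log10 N(M >= m) against m, sign-flipped.

    Thresholds with fewer than min_count events are dropped because the
    sparse tail is noisy; step and min_count are illustrative choices.
    """
    xs, ys = [], []
    i = 0
    while True:
        threshold = mc + i * step
        count = sum(1 for m in mags if m >= threshold)
        if count < min_count:
            break
        xs.append(threshold)
        ys.append(math.log10(count))
        i += 1
    if len(xs) < 2:
        raise ValueError("not enough thresholds for a line fit")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return -sxy / sxx


# Synthetic catalog: magnitudes above mc are exponential with rate b * ln(10).
rng = random.Random(1)
true_b, mc = 1.0, 2.0
catalog = [mc + rng.expovariate(true_b * math.log(10)) for _ in range(5000)]
b_ml = b_value_ml(catalog, mc)
b_ls = b_value_ls(catalog, mc)
print(b_ml, b_ls)
```

For magnitudes binned to a width ΔM, the usual adjustment is to replace Mc with Mc - ΔM/2 in the maximum likelihood formula; the sketch skips it because the synthetic magnitudes are continuous.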


Statistical Properties of the Maximum Likelihood Method of Phylogenetic Estimation and Comparison with Distance Matrix Methods

Systematic Biology ◽

10.2307/2413672 ◽

1994 ◽

Vol 43 (3) ◽

pp. 329 ◽

Cited By ~ 6

Author(s):

Ziheng Yang

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Method ◽

Distance Matrix ◽

Statistical Properties ◽

Likelihood Method ◽

Matrix Methods ◽

Phylogenetic Estimation


EVALUATION OF THE RESTRICTED MAXIMUM-LIKELIHOOD METHOD FOR ESTIMATING PHYLOGENETIC TREES USING SIMULATED ALLELE-FREQUENCY DATA

Evolution ◽

10.1111/j.1558-5646.1988.tb04162.x ◽

1988 ◽

Vol 42 (3) ◽

pp. 581-595 ◽

Cited By ~ 21

Author(s):

F. James Rohlf ◽

Michael C. Wooten

Keyword(s):

Maximum Likelihood ◽

Allele Frequency ◽

Phylogenetic Trees ◽

Maximum Likelihood Method ◽

Restricted Maximum Likelihood ◽

Likelihood Method ◽

Frequency Data ◽

Allele Frequency Data


The asymptotic behavior of bootstrap support values in molecular phylogenetics

Systematic Biology ◽

10.1093/sysbio/syaa100 ◽

2020 ◽

Author(s):

Jun Huang ◽

Yuting Liu ◽

Tianqi Zhu ◽

Ziheng Yang

Keyword(s):

Asymptotic Behavior ◽

Phylogenetic Trees ◽

Molecular Phylogenetics ◽

Phylogenetic Reconstruction ◽

Strong Support ◽

Large Datasets ◽

Bootstrap Support ◽

Partial Explanation ◽

Empirical Observation ◽

Statistical Confidence

The phylogenetic bootstrap is the most commonly used method for assessing statistical confidence in estimated phylogenies by non-Bayesian methods such as maximum parsimony and maximum likelihood (ML). It is observed that bootstrap support tends to be high in large genomic datasets whether or not the inferred trees and clades are correct. Here we study the asymptotic behavior of bootstrap support for the ML tree in large datasets when the competing phylogenetic trees are equally right or equally wrong. We consider phylogenetic reconstruction as a problem of statistical model selection when the compared models are nonnested and misspecified. The bootstrap is found to have qualitatively different dynamics from Bayesian inference, and does not exhibit the polarized behavior of posterior model probabilities, consistent with the empirical observation that the bootstrap is more conservative than Bayesian probabilities. Nevertheless, bootstrap support similarly shows fluctuations among large datasets, with no convergence to a point value, when the compared models are equally right or equally wrong. Thus in large datasets strong support for wrong trees or models is likely to occur. Our analysis provides a partial explanation for the high bootstrap support values for incorrect clades observed in empirical data analysis.


Estimasi Parameter Model Autoregressive dengan Metode Yule Walker, Least Square, dan Maximum Likelihood (Studi Kasus Data ROA BPRS di Indonesia)

Quadratic: Journal of Innovation and Technology in Mathematics and Mathematics Education ◽

10.14421/quadratic.2021.011-01 ◽

2021 ◽

Vol 1 (1) ◽

pp. 1-6

Author(s):

Maulida Nurhidayati

Keyword(s):

Maximum Likelihood ◽

Maximum Likelihood Method ◽

Least Square ◽

Likelihood Method ◽

Model Parameters ◽

Likelihood Methods ◽

Simulation Data ◽

Sample Data ◽

Maximum Likelihood Methods ◽

Modeling Data

The Autoregressive model is a univariate time series model for stationary data. Its parameters can be estimated by several methods, namely the Yule-Walker, Least Square, and Maximum Likelihood methods. Each method rests on a different principle for estimating the model parameters, so the results obtained also differ. Based on this, in this study the AR(1) model parameter was estimated on data simulated 1000 times to assess the performance of the Yule-Walker, Least Square, and Maximum Likelihood methods. In addition, the three methods are compared on ROA BPRS data that follow the AR(1) model. The results showed that the Maximum Likelihood method gave the most suitable mode and comparison of estimation results for the simulated data, the smallest MAE for the in-sample data, and the smallest MAPE, MSE, and MAE for the out-of-sample data. These results show that the Maximum Likelihood method is the best method for modeling data that follow the AR(1) model.
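Two of the estimators compared in this abstract can be sketched for the AR(1) model x_t = phi * x_{t-1} + e_t. The sketch is illustrative only: the simulated series, the value phi = 0.6, and the function names are assumptions, not the study's data or code. The Yule-Walker estimate is the lag-1 sample autocorrelation, while the least-squares estimate regresses x_t on x_{t-1}. The exact maximum likelihood estimate is not shown because it requires numerical optimization; under Gaussian errors, the conditional maximum likelihood estimate (conditioning on the first observation) coincides with the least-squares estimate.

```python
import random


def simulate_ar1(phi, n, seed=0, burn_in=200):
    """Simulate a Gaussian AR(1) series, discarding a burn-in period."""
    rng = random.Random(seed)
    x = 0.0
    series = []
    for t in range(n + burn_in):
        x = phi * x + rng.gauss(0.0, 1.0)
        if t >= burn_in:
            series.append(x)
    return series


def _demean(series):
    mean = sum(series) / len(series)
    return [v - mean for v in series]


def yule_walker_ar1(series):
    """Lag-1 sample autocorrelation: the denominator uses all n terms."""
    d = _demean(series)
    num = sum(d[t] * d[t - 1] for t in range(1, len(d)))
    return num / sum(v * v for v in d)


def least_squares_ar1(series):
    """Regression of x_t on x_{t-1}: the denominator uses the first n-1 terms."""
    d = _demean(series)
    num = sum(d[t] * d[t - 1] for t in range(1, len(d)))
    return num / sum(v * v for v in d[:-1])


series = simulate_ar1(phi=0.6, n=5000, seed=7)
phi_yw = yule_walker_ar1(series)
phi_ls = least_squares_ar1(series)
print(phi_yw, phi_ls)
```

The two estimates share a numerator and differ only in one denominator term, so the Yule-Walker value is never larger in absolute size than the least-squares value, and the gap shrinks as the series gets longer.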
