Adaptation of a Success Story in GAs: Estimation-of-Distribution Algorithms for Tree-based Optimization Problems

Author(s):  
Peter A. N. Bosman ◽  
Edwin D. de Jong

2013 ◽  
Vol 21 (3) ◽  
pp. 471-495 ◽  
Author(s):  
Carlos Echegoyen ◽  
Alexander Mendiburu ◽  
Roberto Santana ◽  
Jose A. Lozano

Understanding the relationship between a search algorithm and the space of problems is a fundamental issue in the optimization field. In this paper, we lay the foundations to elaborate taxonomies of problems under estimation of distribution algorithms (EDAs). By using an infinite population model and assuming that the selection operator is based on the rank of the solutions, we group optimization problems according to the behavior of the EDA. Through the definition of an equivalence relation between functions, it is possible to partition the space of problems into equivalence classes in which the algorithm has the same behavior. We show that only the probabilistic model is able to generate different partitions of the set of possible problems and hence it predetermines the number of different behaviors that the algorithm can exhibit. As a natural consequence of our definitions, all the objective functions are in the same equivalence class when the algorithm does not impose restrictions on the probabilistic model. The taxonomy of problems, which is also valid for finite populations, is studied in depth for a simple EDA that assumes independence among the variables of the problem. We provide the necessary and sufficient condition to decide the equivalence between functions and then develop the operators to describe and count the members of a class. In addition, we show the intrinsic relation between univariate EDAs and the neighborhood system induced by the Hamming distance by proving that all the functions in the same class have the same number of local optima and that these optima occupy the same ranking positions. Finally, we carry out numerical simulations in order to analyze the different behaviors that the algorithm can exhibit for the functions defined over the search space [Formula: see text].
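
The abstract's setting can be made concrete with a small sketch. The code below is an illustrative assumption, not the paper's formal model: a univariate EDA (independent marginals) with rank-based truncation selection over binary strings, plus a helper that counts local optima under the Hamming neighborhood. The OneMax objective and all parameter values are placeholders.

```python
# A minimal sketch of a univariate EDA with rank-based truncation selection,
# plus a counter of Hamming local optima; objective and parameters are illustrative.
import itertools
import random


def onemax(x):
    """Toy objective over {0,1}^n: the number of ones in the string."""
    return sum(x)


def hamming_local_optima(f, n):
    """Count points whose value is >= that of every Hamming-distance-1 neighbor."""
    count = 0
    for x in itertools.product((0, 1), repeat=n):
        fx = f(x)
        neighbors = (x[:i] + (1 - x[i],) + x[i + 1:] for i in range(n))
        if all(fx >= f(y) for y in neighbors):
            count += 1
    return count


def univariate_eda(f, n, pop_size=100, truncation=0.5, generations=50, seed=0):
    """UMDA-style loop: sample, rank, keep the best fraction, refit the marginals."""
    rng = random.Random(seed)
    p = [0.5] * n  # independent marginal probabilities P(x_i = 1)
    best = None
    for _ in range(generations):
        pop = [tuple(int(rng.random() < p[i]) for i in range(n)) for _ in range(pop_size)]
        pop.sort(key=f, reverse=True)  # selection depends only on the ranking of solutions
        selected = pop[: int(truncation * pop_size)]
        p = [sum(x[i] for x in selected) / len(selected) for i in range(n)]
        if best is None or f(pop[0]) > f(best):
            best = pop[0]
    return best


if __name__ == "__main__":
    print("Hamming local optima of OneMax (n=3):", hamming_local_optima(onemax, 3))
    print("best solution found by the EDA:", univariate_eda(onemax, n=10))
```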


2009 ◽  
Vol 48 (03) ◽  
pp. 236-241 ◽  
Author(s):  
V. Robles ◽  
P. Larrañaga ◽  
C. Bielza

Summary. Objectives: The “large k (genes), small N (samples)” phenomenon complicates the problem of microarray classification with logistic regression. The indeterminacy of the maximum likelihood solutions, multicollinearity of predictor variables and data over-fitting cause unstable parameter estimates. Moreover, computational problems arise due to the large number of predictor variables (genes). Regularized logistic regression excels as a solution. However, the difficulties here involve an objective function that is hard to optimize from a mathematical viewpoint and regularization parameters that require careful tuning. Methods: Those difficulties are tackled by introducing a new way of regularizing logistic regression. Estimation of distribution algorithms (EDAs), a kind of evolutionary algorithm, emerge as natural regularizers. Obtaining the regularized estimates of the logistic classifier amounts to maximizing the likelihood function via our EDA, without having to penalize it. Likelihood penalties add a number of difficulties to the resulting optimization problems, which vanish in our case. Simulation of new estimates during the evolutionary process of the EDA is performed in a way that guarantees their shrinkage while maintaining the learnt probabilistic dependence relationships. The EDA process is embedded in an adapted recursive feature elimination procedure, thereby providing the genes that are the best markers for the classification. Results: The consistency with the literature and the excellent classification performance achieved with our algorithm are illustrated on four microarray data sets: Breast, Colon, Leukemia and Prostate. Details on the last two data sets are available as supplementary material. Conclusions: We have introduced a novel EDA-based logistic regression regularizer. It implicitly shrinks the coefficients during the EDA evolution process while optimizing the usual likelihood function. The approach is combined with a gene subset selection procedure and automatically tunes the required parameters. Empirical results on microarray data sets yield sparse models with genes confirmed in the literature and better classification performance than other competing regularized methods.
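
As a rough, hedged illustration of regularizing by evolution rather than by an explicit penalty, the sketch below uses a univariate Gaussian EDA that maximizes the unpenalized logistic log-likelihood while shrinking the sampled coefficients toward zero in each generation. The shrinkage factor, the synthetic data, and the function names are assumptions, not the authors' exact scheme (which also preserves learnt dependence relationships).

```python
# A sketch of EDA-based shrinkage for logistic regression; not the authors' algorithm.
import numpy as np


def log_likelihood(beta, X, y):
    """Unpenalized Bernoulli log-likelihood of a logistic regression model."""
    z = X @ beta
    return float(np.sum(y * z - np.logaddexp(0.0, z)))


def eda_logistic(X, y, pop_size=200, top=50, generations=100, shrink=0.95, seed=0):
    """Univariate Gaussian EDA over the coefficient vector, with shrinkage at sampling time."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    mu, sigma = np.zeros(d), np.ones(d)
    for _ in range(generations):
        # Sample candidate coefficient vectors and shrink them toward zero;
        # the shrinkage plays the role usually played by an explicit penalty.
        pop = shrink * rng.normal(mu, sigma, size=(pop_size, d))
        scores = np.array([log_likelihood(b, X, y) for b in pop])
        elite = pop[np.argsort(scores)[-top:]]              # keep the best candidates
        mu, sigma = elite.mean(axis=0), elite.std(axis=0) + 1e-6
    return mu


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.normal(size=(100, 5))
    true_beta = np.array([2.0, -1.0, 0.0, 0.0, 0.5])
    y = (rng.random(100) < 1.0 / (1.0 + np.exp(-X @ true_beta))).astype(float)
    print("estimated coefficients:", np.round(eda_logistic(X, y), 2))
```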


2005 ◽  
Vol 13 (1) ◽  
pp. 43-66 ◽  
Author(s):  
J. M. Peña ◽  
J. A. Lozano ◽  
P. Larrañaga

Many optimization problems are what can be called globally multimodal, i.e., they present several global optima. Unfortunately, this is a major source of difficulties for most estimation of distribution algorithms, causing their effectiveness and efficiency to degrade due to genetic drift. With the aim of overcoming these drawbacks for the optimization of discrete globally multimodal problems, this paper introduces and evaluates a new estimation of distribution algorithm based on unsupervised learning of Bayesian networks. We report the satisfactory results of our experiments with symmetrical binary optimization problems.
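
A hedged sketch of the general idea follows, not the paper's Bayesian-network model: the selected individuals are clustered by an unsupervised step (a tiny 2-means here, standing in for unsupervised Bayesian network learning) and one univariate model is fitted per cluster, so that symmetrical global optima are not averaged away by genetic drift. The TwoMax objective and all parameters are illustrative assumptions.

```python
# A sketch of a cluster-based EDA for symmetric bimodal problems; illustrative only.
import numpy as np


def twomax(x):
    """Symmetric bimodal objective: global optima at all-zeros and all-ones."""
    return max(x.sum(), len(x) - x.sum())


def kmeans2(points, iters=10, seed=0):
    """Tiny 2-means on binary vectors; the unsupervised step of this sketch."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), 2, replace=False)].astype(float)
    for _ in range(iters):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for k in range(2):
            if (labels == k).any():
                centers[k] = points[labels == k].mean(axis=0)
    return labels


def mixture_eda(f, n, pop_size=200, top=60, generations=40, seed=0):
    """EDA with one univariate model per cluster, so both symmetric optima can survive."""
    rng = np.random.default_rng(seed)
    probs = [np.full(n, 0.5), np.full(n, 0.5)]      # one marginal vector per component
    for _ in range(generations):
        pop = np.vstack([(rng.random((pop_size // 2, n)) < p).astype(int) for p in probs])
        selected = pop[np.argsort([f(x) for x in pop])[-top:]]
        labels = kmeans2(selected)
        for k in range(2):
            if (labels == k).any():
                probs[k] = selected[labels == k].mean(axis=0).clip(0.05, 0.95)
    return probs


if __name__ == "__main__":
    p0, p1 = mixture_eda(twomax, n=20)
    print("component 0 marginals:", np.round(p0, 2))
    print("component 1 marginals:", np.round(p1, 2))
```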


2015 ◽  
Vol 2015 ◽  
pp. 1-13 ◽  
Author(s):  
S. H. Chen

Estimation of distribution algorithms (EDAs) have been used to solve numerous hard problems. However, their use with in-group optimization problems has not been discussed extensively in the literature. A well-known in-group optimization problem is the multiple traveling salesmen problem (mTSP), which involves simultaneous assignment and sequencing procedures and appears in different forms. This paper presents a new algorithm, named EDAMLA, which is based on a self-guided genetic algorithm with a minimum loading assignment (MLA) rule. This strategy uses a transformed-based encoding approach instead of direct encoding, so the solution space of the proposed method is only n!. We compare the proposed algorithm against the optimal direct encoding technique and the two-part encoding genetic algorithm, whose solution space is n!·C(n−1, m−1), in experiments on 34 TSP instances drawn from the TSPLIB. The scale of the experiments exceeded that presented in prior studies. The results show that the proposed algorithm was superior to the two-part encoding genetic algorithm in terms of minimizing the total traveling distance. Notably, the proposed algorithm did not cause a longer traveling distance when the number of salesmen was increased from 3 to 10. The results suggest that EDA researchers should employ the MLA rule instead of direct encoding in their proposed algorithms.
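
The encoding idea can be sketched under stated assumptions: only a permutation of the n cities is evolved (hence the n! solution space), and a loading rule splits that permutation among the m salesmen. The greedy least-loaded assignment below is one plausible reading of a minimum loading assignment rule, not necessarily the paper's exact MLA; the toy coordinates are placeholders.

```python
# A sketch of splitting one evolved city permutation among m salesmen; illustrative only.
import math


def tour_length_increment(route, city, depot, dist):
    """Extra distance of appending `city` to a route that starts and ends at `depot`."""
    last = route[-1] if route else depot
    return dist[last][city] + dist[city][depot] - (dist[last][depot] if route else 0.0)


def minimum_loading_assignment(permutation, m, depot, dist):
    """Split one city permutation among m salesmen, greedily keeping loads minimal."""
    routes, loads = [[] for _ in range(m)], [0.0] * m
    for city in permutation:
        deltas = [tour_length_increment(routes[k], city, depot, dist) for k in range(m)]
        k = min(range(m), key=lambda i: loads[i] + deltas[i])  # least loaded after insertion
        routes[k].append(city)
        loads[k] += deltas[k]
    return routes, sum(loads)


if __name__ == "__main__":
    # Toy symmetric distance matrix on 5 nodes; node 0 is the depot.
    coords = [(0, 0), (1, 5), (5, 1), (6, 6), (2, 2)]
    dist = [[math.dist(a, b) for b in coords] for a in coords]
    routes, total = minimum_loading_assignment([3, 1, 4, 2], m=2, depot=0, dist=dist)
    print("routes per salesman:", routes, "total traveling distance:", round(total, 2))
```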


2013 ◽  
Vol 373-375 ◽  
pp. 1093-1097
Author(s):  
Fa Hong Yu ◽  
Mei Jia Chen ◽  
Wei Zhi Liao

To systematically harmonize the conflict between selective pressure and population diversity in estimation of distribution algorithms, an improved estimation of distribution algorithm based on the minimal free energy (IEDA) is proposed in this paper. IEDA follows the principle of minimal free energy, simulating the competitive mechanism between energy and entropy in the annealing process: population diversity is measured by a similarity entropy, and the minimum free energy is simulated through an efficient and effective competition driven by a free energy component. On a set of typical numerical optimization problems, IEDA achieved satisfactory results, showing that it effectively avoids premature convergence and reduces the search cost to some extent.
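
A minimal sketch of free-energy-based selection follows, under an illustrative interpretation rather than the paper's exact formulation: each candidate is scored by F = E - T * S, where E is the negated fitness, S is a per-individual diversity contribution (mean Hamming distance to the population, standing in for the similarity entropy), and the temperature T is annealed over generations so that diversity weighs less as the search converges.

```python
# A sketch of selection by a free-energy component inside a simple EDA; illustrative only.
import random


def mean_hamming_distance(x, population):
    """Average Hamming distance from x to every member of the population."""
    return sum(sum(a != b for a, b in zip(x, y)) for y in population) / len(population)


def free_energy_select(population, fitness, temperature, keep):
    """Keep the candidates with the lowest free energy F = -fitness - T * diversity."""
    scored = [(-fitness(x) - temperature * mean_hamming_distance(x, population), x)
              for x in population]
    scored.sort(key=lambda pair: pair[0])
    return [x for _, x in scored[:keep]]


if __name__ == "__main__":
    rng = random.Random(0)
    n, pop_size = 20, 50
    onemax = sum                                   # toy objective
    pop = [tuple(rng.randint(0, 1) for _ in range(n)) for _ in range(pop_size)]
    temperature = 1.0
    for _ in range(30):
        selected = free_energy_select(pop, onemax, temperature, keep=20)
        p = [sum(x[i] for x in selected) / len(selected) for i in range(n)]
        pop = [tuple(int(rng.random() < p[i]) for i in range(n)) for _ in range(pop_size)]
        temperature *= 0.9                         # annealing: the entropy term loses weight
    print("best fitness found:", max(map(onemax, pop)))
```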


2020 ◽  
Vol 10 (19) ◽  
pp. 6937
Author(s):  
Abdel-Rahman Hedar ◽  
Amira A. Allam ◽  
Alaa Fahim

Generating practical methods for simulation-based optimization has attracted a great deal of attention recently. In this paper, estimation of distribution algorithms are used to solve nonlinear continuous optimization problems that contain noise. One common approach to dealing with these problems is to combine sampling methods with optimal search methods. Sampling techniques have a serious problem when the sample size is small: in that case, estimates of noisy objective function values are inaccurate. In this research, a new sampling technique based on fuzzy logic is proposed to deal with small sample sizes. Then, simulation-based optimization methods are designed by combining estimation of distribution algorithms with the proposed sampling technique and other sampling techniques to solve stochastic programming problems. Moreover, additive versions of the proposed methods are developed to optimize functions without noise in order to evaluate different efficiency levels of the proposed methods. To test the performance of the proposed methods, numerical experiments were carried out using several benchmark test functions. Finally, three real-world applications are considered to assess the performance of the proposed methods.
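
As a rough illustration of fuzzy sampling with small sample sizes (an assumption-laden stand-in, not the authors' exact scheme), the sketch below takes a handful of noisy evaluations of a point, weights each sample with a triangular fuzzy membership centered at the sample median, and returns the weighted mean as the objective estimate.

```python
# A sketch of fuzzy-weighted estimation of a noisy objective; illustrative only.
import random
import statistics


def triangular_membership(value, center, width):
    """Fuzzy membership in [0, 1]: 1 at the center, falling linearly to 0 at +/- width."""
    return max(0.0, 1.0 - abs(value - center) / width)


def fuzzy_estimate(noisy_f, x, samples=5):
    """Estimate f(x) from a few noisy samples via fuzzy-weighted averaging."""
    values = [noisy_f(x) for _ in range(samples)]
    center = statistics.median(values)
    width = max(abs(v - center) for v in values) or 1.0   # avoid a zero width
    weights = [triangular_membership(v, center, width) + 1e-9 for v in values]
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)


if __name__ == "__main__":
    rng = random.Random(0)

    def noisy_sphere(x):
        # Illustrative noisy objective: sphere function plus Gaussian noise.
        return sum(v * v for v in x) + rng.gauss(0.0, 0.5)

    point = [0.3, -0.2, 0.1]
    print("single noisy evaluation:", round(noisy_sphere(point), 3))
    print("fuzzy-weighted estimate:", round(fuzzy_estimate(noisy_sphere, point), 3))
```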

