Local Optima Avoidance in GA Biclustering using Map Reduce

Author(s):  
Gowri R. ◽  
Rathipriya R.

One of the prominent issues in Genetic Algorithm (GA) is premature convergence on local optima. This restricts the enhanced optimal solution searching in the entire search space. Population size is one of the influencing factors in Genetic Algorithm. Increasing the population size will improvise the randomized searching and maintains the diversity in the population. It also increases its computational complexity. Especially in GA Biclustering (GABiC), the search should be randomized to find more optimal patterns. In this paper, a novel approach for population setup in MapReduce framework is proposed. The maximal population is split into population sets, and these groups will proceed searching in parallel using MapReduce framework. This approach is attempted for biclustering the gene expression dataset in this paper. The performance of this proposed work seems promising on comparing its results with those obtained from previous hybridized optimization approaches. This approach will also handle data scalability issues and applicable to the big data biclustering problems.

Author(s):  
Prachi Agrawal ◽  
Talari Ganesh ◽  
Ali Wagdy Mohamed

AbstractThis article proposes a novel binary version of recently developed Gaining Sharing knowledge-based optimization algorithm (GSK) to solve binary optimization problems. GSK algorithm is based on the concept of how humans acquire and share knowledge during their life span. A binary version of GSK named novel binary Gaining Sharing knowledge-based optimization algorithm (NBGSK) depends on mainly two binary stages: binary junior gaining sharing stage and binary senior gaining sharing stage with knowledge factor 1. These two stages enable NBGSK for exploring and exploitation of the search space efficiently and effectively to solve problems in binary space. Moreover, to enhance the performance of NBGSK and prevent the solutions from trapping into local optima, NBGSK with population size reduction (PR-NBGSK) is introduced. It decreases the population size gradually with a linear function. The proposed NBGSK and PR-NBGSK applied to set of knapsack instances with small and large dimensions, which shows that NBGSK and PR-NBGSK are more efficient and effective in terms of convergence, robustness, and accuracy.


Author(s):  
CHENGYING MAO ◽  
XINXIN YU

The quality of test data has an important impact on the effect of software testing, so test data generation has always been a key task for finding the potential faults in program code. In structural testing, the primary goal is to cover some kinds of structure elements with some specific inputs. Search-based test data generation provides a rational way to handle this difficult problem. In the past, some well-known meta-heuristic search algorithms have been successfully utilized to solve this issue. In this paper, we introduce a variant of genetic algorithm (GA), called quantum-inspired genetic algorithm (QIGA), to generate the test data with stronger coverage ability. In this new algorithm, the traditional binary bit is replaced by a quantum bit (Q-bit) to enlarge the search space so as to avoid falling into local optimal solution. On the other hand, some other strategies such as quantum rotation gate and catastrophe operation are also used to improve algorithm efficiency and quality of test data. In addition, experimental analysis on eight real-world programs is performed to validate the effectiveness of our method. The results show that QIGA-based method can generate test data with higher coverage in much smaller convergence generations than GA-based method. More importantly, our proposed method is more robust for algorithm parameter change.


F1000Research ◽  
2013 ◽  
Vol 2 ◽  
pp. 139
Author(s):  
Maxinder S Kanwal ◽  
Avinash S Ramesh ◽  
Lauren A Huang

Recent development of large databases, especially those in genetics and proteomics, is pushing the development of novel computational algorithms that implement rapid and accurate search strategies. One successful approach has been to use artificial intelligence and methods, including pattern recognition (e.g. neural networks) and optimization techniques (e.g. genetic algorithms). The focus of this paper is on optimizing the design of genetic algorithms by using an adaptive mutation rate that is derived from comparing the fitness values of successive generations. We propose a novel pseudoderivative-based mutation rate operator designed to allow a genetic algorithm to escape local optima and successfully continue to the global optimum. Once proven successful, this algorithm can be implemented to solve real problems in neurology and bioinformatics. As a first step towards this goal, we tested our algorithm on two 3-dimensional surfaces with multiple local optima, but only one global optimum, as well as on the N-queens problem, an applied problem in which the function that maps the curve is implicit. For all tests, the adaptive mutation rate allowed the genetic algorithm to find the global optimal solution, performing significantly better than other search methods, including genetic algorithms that implement fixed mutation rates.


2013 ◽  
Vol 411-414 ◽  
pp. 1884-1893
Author(s):  
Yong Chun Cao ◽  
Ya Bin Shao ◽  
Shuang Liang Tian ◽  
Zheng Qi Cai

Due to many of the clustering algorithms based on GAs suffer from degeneracy and are easy to fall in local optima, a novel dynamic genetic algorithm for clustering problems (DGA) is proposed. The algorithm adopted the variable length coding to represent individuals and processed the parallel crossover operation in the subpopulation with individuals of the same length, which allows the DGA algorithm clustering to explore the search space more effectively and can automatically obtain the proper number of clusters and the proper partition from a given data set; the algorithm used the dynamic crossover probability and adaptive mutation probability, which prevented the dynamic clustering algorithm from getting stuck at a local optimal solution. The clustering results in the experiments on three artificial data sets and two real-life data sets show that the DGA algorithm derives better performance and higher accuracy on clustering problems.


F1000Research ◽  
2013 ◽  
Vol 2 ◽  
pp. 139
Author(s):  
Maxinder S Kanwal ◽  
Avinash S Ramesh ◽  
Lauren A Huang

The fields of molecular biology and neurobiology have advanced rapidly over the last two decades. These advances have resulted in the development of large proteomic and genetic databases that need to be searched for the prediction, early detection and treatment of neuropathologies and other genetic disorders. This need, in turn, has pushed the development of novel computational algorithms that are critical for searching genetic databases. One successful approach has been to use artificial intelligence and pattern recognition algorithms, such as neural networks and optimization algorithms (e.g. genetic algorithms). The focus of this paper is on optimizing the design of genetic algorithms by using an adaptive mutation rate based on the fitness function of passing generations. We propose a novel pseudo-derivative based mutation rate operator designed to allow a genetic algorithm to escape local optima and successfully continue to the global optimum. Once proven successful, this algorithm can be implemented to solve real problems in neurology and bioinformatics. As a first step towards this goal, we tested our algorithm on two 3-dimensional surfaces with multiple local optima, but only one global optimum, as well as on the N-queens problem, an applied problem in which the function that maps the curve is implicit. For all tests, the adaptive mutation rate allowed the genetic algorithm to find the global optimal solution, performing significantly better than other search methods, including genetic algorithms that implement fixed mutation rates.


Author(s):  
A. K. Dash ◽  
D. K. Agarwalla ◽  
H. C. Das ◽  
M. K. Pradhan ◽  
S. K. Bhuyan

Machines and beam like structures used in various industries require continuous monitoring for the fault identification for ensuring uninterrupted service. Different non-destructive techniques (NDT) are generally used for this purpose, but they are costly and time consuming. Vibration based methods can be useful to detect cracks in structures using various artificial intelligence (AI) techniques. The modal parameters from the dynamic response of the structure are used for the purpose. In the current analysis, the vibration characteristics of a glass fiber reinforced composite cracked cantilever beam having different crack locations and depths have been studied. Numerical and finite element methods have been used to extract the diagnostic indices (natural frequencies, mode shapes) from cracked and intact beam structure. An intelligent Genetic Algorithm (GA) based controller has been designed to automate the fault identification and location process. Single point crossover and in some cases mutation procedure have been followed to find out the optimal solution from the search space. The controller has been trained in offline mode using the simulation and experimental results (initial data pool) under various healthy and faulty conditions of the structure. The outcome from the developed controller shows that the system could not only detect the cracks but also predict their locations and severities. Good agreement between the simulation, experimental and GA controller results confirms the effectiveness of the proposed controller.


2013 ◽  
Vol 432 ◽  
pp. 565-570
Author(s):  
Xin Wen Gao ◽  
Ben Bo Guan ◽  
Xing Jian Guan

The purpose of this paper is to improve the efficiency of the Gene Expression Programming (GEP) algorithm. The GEP algorithm is an evolutionary computation. It inherits the characteristics of Genetic Algorithm and Genetic Programming. Through its own characteristics, the GEP algorithm can get the optimal solution of the complicated problem. So, the GEP algorithm has achieved good results in many areas. However, there are also some inevitable drawbacks about the GEP algorithm itself. This paper proposes 5 deficiencies aspects of the GEP algorithm (expression meaning, fitness calculation, local convergence, variable selection, genetic operations, selection of genetic operation rates), and gives the corresponding solutions.


2019 ◽  
Vol 9 (19) ◽  
pp. 4005 ◽  
Author(s):  
Geunho Yang ◽  
Byung Do Chung ◽  
Sang Jin Lee

This study addresses the dual resource constrained flexible job shop scheduling problem (DRCFJSP) with a multilevel product structure. The DRCFJSP is a strong NP-hard problem, and an efficient algorithm is essential for DRCFJSP. In this study, we propose an algorithm for the DRCFJSP with a multilevel product structure to minimize the lateness, makespan, and deviation of the workload with preemptive priorities. To efficiently solve the problem within a limited time, the search space is limited based on the possible start and end time, and focus is placed on the intensification rather than diversification, which can help the algorithm spend more time to find an optimal solution in a reasonable solution space. The performance of the proposed algorithm is compared with those of a genetic algorithm and a hybrid genetic algorithm with variable neighborhood search. The numerical experiments demonstrate that the strategy limiting the search space is effective for large and complex problems.


Author(s):  
Morteza Madhkhan ◽  
Mohammad Reza Baradaran

Genetic Algorithm (GA) is one of the most widely used optimization algorithms. This algorithm consists of five stages, namely population generation, crossover, mutation, evaluation, and selection. This study presents a modified version of GA called Improved Genetic Algorithm (IGA) for the optimization of steel frame designs. In the IGA, the rate of convergence to the optimal solution is increased by splitting the population generation process to two stages. In the first stage, the initial population is generated by random selection of members from among AISC W-shapes. The generated population is then evaluated in another stage, where the member that does not satisfy the design constraints are replaced with stronger members with larger cross sectional area. This process continues until all design constraints are satisfied. Through this process, the initial population will be improved intelligently so that the design constraints fall within the allowed range. For performance evaluation and comparison, the method was used to design and optimize 10-story and 24-story frames based on the LRFD method as per AISC regulations with the finite element method used for frame analysis. Structural analysis, design, and optimization were performed using a program written with MATLAB programming language. The results show that using the proposed method (IGA) for frame optimization reduces the volume of computations and increases the rate of convergence, thus allowing access to frame designs with near-optimal weights in only a few iterations. Using the IGA also limits the search space to the area of acceptable solutions.


Sign in / Sign up

Export Citation Format

Share Document