Tree-Structure-Aware Genetic Operators in Genetic Programming

Traditional software engineering dictates the use of modular and structured programming and top-down stepwise refinement techniques that reduce the amount of variability arising in the development process by establishing standard procedures to be followed while writing software. This focusing leads to reduced variability in the resulting products, due to the use of standardized constructs. Genetic programming (GP) performs heuristic search in the space of programs. Programs produced through the GP paradigm emerge as the result of simulated evolution and are built through a bottom-up process, incrementally augmenting their functionality until a satisfactory level of performance is reached. Can we automatically extract knowledge from the GP programming process that can be useful to focus the search and reduce product variability, thus leading to a more effective use of the available resources? An answer to this question is investigated with the aid of cultural algorithms. A new system, cultural algorithms with genetic programming (CAGP), is presented. The system has two levels. The first is the pool of genetic programs (population level), and the second is a knowledge repository (belief set) that is built during the GP run and is used to guide the search process. The microevolution within the population brings about potentially meaningful characteristics of the programs for the achievement of the given task, such as properties exhibited by the best performers in the population. CAGP extracts these features and represents them as the set of the current beliefs. Beliefs correspond to constraints that all the genetic operators and programs must follow. Interaction between the two levels occurs in one direction through the extraction process and, in the other, through the modulation of an individual's program parameters according to which, and how many, of the constraints it follows. CAGP is applied to solve an instance of the symbolic regression problem, in which a function of one variable needs to be discovered. The results of the experiments show an overall improvement on the average performance of CAGP over GP alone and a significant reduction of the complexity of the produced solution. Moreover, the execution time required by CAGP is comparable with the time required by GP alone.

Download Full-text

Guided Subtree Selection for Genetic Operators in Genetic Programming for Dynamic Flexible Job Shop Scheduling

10.26686/wgtn.13158314.v1 ◽

2020 ◽

Author(s):

Fangfang Zhang ◽

Yi Mei ◽

S Nguyen ◽

Mengjie Zhang

Keyword(s):

Genetic Programming ◽

Job Shop ◽

Job Shop Scheduling ◽

Computational Time ◽

Flexible Job Shop ◽

Flexible Job Shop Scheduling ◽

Genetic Operators ◽

Operation Sequencing ◽

Shop Scheduling ◽

Selection For

© 2020, Springer Nature Switzerland AG. Dynamic flexible job shop scheduling (DFJSS) has been widely studied in both academia and industry. Both machine assignment and operation sequencing decisions need to be made simultaneously as an operation can be processed by a set of machines in DFJSS. Using scheduling heuristics to solve the DFJSS problems becomes an effective way due to its efficiency and simplicity. Genetic programming (GP) has been successfully applied to evolve scheduling heuristics for job shop scheduling automatically. However, the subtrees of the selected parents are randomly chosen in traditional GP for crossover and mutation, which may not be sufficiently effective, especially in a huge search space. This paper proposes new strategies to guide the subtree selection rather than picking them randomly. To be specific, the occurrences of features are used to measure the importance of each subtree of the selected parents. The probability to select a subtree is based on its importance and the type of genetic operators. This paper examines the proposed algorithm on six DFJSS scenarios. The results show that the proposed GP algorithm with the guided subtree selection for crossover can converge faster and achieve significantly better performance than its counterpart in half of the scenarios while no worse in all other scenarios without increasing the computational time.

Download Full-text

High-dimensional Unbalanced Binary Classification by Genetic Programming with Multi-criterion Fitness Evaluation and Selection

Evolutionary Computation ◽

10.1162/evco_a_00304 ◽

2021 ◽

pp. 1-26

Author(s):

Wenbin Pei ◽

Bing Xue ◽

Lin Shang ◽

Mengjie Zhang

Keyword(s):

Genetic Programming ◽

Fitness Function ◽

Binary Classification ◽

Class Imbalance ◽

Area Under The Curve ◽

High Dimensional ◽

Genetic Operators ◽

Minority Class ◽

Fitness Evaluation ◽

Unbalanced Classification

Abstract High-dimensional unbalanced classification is challenging because of the joint effects of high dimensionality and class imbalance. Genetic programming (GP) has the potential benefits for use in high-dimensional classification due to its built-in capability to select informative features. However, once data is not evenly distributed, GP tends to develop biased classifiers which achieve a high accuracy on the majority class but a low accuracy on the minority class. Unfortunately, the minority class is often at least as important as the majority class. It is of importance to investigate how GP can be effectively utilized for high-dimensional unbalanced classification. In this paper, to address the performance bias issue of GP, a new two-criterion fitness function is developed, which considers two criteria, i.e. the approximation of area under the curve (AUC) and the classification clarity (i.e. how well a program can separate two classes). The obtained values on the two criteria are combined in pairs, instead of summing them together. Furthermore, this paper designs a three-criterion tournament selection to effectively identify and select good programs to be used by genetic operators for generating better offspring during the evolutionary learning process. The experimental results show that the proposed method achieves better classification performance than other compared methods.

Download Full-text

Gene Expression Programming

Bio-Inspired Computing for Information Retrieval Applications - Advances in Knowledge Acquisition, Transfer, and Management ◽

10.4018/978-1-5225-2375-8.ch010 ◽

2017 ◽

pp. 269-292 ◽

Cited By ~ 2

Author(s):

Baddrud Zaman Laskar ◽

Swanirbhar Majumder

Keyword(s):

Gene Expression ◽

Genetic Algorithm ◽

Natural Selection ◽

Genetic Programming ◽

Soft Computing ◽

Gene Expression Programming ◽

Research Work ◽

Genetic Operators ◽

A New Technique ◽

Work Done

Gene expression programming (GEP) introduced by Candida Ferreira is a descendant of genetic algorithm (GA) and genetic programming (GP). It takes the advantage of both the optimization and search technique based on genetics and natural selection as GA and its programmatic Darwinian counterpart GP. It is gaining popularity because; it has to some extent eradicated the ‘cons' of both while keeping in the ‘pros'. It is still a new technique not much explored since its introduction in 2001. In this chapter both GA and GP is first discussed followed by the elaborate discussion of GEP. This is followed up by the discussion on research work done is different fields using GEP as a tool followed up by GEP architectures. Finally, here GEP has been used for detection of age from facial features as a soft computing based optimization problem using genetic operators.

Download Full-text

Enzyme Genetic Programming

Cellular Computing ◽

10.1093/oso/9780195155396.003.0007 ◽

2004 ◽

Author(s):

Michael A. Lones ◽

Andy M. Tyrrell

Keyword(s):

Genetic Programming ◽

Biological Evolution ◽

Target System ◽

Search Process ◽

Automatic Programming ◽

Genetic Operators ◽

Viable Systems ◽

The Right ◽

And Behavior ◽

Environment Evolution

Programming is a process of optimization; taking a specification, which tells us what we want, and transforming it into an implementation, a program, which causes the target system to do exactly what we want. Conventionally, this optimization is achieved through manual design. However, manual design can be slow and error-prone, and recently there has been increasing interest in automatic programming; using computers to semiautomate the process of refining a specification into an implementation. Genetic programming is a developing approach to automatic programming, which, rather than treating programming as a design process, treats it as a search process. However, the space of possible programs is infinite, and finding the right program requires a powerful search process. Fortunately for us, we are surrounded by a monotonous search process capable of producing viable systems of great complexity: evolution. Evolution is the inspiration behind genetic programming. Genetic programming copies the process and genetic operators of biological evolution but does not take any inspiration from the biological representations to which they are applied. It can be argued that the program representation that genetic programming does use is not well suited to evolution. Biological representations, by comparison, are a product of evolution and, a fact to which this book is testament, describe computational structures. This chapter is about enzyme genetic programming, a form of genetic programming that mimics biological representations in an attempt to improve the evolvability of programs. Although it would be an advantage to have a familiarity with both genetic programming and biological representations, concise introductions to both these subjects are provided. According to modern biological understanding, evolution is solely responsible for the complexity we see in the structure and behavior of biological organisms. Nevertheless, evolution itself is a simple process that can occur in any population of imperfectly replicating entities where the right to replicate is determined by a process of selection. Consequently, given an appropriate model of such an environment, evolution can also occur within computers.

Download Full-text

Self-adaptation of genetic operators through genetic programming techniques

Proceedings of the Genetic and Evolutionary Computation Conference on - GECCO '17 ◽

10.1145/3071178.3071214 ◽

2017 ◽

Cited By ~ 6

Author(s):

Andres Felipe Cruz-Salinas ◽

Jonatan Gomez Perdomo

Keyword(s):

Genetic Programming ◽

Genetic Operators ◽

Programming Techniques ◽

Self Adaptation

Download Full-text

Multimodal Genetic Programming by Using Tree Structure Similarity

Transactions of the Society of Instrument and Control Engineers ◽

10.9746/sicetr.54.640 ◽

2018 ◽

Vol 54 (8) ◽

pp. 640-649

Author(s):

Shubu YOSHIDA ◽

Tomohiro HARADA ◽

Ruck THAWONMAS

Keyword(s):

Genetic Programming ◽

Tree Structure ◽

Structure Similarity

Download Full-text

Multi-tree Genetic Programming with A New Fitness Function for Melanoma Detection

10.26686/wgtn.12616751.v1 ◽

2020 ◽

Author(s):

Q Ul Ain ◽

Bing Xue ◽

Harith Al-Sahaf ◽

Mengjie Zhang

Keyword(s):

Genetic Programming ◽

Fitness Function ◽

Local Binary Patterns ◽

Skin Lesions ◽

Color Variation ◽

Classification Model ◽

Genetic Operators ◽

Melanoma Detection ◽

Multiple Trees ◽

Performance Gains

© 2019 IEEE. The occurrence of malignant melanoma had enormously increased since past decades. For accurate detection and classification, not only discriminative features are required but a properly designed model to combine these features effectively is also needed. In this study, the multi-tree representation of genetic programming (GP) has been utilised to effectively combine different types of features and evolve a classification model for the task of melanoma detection. Local binary patterns have been used to extract pixel-level informative features. For incorporating the properties of ABCD (asymmetrical property, border shape, color variation and geometrical characteristics) rule of dermoscopy, various features have been used to include local and global information of the skin lesions. To meet the requirements of the proposed multi-tree GP representation, genetic operators such as crossover and mutation are designed accordingly. Moreover, a new weighted fitness function is designed to evolve better GP individuals having multiple trees influencing each other's performance during the evolution, in order to get overall performance gains. The performance of the new method is checked on two benchmark skin image datasets, and compared with six widely used classification algorithms and the single tree GP method. The experimental results have shown that the proposed method has significantly outperformed all these classification methods.

Download Full-text

A multi-tree genetic programming representation for melanoma detection using local and global features

10.26686/wgtn.12616742.v1 ◽

2020 ◽

Author(s):

Q Ul Ain ◽

Harith Al-Sahaf ◽

Bing Xue ◽

Mengjie Zhang

Keyword(s):

Skin Cancer ◽

Genetic Programming ◽

Early Stage ◽

Extraction Methods ◽

Image Features ◽

Color Variation ◽

Genetic Operators ◽

Global Features ◽

Melanoma Detection ◽

Performance Gains

© Springer Nature Switzerland AG 2018. Melanoma is the deadliest type of skin cancer that accounts for nearly 75% of deaths associated with it. However, survival rate is high, if diagnosed at an early stage. This study develops a novel classification approach to melanoma detection using a multi-tree genetic programming (GP) method. Existing approaches have employed various feature extraction methods to extract features from skin cancer images, where these different types of features are used individually for skin cancer image classification. However they remain unable to use all these features together in a meaningful way to achieve performance gains. In this work, Local Binary Pattern is used to extract local information from gray and color images. Moreover, to capture the global information, color variation among the lesion and skin regions, and geometrical border shape features are extracted. Genetic operators such as crossover and mutation are designed accordingly to fit the objectives of our proposed method. The performance of the proposed method is assessed using two skin image datasets and compared with six commonly used classification algorithms as well as the single tree GP method. The results show that the proposed method significantly outperformed all these classification methods. Being interpretable, this method may help dermatologist identify prominent skin image features, specific to a type of skin cancer.

Download Full-text