Automated Design of Genetic Programming Classification Algorithms Using a Genetic Algorithm

The main objective of this chapter is to propose a new effective algorithm to design stream cipher systems automatically using simulated annealing algorithm and genetic programming with a different method for representing the genetic programming population individuals. Usually the individual programs represented as LISP expressions; in the proposed method the programs are represented as strings of integers representing the individual program syntactic rule numbers. Genetic programming with this representation method is called genetic algorithm for developing software (GADS). The performance of the proposed algorithm will be studied by applying different genetic methods and parameters. Furthermore, it will be compared with other representation methods such as LISP expression.

Download Full-text

Evolutionary Machine Learning for Classification with Incomplete Data

10.26686/wgtn.17072123 ◽

2021 ◽

Author(s):

◽

Cao Truong Tran

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Genetic Programming ◽

Incomplete Data ◽

Missing Values ◽

Machine Learning Techniques ◽

Feature Construction ◽

Classification Algorithms ◽

Learning Techniques ◽

Effectiveness And Efficiency

<p>Classification is a major task in machine learning and data mining. Many real-world datasets suffer from the unavoidable issue of missing values. Classification with incomplete data has to be carefully handled because inadequate treatment of missing values will cause large classification errors. Existing most researchers working on classification with incomplete data focused on improving the effectiveness, but did not adequately address the issue of the efficiency of applying the classifiers to classify unseen instances, which is much more important than the act of creating classifiers. A common approach to classification with incomplete data is to use imputation methods to replace missing values with plausible values before building classifiers and classifying unseen instances. This approach provides complete data which can be then used by any classification algorithm, but sophisticated imputation methods are usually computationally intensive, especially for the application process of classification. Another approach to classification with incomplete data is to build a classifier that can directly work with missing values. This approach does not require time for estimating missing values, but it often generates inaccurate and complex classifiers when faced with numerous missing values. A recent approach to classification with incomplete data which also avoids estimating missing values is to build a set of classifiers which then is used to select applicable classifiers for classifying unseen instances. However, this approach is also often inaccurate and takes a long time to find applicable classifiers when faced with numerous missing values. The overall goal of the thesis is to simultaneously improve the effectiveness and efficiency of classification with incomplete data by using evolutionary machine learning techniques for feature selection, clustering, ensemble learning, feature construction and constructing classifiers. The thesis develops approaches for improving imputation for classification with incomplete data by integrating clustering and feature selection with imputation. The approaches improve both the effectiveness and the efficiency of using imputation for classification with incomplete data. The thesis develops wrapper-based feature selection methods to improve input space for classification algorithms that are able to work directly with incomplete data. The methods not only improve the classification accuracy, but also reduce the complexity of classifiers able to work directly with incomplete data. The thesis develops a feature construction method to improve input space for classification algorithms with incomplete data by proposing interval genetic programming-genetic programming with a set of interval functions. The method improves the classification accuracy and reduces the complexity of classifiers. The thesis develops an ensemble approach to classification with incomplete data by integrating imputation, feature selection, and ensemble learning. The results show that the approach is more accurate, and faster than previous common methods for classification with incomplete data. The thesis develops interval genetic programming to directly evolve classifiers for incomplete data. The results show that classifiers generated by interval genetic programming can be more effective and efficient than classifiers generated the combination of imputation and traditional genetic programming. Interval genetic programming is also more effective than common classification algorithms able to work directly with incomplete data. In summary, the thesis develops a range of approaches for simultaneously improving the effectiveness and efficiency of classification with incomplete data by using a range of evolutionary machine learning techniques.</p>

Download Full-text

Feature Construction and Selection Using Genetic Programming and a Genetic Algorithm

Lecture Notes in Computer Science - Genetic Programming ◽

10.1007/3-540-36599-0_21 ◽

2003 ◽

pp. 229-237 ◽

Cited By ~ 18

Author(s):

Matthew G. Smith ◽

Larry Bull

Keyword(s):

Genetic Algorithm ◽

Genetic Programming ◽

Feature Construction

Download Full-text

Computer-Assisted Design of Image Classification Algorithms: Dynamic and Static Fitness evaluations in a Scaffolded Genetic Programming Environment

Genetic Programming 1996 ◽

10.7551/mitpress/3242.003.0039 ◽

1996 ◽

Keyword(s):

Genetic Programming ◽

Image Classification ◽

Computer Assisted ◽

Classification Algorithms ◽

Programming Environment ◽

Computer Assisted Design

Download Full-text

Gene Expression Programming

Bio-Inspired Computing for Information Retrieval Applications - Advances in Knowledge Acquisition, Transfer, and Management ◽

10.4018/978-1-5225-2375-8.ch010 ◽

2017 ◽

pp. 269-292 ◽

Cited By ~ 2

Author(s):

Baddrud Zaman Laskar ◽

Swanirbhar Majumder

Keyword(s):

Gene Expression ◽

Genetic Algorithm ◽

Natural Selection ◽

Genetic Programming ◽

Soft Computing ◽

Gene Expression Programming ◽

Research Work ◽

Genetic Operators ◽

A New Technique ◽

Work Done

Gene expression programming (GEP) introduced by Candida Ferreira is a descendant of genetic algorithm (GA) and genetic programming (GP). It takes the advantage of both the optimization and search technique based on genetics and natural selection as GA and its programmatic Darwinian counterpart GP. It is gaining popularity because; it has to some extent eradicated the ‘cons' of both while keeping in the ‘pros'. It is still a new technique not much explored since its introduction in 2001. In this chapter both GA and GP is first discussed followed by the elaborate discussion of GEP. This is followed up by the discussion on research work done is different fields using GEP as a tool followed up by GEP architectures. Finally, here GEP has been used for detection of age from facial features as a soft computing based optimization problem using genetic operators.

Download Full-text

Automated Design of Classification Algorithms

Automated Design of Machine Learning and Search Algorithms - Natural Computing Series ◽

10.1007/978-3-030-72069-8_10 ◽

2021 ◽

pp. 171-184

Author(s):

Nelishia Pillay ◽

Thambo Nyathi

Keyword(s):

Automated Design ◽

Classification Algorithms

Download Full-text

The GA-P: a genetic algorithm and genetic programming hybrid

IEEE Expert ◽

10.1109/64.393137 ◽

1995 ◽

Vol 10 (3) ◽

pp. 11-15 ◽

Cited By ~ 53

Author(s):

L.M. Howard ◽

D.J. D'Angelo

Keyword(s):

Genetic Algorithm ◽

Genetic Programming

Download Full-text

Genetic Programming over Context-Free Languages with Linear Constraints for the Knapsack Problem: First Results

Evolutionary Computation ◽

10.1162/106365602317301772 ◽

2002 ◽

Vol 10 (1) ◽

pp. 51-74 ◽

Cited By ~ 3

Author(s):

Peter Bruhn ◽

Andreas Geyer-Schulz

Keyword(s):

Genetic Algorithm ◽

Combinatorial Optimization ◽

Genetic Programming ◽

Knapsack Problem ◽

Linear Constraints ◽

Penalty Functions ◽

Multidimensional Knapsack Problem ◽

Multidimensional Knapsack ◽

First Results ◽

Context Free

In this paper, we introduce genetic programming over context-free languages with linear constraints for combinatorial optimization, apply this method to several variants of the multidimensional knapsack problem, and discuss its performance relative to Michalewicz's genetic algorithm with penalty functions. With respect to Michalewicz's approach, we demonstrate that genetic programming over context-free languages with linear constraints improves convergence. A final result is that genetic programming over context-free languages with linear constraints is ideally suited to modeling com-plementarities between items in a knapsack problem: The more complementarities in the problem, the stronger the performance in comparison to its competitors.

Download Full-text