A Multi-dimensional Genetic Programming Approach for Multi-class Classification Problems

Genetic programming (GP) is a powerful machine learning (ML) algorithm that can produce readable white-box models. Although successfully used for solving an array of problems in different scientific areas, GP is still not well known in the field of remote sensing. The M3GP algorithm, a variant of the standard GP algorithm, performs feature construction by evolving hyperfeatures from the original ones. In this work, we use the M3GP algorithm on several sets of satellite images over different countries to create hyperfeatures from satellite bands to improve the classification of land cover types. We add the evolved hyperfeatures to the reference datasets and observe a significant improvement of the performance of three state-of-the-art ML algorithms (decision trees, random forests, and XGBoost) on multiclass classifications and no significant effect on the binary classifications. We show that adding the M3GP hyperfeatures to the reference datasets brings better results than adding the well-known spectral indices NDVI, NDWI, and NBR. We also compare the performance of the M3GP hyperfeatures in the binary classification problems with those created by other feature construction methods such as FFX and EFS.

Download Full-text

A genetic programming approach to feature construction for ensemble learning in skin cancer detection

Proceedings of the 2020 Genetic and Evolutionary Computation Conference ◽

10.1145/3377930.3390228 ◽

2020 ◽

Author(s):

Qurrat Ul Ain ◽

Harith Al-Sahaf ◽

Bing Xue ◽

Mengjie Zhang

Keyword(s):

Skin Cancer ◽

Genetic Programming ◽

Ensemble Learning ◽

Cancer Detection ◽

Programming Approach ◽

Feature Construction ◽

Skin Cancer Detection

Download Full-text

Confidence interval for micro-averaged F1 and macro-averaged F1 scores

Applied Intelligence ◽

10.1007/s10489-021-02635-5 ◽

2021 ◽

Author(s):

Kanae Takahashi ◽

Kouji Yamamoto ◽

Aya Kuchiba ◽

Tatsuki Koyama

Keyword(s):

Binary Classification ◽

Classification Problem ◽

Classification Problems ◽

Summary Measure ◽

Medical Field ◽

Predictive Values ◽

Binary Classification Problem ◽

Multi Class Classification ◽

Sensitivity Specificity ◽

Measures Of Performance

AbstractA binary classification problem is common in medical field, and we often use sensitivity, specificity, accuracy, negative and positive predictive values as measures of performance of a binary predictor. In computer science, a classifier is usually evaluated with precision (positive predictive value) and recall (sensitivity). As a single summary measure of a classifier’s performance, F1 score, defined as the harmonic mean of precision and recall, is widely used in the context of information retrieval and information extraction evaluation since it possesses favorable characteristics, especially when the prevalence is low. Some statistical methods for inference have been developed for the F1 score in binary classification problems; however, they have not been extended to the problem of multi-class classification. There are three types of F1 scores, and statistical properties of these F1 scores have hardly ever been discussed. We propose methods based on the large sample multivariate central limit theorem for estimating F1 scores with confidence intervals.

Download Full-text

Lee Spector  Automatic Quantum Computer Programming: A Genetic Programming Approach. Kluwer Academic Publishers (2004). ISBN 1-4020-7894-3. €100. 153 pp.

The Computer Journal ◽

10.1093/comjnl/bxh134 ◽

2005 ◽

Vol 49 (1) ◽

pp. 129-130

Author(s):

Anas N. Al-Rabadi

Keyword(s):

Genetic Programming ◽

Computer Programming ◽

Quantum Computer ◽

Programming Approach

Download Full-text

Binary Image Classification: A Genetic Programming Approach to the Problem of Limited Training Instances

Evolutionary Computation ◽

10.1162/evco_a_00146 ◽

2016 ◽

Vol 24 (1) ◽

pp. 143-182 ◽

Cited By ~ 10

Author(s):

Harith Al-Sahaf ◽

Mengjie Zhang ◽

Mark Johnston

Keyword(s):

Computer Vision ◽

Pattern Recognition ◽

Genetic Programming ◽

Visual System ◽

Image Classification ◽

Human Visual System ◽

Binary Classification ◽

Programming Approach ◽

Data Sets ◽

New Class

In the computer vision and pattern recognition fields, image classification represents an important yet difficult task. It is a challenge to build effective computer models to replicate the remarkable ability of the human visual system, which relies on only one or a few instances to learn a completely new class or an object of a class. Recently we proposed two genetic programming (GP) methods, one-shot GP and compound-GP, that aim to evolve a program for the task of binary classification in images. The two methods are designed to use only one or a few instances per class to evolve the model. In this study, we investigate these two methods in terms of performance, robustness, and complexity of the evolved programs. We use ten data sets that vary in difficulty to evaluate these two methods. We also compare them with two other GP and six non-GP methods. The results show that one-shot GP and compound-GP outperform or achieve results comparable to competitor methods. Moreover, the features extracted by these two methods improve the performance of other classifiers with handcrafted features and those extracted by a recently developed GP-based method in most cases.

Download Full-text

A NEW LINEAR GENETIC PROGRAMMING APPROACH BASED ON STRAIGHT LINE PROGRAMS: SOME THEORETICAL AND EXPERIMENTAL ASPECTS

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213009000391 ◽

2009 ◽

Vol 18 (05) ◽

pp. 757-781 ◽

Cited By ~ 7

Author(s):

CÉSAR L. ALONSO ◽

JOSÉ LUIS MONTAÑA ◽

JORGE PUENTE ◽

CRUZ ENRIQUE BORGES

Keyword(s):

Data Structure ◽

Genetic Programming ◽

Computer Programs ◽

Symbolic Regression ◽

Programming Approach ◽

Linear Genetic Programming ◽

Straight Line ◽

Structured Representations ◽

Regression Problems ◽

Straight Line Programs

Tree encodings of programs are well known for their representative power and are used very often in Genetic Programming. In this paper we experiment with a new data structure, named straight line program (slp), to represent computer programs. The main features of this structure are described, new recombination operators for GP related to slp's are introduced and a study of the Vapnik-Chervonenkis dimension of families of slp's is done. Experiments have been performed on symbolic regression problems. Results are encouraging and suggest that the GP approach based on slp's consistently outperforms conventional GP based on tree structured representations.

Download Full-text

Feature Selection and Classification of High Dimensional Mass Spectrometry Data: A Genetic Programming Approach

Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics - Lecture Notes in Computer Science ◽

10.1007/978-3-642-37189-9_5 ◽

2013 ◽

pp. 43-55 ◽

Cited By ~ 15

Author(s):

Soha Ahmed ◽

Mengjie Zhang ◽

Lifeng Peng

Keyword(s):

Mass Spectrometry ◽

Feature Selection ◽

Genetic Programming ◽

Mass Spectrometry Data ◽

High Dimensional ◽

Programming Approach

Download Full-text

Evolutionary Deep Learning: A Genetic Programming Approach to Image Classification

2018 IEEE Congress on Evolutionary Computation (CEC) ◽

10.1109/cec.2018.8477933 ◽

2018 ◽

Cited By ~ 5

Author(s):

Benjamin Evans ◽

Harith Al-Sahaf ◽

Bing Xue ◽

Mengjie Zhang

Keyword(s):

Deep Learning ◽

Genetic Programming ◽

Image Classification ◽

Programming Approach

Download Full-text

A Multi-dimensional Genetic Programming Approach for Multi-class Classification Problems

Genetic Programming with Random Binary Decomposition for Multi-Class Classification Problems

A Genetic Programming Approach for Evolving Variable Selectors in Constraint Programming

Improving Land Cover Classification Using Genetic Programming for Feature Construction

A genetic programming approach to feature construction for ensemble learning in skin cancer detection

Confidence interval for micro-averaged F1 and macro-averaged F1 scores

Lee Spector  Automatic Quantum Computer Programming: A Genetic Programming Approach. Kluwer Academic Publishers (2004). ISBN 1-4020-7894-3. €100. 153 pp.

Binary Image Classification: A Genetic Programming Approach to the Problem of Limited Training Instances

A NEW LINEAR GENETIC PROGRAMMING APPROACH BASED ON STRAIGHT LINE PROGRAMS: SOME THEORETICAL AND EXPERIMENTAL ASPECTS

Feature Selection and Classification of High Dimensional Mass Spectrometry Data: A Genetic Programming Approach

Evolutionary Deep Learning: A Genetic Programming Approach to Image Classification

Export Citation Format

A Multi-dimensional Genetic Programming Approach for Multi-class Classification Problems

Genetic Programming with Random Binary Decomposition for Multi-Class Classification Problems

A Genetic Programming Approach for Evolving Variable Selectors in Constraint Programming

Improving Land Cover Classification Using Genetic Programming for Feature Construction

A genetic programming approach to feature construction for ensemble learning in skin cancer detection

Confidence interval for micro-averaged F1 and macro-averaged F1 scores

Lee Spector Automatic Quantum Computer Programming: A Genetic Programming Approach. Kluwer Academic Publishers (2004). ISBN 1-4020-7894-3. €100. 153 pp.

Binary Image Classification: A Genetic Programming Approach to the Problem of Limited Training Instances

A NEW LINEAR GENETIC PROGRAMMING APPROACH BASED ON STRAIGHT LINE PROGRAMS: SOME THEORETICAL AND EXPERIMENTAL ASPECTS

Feature Selection and Classification of High Dimensional Mass Spectrometry Data: A Genetic Programming Approach

Evolutionary Deep Learning: A Genetic Programming Approach to Image Classification

Lee Spector  Automatic Quantum Computer Programming: A Genetic Programming Approach. Kluwer Academic Publishers (2004). ISBN 1-4020-7894-3. €100. 153 pp.