Choosing the fitness function for the job: Automated generation of test suites that detect real faults

In the present paper, we investigate an approach to intelligent support of the software white-box testing process based on an evolutionary paradigm. As a part of this approach, we solve the urgent problem of automated generation of the optimal set of test data that provides maximum statement coverage of the code when it is used in the testing process. We propose the formulation of a fitness function containing two terms, and, accordingly, two versions for implementing genetic algorithms (GA). The first term of the fitness function is responsible for the complexity of the code statements executed on the path generated by the current individual test case (current set of statements). The second term formulates the maximum possible difference between the current set of statements and the set of statements covered by the remaining test cases in the population. Using only the first term does not make it possible to obtain 100 percent statement coverage by generated test cases in one population, and therefore implies repeated launch of the GA with changed weights of the code statements which requires recompiling the code under the test. By using both terms of the proposed fitness function, we obtain maximum statement coverage and population diversity in one launch of the GA. Optimal relation between the two terms of fitness function was obtained for two very different programs under testing.

Download Full-text

The Fitness Function for the Job: Search-Based Generation of Test Suites That Detect Real Faults

2017 IEEE International Conference on Software Testing, Verification and Validation (ICST) ◽

10.1109/icst.2017.38 ◽

2017 ◽

Cited By ~ 15

Author(s):

Gregory Gay

Keyword(s):

Job Search ◽

Fitness Function ◽

Test Suites

Download Full-text

Automated generation of test suites from formal specifications of real-time reactive systems

Journal of Systems and Software ◽

10.1016/j.jss.2007.05.009 ◽

2008 ◽

Vol 81 (2) ◽

pp. 286-304 ◽

Cited By ~ 10

Author(s):

Mao Zheng ◽

Vasu Alagar ◽

Olga Ormandjieva

Keyword(s):

Real Time ◽

Reactive Systems ◽

Formal Specifications ◽

Automated Generation ◽

Test Suites

Download Full-text

KVEST: Automated Generation of Test Suites from Formal Specifications

FM’99 — Formal Methods - Lecture Notes in Computer Science ◽

10.1007/3-540-48119-2_34 ◽

1999 ◽

pp. 608-621 ◽

Cited By ~ 15

Author(s):

Igor Burdonov ◽

Alexander Kossatchev ◽

Alexander Petrenko ◽

Dmitri Galter

Keyword(s):

Formal Specifications ◽

Automated Generation ◽

Test Suites

Download Full-text

Modification Point Aware Test Prioritization and Sampling to Improve Patch Validation in Automatic Program Repair

Applied Sciences ◽

10.3390/app10051593 ◽

2020 ◽

Vol 10 (5) ◽

pp. 1593 ◽

Cited By ~ 1

Author(s):

Yazhini Venugopal ◽

Phung Quang-Ngoc ◽

Lee Eunseok

Keyword(s):

Fitness Function ◽

Test Cases ◽

Software Bugs ◽

Test Execution ◽

Automatic Program ◽

Repair Efficiency ◽

Program Repair ◽

Test Prioritization ◽

Automatic Program Repair ◽

Test Suites

Recently, Automatic Program Repair (APR) has shown a high capability of repairing software bugs automatically. In general, most of the APR techniques require test suites to validate automatically generated patches. However, the test suites used for patch validation might contain thousands of test cases. Running these whole test suites to validate every program variant makes the validation process not only time-consuming but also expensive. To mitigate this issue and to enhance the patch validation in APR, we introduce (1) MPTPS (Modification Point-aware Test Prioritization and Sampling), which iteratively records test execution. Based on the failed test information, it performs test prioritization, then sampling to reduce the test execution time by moving forward the test cases that are most likely to fail in the test suite; and (2) a new fitness function that refines the existing one to improve repair efficiency. We implemented our MPPEngine approach in the Astor workspace by extending jGenProg. And the experiments on the Defects4j benchmark against jGenProg show that, on average, jGenProg consumes 79.27 s to validate one program variant, where MPPEngine takes only 33.70 s for results in 57.50% of validation time reduction. Also, MPPEngine outperforms jGenProg by finding patches for six more bugs than jGenProg.

Download Full-text

Learning how to search: generating effective test cases through adaptive fitness function selection

Empirical Software Engineering ◽

10.1007/s10664-021-10048-8 ◽

2022 ◽

Vol 27 (2) ◽

Author(s):

Hussein Almulla ◽

Gregory Gay

Keyword(s):

Test Generation ◽

Fitness Function ◽

Generation Process ◽

Scoring Functions ◽

Strategic Choices ◽

Case Examples ◽

Fitness Functions ◽

Function Selection ◽

Test Suites ◽

Effective Fitness

AbstractSearch-based test generation is guided by feedback from one or more fitness functions—scoring functions that judge solution optimality. Choosing informative fitness functions is crucial to meeting the goals of a tester. Unfortunately, many goals—such as forcing the class-under-test to throw exceptions, increasing test suite diversity, and attaining Strong Mutation Coverage—do not have effective fitness function formulations. We propose that meeting such goals requires treating fitness function identification as a secondary optimization step. An adaptive algorithm that can vary the selection of fitness functions could adjust its selection throughout the generation process to maximize goal attainment, based on the current population of test suites. To test this hypothesis, we have implemented two reinforcement learning algorithms in the EvoSuite unit test generation framework, and used these algorithms to dynamically set the fitness functions used during generation for the three goals identified above. We have evaluated our framework, EvoSuiteFIT, on a set of Java case examples. EvoSuiteFIT techniques attain significant improvements for two of the three goals, and show limited improvements on the third when the number of generations of evolution is fixed. Additionally, for two of the three goals, EvoSuiteFIT detects faults missed by the other techniques. The ability to adjust fitness functions allows strategic choices that efficiently produce more effective test suites, and examining these choices offers insight into how to attain our testing goals. We find that adaptive fitness function selection is a powerful technique to apply when an effective fitness function does not already exist for achieving a testing goal.

Download Full-text

QUBEKit: Automating the Derivation of Force Field Parameters from Quantum Mechanics

10.26434/chemrxiv.7247045.v2 ◽

2019 ◽

Cited By ~ 1

Author(s):

Joshua Horton ◽

Alice Allen ◽

Leela Dodda ◽

Daniel Cole

Keyword(s):

Quantum Mechanics ◽

Force Field ◽

Organic Molecules ◽

Force Fields ◽

Liquid Density ◽

Small Organic Molecules ◽

Automated Generation ◽

Energy Of Hydration ◽

Novel Method ◽

Force Field Parameters

<div><div><div><p>Modern molecular mechanics force fields are widely used for modelling the dynamics and interactions of small organic molecules using libraries of transferable force field parameters. For molecules outside the training set, parameters may be missing or inaccurate, and in these cases, it may be preferable to derive molecule-specific parameters. Here we present an intuitive parameter derivation toolkit, QUBEKit (QUantum mechanical BEspoke Kit), which enables the automated generation of system-specific small molecule force field parameters directly from quantum mechanics. QUBEKit is written in python and combines the latest QM parameter derivation methodologies with a novel method for deriving the positions and charges of off-center virtual sites. As a proof of concept, we have re-derived a complete set of parameters for 109 small organic molecules, and assessed the accuracy by comparing computed liquid properties with experiment. QUBEKit gives highly competitive results when compared to standard transferable force fields, with mean unsigned errors of 0.024 g/cm3, 0.79 kcal/mol and 1.17 kcal/mol for the liquid density, heat of vaporization and free energy of hydration respectively. This indicates that the derived parameters are suitable for molecular modelling applications, including computer-aided drug design.</p></div></div></div>

Download Full-text