Revisiting where are the hard knapsack problems? via Instance Space Analysis

Various criteria and algorithms can be used for clustering, leading to very distinct outcomes and potential biases towards datasets with certain structures. More generally, the selection of the most effective algorithm to be applied for a given dataset, based on its characteristics, is a problem that has been largely studied in the field of meta-learning. Recent advances in the form of a new methodology known as Instance Space Analysis provide an opportunity to extend such meta-analyses to gain greater visual insights of the relationship between datasets’ characteristics and the performance of different algorithms. The aim of this study is to perform an Instance Space Analysis for the first time for clustering problems and algorithms. As a result, we are able to analyze the impact of the choice of the test instances employed, and the strengths and weaknesses of some popular clustering algorithms, for datasets with different structures.

Download Full-text

An instance space analysis of combat simulations to understand the impact of force and information advantage on survival ratios

10.36334/modsim.2021.m7.smith-miles ◽

2021 ◽

Keyword(s):

Space Analysis ◽

Information Advantage ◽

Instance Space ◽

The Impact

Download Full-text

Instance Space Analysis of Combinatorial Multi-objective Optimization Problems

2020 IEEE Congress on Evolutionary Computation (CEC) ◽

10.1109/cec48606.2020.9185664 ◽

2020 ◽

Author(s):

Estefania Yap ◽

Mario A. Munoz ◽

Kate Smith-Miles ◽

Arnaud Liefooghe

Keyword(s):

Optimization Problems ◽

Multi Objective Optimization ◽

Space Analysis ◽

Multi Objective ◽

Instance Space

Download Full-text

Instance space analysis for a personnel scheduling problem

Annals of Mathematics and Artificial Intelligence ◽

10.1007/s10472-020-09695-2 ◽

2020 ◽

Author(s):

Lucas Kletzander ◽

Nysret Musliu ◽

Kate Smith-Miles

Keyword(s):

Scheduling Problem ◽

Personnel Scheduling ◽

Space Analysis ◽

Instance Space

Download Full-text

An Instance Space Analysis of Regression Problems

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3436893 ◽

2021 ◽

Vol 15 (2) ◽

pp. 1-25

Author(s):

Mario Andrés Muñoz ◽

Tao Yan ◽

Matheus R. Leal ◽

Kate Smith-Miles ◽

Ana Carolina Lorena ◽

...

Keyword(s):

Visual Analytics ◽

Visual Analysis ◽

Predictive Performance ◽

Test Problems ◽

Algorithm Performance ◽

Space Analysis ◽

Regression Algorithms ◽

Regression Techniques ◽

Regression Problems ◽

Instance Space

The quest for greater insights into algorithm strengths and weaknesses, as revealed when studying algorithm performance on large collections of test problems, is supported by interactive visual analytics tools. A recent advance is Instance Space Analysis, which presents a visualization of the space occupied by the test datasets, and the performance of algorithms across the instance space. The strengths and weaknesses of algorithms can be visually assessed, and the adequacy of the test datasets can be scrutinized through visual analytics. This article presents the first Instance Space Analysis of regression problems in Machine Learning, considering the performance of 14 popular algorithms on 4,855 test datasets from a variety of sources. The two-dimensional instance space is defined by measurable characteristics of regression problems, selected from over 26 candidate features. It enables the similarities and differences between test instances to be visualized, along with the predictive performance of regression algorithms across the entire instance space. The purpose of creating this framework for visual analysis of an instance space is twofold: one may assess the capability and suitability of various regression techniques; meanwhile the bias, diversity, and level of difficulty of the regression problems popularly used by the community can be visually revealed. This article shows the applicability of the created regression instance space to provide insights into the strengths and weaknesses of regression algorithms, and the opportunities to diversify the benchmark test instances to support greater insights.

Download Full-text