1. Travelling Salesman Problem
The traveling salesman problem (TSP) seeks to find the shortest route between a set of points
(cities) that a salesperson must visit exactly once before returning their starting city.
The goal is to minimise the distance travelled by the salesperson, as a proxy measure for
costs incurred. Our studies have focused on Euclidean TSPs, and the performance of two well-known
variants of the Lin-Kernighan heuristic: one that exploits cluster structure in a TSP instance (set of cities),
and one that doesn't. The instance space analysis supports the intuition that the heuristic that
exploits cluster structure is more powerful, except on instances where little cluster structure exists.
The test instances used to build the instance space include randomly generated 100-node TSPs, and instances
that have been evolved to be hard or easy for the two heuristics, as described in the papers below.
Research Publications |
Downloads |
Instance Space Analysis |
1. Smith-Miles, K. A., van Hemert, J., Lim, Y., “Understanding TSP Difficulty by Learning from Evolved Instances”, Proceedings of Learning and Intelligent Optimization, LION 4, Lecture Notes in Computer Science, vol. 6073, pp. 266-280, 2010.
2. Smith-Miles, K. and van Hemert, J. “Discovering the Suitability of Optimisation Algorithms by Learning from Evolved Instances”, Annals of Mathematics and Artificial Intelligence, vol. 61, no. 2, pp. 87-104, 2011.
3. Smith-Miles, K. and Tan, T. “Measuring Algorithm Footprints in Instance Space”, in Proceedings of the 2012 IEEE Congress on Evolutionary Computation, pp. 3446-3453, 2012
|
Instances
Metadata
Feature Extraction Code
|
|
2. Graph Coloring Problem
In graph theory, graph or vertex colouring aims to colour the vertices of a graph
such that no two adjacent vertices share the same color. Our 2014 study considered the
benchmark graphs commonly used in computational studies
of graph colouring algorithms, as well as a collection of eight state-of-the-art heuristics.
The instance space reveals pockets where the "on-average-best" performing algorithm is not as well suited as other
algorithms, and the unique strengths and weaknesses of each algorithm can be visualised. Our extended study in 2015,
focusing on 100-node graphs,
showed how new graphs could be evolved to fill the instance space, challenging the conclusions that are formed
when only graphs from existing generators are studied.
Research Publications |
Downloads |
Instance Space Analysis |
Smith-Miles, K. A., Baatar, D., Wreford, B. and Lewis, R., “Towards Objective Measures of Algorithm Performance across Instance Space”, Computers and Operations Research, vol. 45, pp. 12-24, 2014.
|
Metadata
Code for Feature Selection
|
|
Smith-Miles, K. A. and Bowly, S., "Generating New Test Instances by Evolving in Instance Space", Computers & Operations Research, vol. 63, pp. 102-113, 2015. |
Instances
Metadata
|
|
3. MaxCut Graph Problem
The Maximum Cut (MaxCut) graph problem involves partitioning the nodes of a
graph into two subsets such that the number of edges between the subsets is maximised.
There also exist generalisations of this problem such as the n-MaxCut problem, where the aim is
to partition the nodes into n subsets, and the weighted MaxCut problem, where each edge is assigned
a numerical weight and the aim is to maximise the sum of the edge weights between subsets.
We have constructed an instance space focusing on the unweighted 2-MaxCut problem.
MaxCut is a well studied NP-Hard problem, with many researchers attempting to find new and innovative algorithms.
In this research, we focus on the Adiabatic Quantum Computing algorithm to solve the MaxCut graph.
The aim of this project is to run the algorithm on a wide variety of different graph instances and
investigate how the features of the graph instances make it easy and hard for adiabatic quantum computing when solving MaxCut.
4. Knapsack Problem
Given a set of items, each with a weight and a value, the Knapsack Problem aims to determine the number of
each item to select so that the total value of the knapsack is as large as possible, while the total weight
meets the maximum capacity constraint. Our studies have focused on the 0-1 knapsack problem, where each
item is either selected or not, and multiple copies of each item are not considered. We have constructed
an instance space of well known benchmark instances aiming to show where the hard instances lie, in response to
efforts to generate harder knapsack instances.
5. Bin Packing Problem
The one-dimensional Bin packing problem (1DBPP) is one of the best-known optimisation problems, and its
structure and applications have been actively studied. In the 1DBPP there are a set of items,
each with its own weight, and an infinite number of bins, all with the same capacity. The objective
is to minimise the total number of bins used, subject to the constraints that every item is placed
in a bin, and the sum of the weights of the items in each bin does not exceed the capacity.
There is also an analog of the 1DBPP in two dimensions. In the two-dimensional
bin packing problem (2DBPP), the bins and items are rectangular. All the bins have
equal widths and heights, and the optimisation objective is to minimise the total
number of bins used while placing the objects into the bins without overlap. During
the placement process, items cannot be rotated.
The goal of this research is to use the methodology of Instance Space Analysis to
study the 1DBPP and 2DBPP. The strengths and
weaknesses of common heuristics are analyzed by exploring the relationships between instance
features and algorithm performances, and scrutinising the diversity of common benchmark test instances.
* The instance-space was originally generated without "random" instances, which were projected on the original instance-space later.
6. Linear Programming
Linear programming is an optimization technique for finding the optimal decision variables that maximise or
minimise a linear objective function subject to a set of linear inequality or equality constraints. Our
studies have focused on understanding how the various strategies used in simplex algorithms are influenced by
the properties of the constraint set. We have also developed methods for generating feasible bounded instances to
enable a more diverse set of instances to be considered than existing benchmark problems offer.
Research Publications |
Downloads |
Instance Space Analysis |
|
Coming Soon |
Coming Soon |
7. Mixed Integer Programming
Mixed integer programming extends linear programming by constraining some variables to only integer or binary values.
Our studies have focused on generating small instances which are challenging for branch variable selection methods.
Specifically, we evolve instances which show a large discrepancy in the size of the branch and bound tree generated by
different branch variable selection methods.
Research Publications |
Downloads |
Instance Space Analysis |
|
Coming Soon |
Coming Soon |
8. Black-Box Optimization
The goal in a continuous optimization problem is to maximize or minimize a function whose input and output
variables are real numbers, subject to some constraints. Often, these problems lack an algebraic
expression, have non-calculable derivatives, exhibit uncertainty or noise, or may involve time-consuming
simulations or experiments. All that is known is a black-box output response to a given input. Therefore,
sampling the inputs and obtaining the output response is the only way to acquire information about the problem.
This approach is known as black-box optimization (BBO). Our studies have focused on both single-objective and
multi-objective BBO problems. For single-objective BBO we have constructed an instance space of the well-studied COCO
benchmarks, and then filled the instance space by evolving new test instances exhibiting a wide variety of problem features
and landscapes. Our ongoing work in multi-objective BBO is developing features that can characterise the difficulties of
MOO problems, and the robustness of existing test instance benchmarks.
Research Publications |
Downloads |
Instance Space Analysis |
Muñoz, M. A. and Smith-Miles, K. A., "Performance analysis of continuous black-box optimization algorithms via footprints in instance space", Evolutionary Computation, vol. 25, no. 4, pp. 529-554, 2017.
|
Metadata
Instances
Code for Feature Selection
|
|
M.A. Muñoz and K. Smith-Miles. Generating new space-filling test instances for continuous black-box optimization. Evolutionary Computation, vol. 28, no. 3, pp. 379-404, 2020.
|
Instances
2D Contour Plot Images
Code for Feature Selection
|
|
9. Multi-Objective Continuous Black-Box Optimisation
Multi-objective optimisation involves the simultaneous maximisation or minimisation of two or more objective functions.
When the objectives are not in conflict with each other, the problem is trivial as an MOP, since it can be solved for
a single best solution. However, when the objectives are conflicting, there are trade-offs between the objectives
whereby a solution is better for one (or more) objective(s) but worse for the others. Our studies focus on three
contributions: 1) exploring the diversity of existing suites by tuning problem parameters, 2) consolidating many
of the continuous multi-objective suites which are typically considered in isolation and understanding how well
they represent the entire instance space, and 3) supporting and exploring new methods for constructing interesting
benchmark problems which differ from the already available instances.
Research Publications |
Downloads |
Instance Space Analysis |
Yap, E., Muñoz, M. A. and Smith-Miles, K., "Informing multi-objective optimisation benchmark construction through Instance Space Analysis", IEEE Transactions on Evolutionary Computation, vol. 26, no. 6, pp. 1246-1260, 2022.
|
Metadata
Supplementary Material
Code for Features Calculation
Instances
|
|
10. Job Shop Scheduling
The Early-Tardy (E/T) Scheduling Problem is motivated by Just-in-Time production, which required delivery
of goods to be made at the time required. Both early and late production are discouraged, as early production
incurs holding costs, and late delivery means a loss of customer goodwill. The problem we consider is the single
machine, distinct due date, early/tardy scheduling problem, where each job has an earliness and tardiness penalty
and due date. Once a job is dispatched on the machine, it runs to completion with no
interruptions permitted. The objective is to minimise the total penalty produced when each job is scheduled as
close as possible to its due date. We evaluate the performance of two of the simplest and most commonly used dispatching
heuristics for the E/T scheduling problem:Earliest Due Date and Shortest Processing Time heuristics. The instance space analysis
provides visual confirmation of the relationships discovered in a rule-based machine learning analysis in the 2009 paper below, and
identifies the conditions under which each heuristic is better suited.
Research Publications |
Downloads |
Instance Space Analysis |
Smith-Miles, K. A., James, R. J. W., Giffin, J. W. and Tu, Y., “A Knowledge Discovery Approach
to Understanding Relationships between Scheduling Problem Structure and Heuristic Performance”,
Selected papers of the 3rd International Conference on Learning and Intelligent Optimization,
Lecture Notes in Computer Science, vol. 5851, pp. 89-103, 2009. |
Metadata
|
|
11. Timetabling
Udine Timetabling, also known as the curriculum-based course timetabling problem, was the subject of
track 3 of the 2007 International Timetabling Competition (ITC2007). 21 real-world instances from the
University of Udine were available. In a series of studies below, we have taken the top 2 performing
algorithms in ITC2007 - simulated annealing with constraint propagation (SACP) and tabu search over weighted
constraint satisfaction problem (TSCS) and examined their performance on instances from a variety of sources.
Besides the 21 competition real-world instances, we used instances from a random generator of Burke et al. (2010),
and instances we generated by using machine learning to adapt the parameters of this random generator to create
instances that are i) real-world like, and ii) more discriminating of unique algorithm performance. A total of
8199 instances from these three classes are contained in the meta-data, along with 32 timetabling features.
Applying instance space analysis now to these previous studies supports the findings in the papers below
(which used decision trees and self-organising maps at that time). There are features such as slack that
determine unique algorithm behaviour, other features such as properties of the teacher connectivity graph that
determine tied behaviours, and features that explain the key differences between the randomly generated instances
and real-world instances.
Research Publications |
Downloads |
Instance Space Analysis |
Smith-Miles, K. A.,
and Lopes, L. B., “Generalising Algorithm Performance in Instance Space: A timetabling
case study”, Proceedings of the 5th Learning and Intelligent Optimization conference, LION 5,
Lecture Notes in Computer Science, vol. 6683, pp. 524-539, Springer, 2011.
Lopes, L. and Smith-Miles, K., “Generating Applicable Synthetic Instances
for Branch Problems”, Operations Research, vol. 61, no. 3, pp. 563-577, 2013.
Lopes, L. B. and Smith-Miles, K. A., “Pitfalls in Instance Generation for Udine Timetabling”, Proceedings of the 4th Learning and Intelligent Optimization conference, LION 4, Lecture Notes in Computer Science, vol. 6073, pp. 299-302, Springer, 2010.
Smith-Miles, K. and Muñoz, M. A., "Instance Space Analysis for Algorithm Testing:
Methodology and Software Tools", ACM Computing Surveys, vol. 55, no, 12, 2023.
|
Metadata
Summary of Instance Space Analysis
|
|
De Coster, A. Musliu, N., Schaerf, A., Schoisswohl, J. and Smith-Miles, K., "Algorithm Selection and Instance Space Analysis for Curriculum-based Course Timetabling", Journal of Scheduling, vol. 25, pp. 35–58, 2022.
|
Metadata
Instances
Code for Feature Selection
|
|
12. Maximum Flow Problem
The Maximum Flow Problem (MFP) involves finding the maximum
volume of flow of a commodity that could be sent from a source
node to a sink node through a network. This problem has been
studied for more than six decades and different polynomial time
algorithms has been proposed for it. Although almost all MFP
algorithms are supported by strong theoretical worst-case
analyses, the judgement about which algorithm is best for a
particular instance or class of instances is still unclear.
The
aim of this research is to construct and analyze the instance
space of MFP for the first time, including proposing suitable
features of MFP instances that capture the algorithms’
performances. Additionally, this research contributes to the
literature by expanding the ISA methodology to address the issue
of how benchmark instances should be selected to reduce bias in
the analysis. Using the enhanced ISA methodology with MFP as a
case study, the paper demonstrates that the most important
instance features affecting algorithm performance can be
detected, and machine learning methods can learn their impact on
algorithm performance, without the bias caused by
over-representation within the selected sample of test
instances. The methodology enables new insights into the
performance of state-of-the-art general-purpose MFP algorithms,
as well as recommendations for the construction of comprehensive
and unbiased benchmark test suites for MFP algorithm testing.
Here, three different instance spaces, i.e., Initial,
Augmented, and Unbiased, can be explored for MFP. The initial
instance space is constructed by the initial metadata, which
includes 8508 instances. The Augmented instance space is
constructed based on the augmented metadata, which is obtained
by adding (generating) 16893 new instances with specific
characteristics to the initial metadata to fill the sparse areas
of the initial instance space. Applying the instance selection
process on the augmented metadata, a filtered metadata with only
3857 out of 25401 instances is obtained. The unbiased instance
space is constructed using this filtered metadata.
Research Publications |
Downloads |
Instance Space Analysis |
Alipour, H., Muñoz, M. A. and Smith-Miles, K., "Enhanced Instance Space Analysis for the Maximum Flow Problem", European Journal of Operational Research, vol. 304, no. 2, pp. 411-428, 2022.
|
Metadata-unbiased
Code for feature extraction and instance selection
|
|
Metadata-augmented
|
|
Metadata-initial
|
|