[PDF] Analysis of Evolutionary Diversity Optimisation for Permutation Problems

Abstract

Generating diverse populations of high quality solutions has gained interest as a promising extension to the traditional optimization tasks. We contribute to this line of research by studying evolutionary diversity optimization for two of the most prominent permutation problems, namely the Traveling Salesperson Problem (TSP) and Quadratic Assignment Problem (QAP). We explore the worst-case performance of a simple mutation-only evolutionary algorithm with different mutation operators, using an established diversity measure. Theoretical results show most mutation operators for both problems ensure production of maximally diverse populations of sufficiently small size within cubic expected run-time. We perform experiments on QAPLIB instances in unconstrained and constrained settings, and reveal much more optimistic practical performances. Our results should serve as a baseline for future studies.

Full PDF

AAnalysis of Evolutionary Diversity Optimisation for PermutationProblems

Anh Viet Do

Optimisation and LogisticsThe University of Adelaide, Adelaide, Australia

Mingyu Guo

Optimisation and LogisticsThe University of Adelaide, Adelaide, Australia

Aneta Neumann

Optimisation and LogisticsThe University of Adelaide, Adelaide, Australia

Frank Neumann

Optimisation and LogisticsThe University of Adelaide, Adelaide, Australia

ABSTRACT

Generating diverse populations of high quality solutions has gainedinterest as a promising extension to the traditional optimizationtasks. We contribute to this line of research by studying evolutionarydiversity optimization for two of the most prominent permutationproblems, namely the Traveling Salesperson Problem (TSP) andQuadratic Assignment Problem (QAP). We explore the worst-caseperformance of a simple mutation-only evolutionary algorithm withdifferent mutation operators, using an established diversity measure.Theoretical results show most mutation operators for both problemsensure production of maximally diverse populations of sufficientlysmall size within cubic expected run-time. We perform experimentson QAPLIB instances in unconstrained and constrained settings, andreveal much more optimistic practical performances. Our resultsshould serve as a baseline for future studies.

KEYWORDS

Evolutionary algorithms, diversity maximization, traveling salesper-son problem, quadratic assignment problem, run-time analysis

Evolutionary diversity optimization (EDO) aims to compute a setof diverse solutions that all have high quality while maximally dif-fering from each other. This area of research started by Ulrich andThiele [20, 21] has recently gained significant attention within theevolutionary computation community, as evolution itself is increas-ingly regarded as a diversification device rather than a pure objectiveoptimizer [17]. After all, in nature, deviating from the predecessorsleads to finding new niches, which reduces competitive pressureand increases evolvability [11]. This perspective challenges the no-tion that evolutionary processes are mainly adaptive with respectto some quality metrics, and that population diversity is only inservice of adapting its individuals and is without intrinsic worth. Inoptimization, diversity optimization is a useful extension to the tra-ditional optimization tasks, as a set of multiple interesting solutionshas more practical value than a single very good solution.Along this line of research, there have been studies that exploredifferent relationships between quality and diversity. A trend emerg-ing from the evolutionary robotics is Quality Diversity, which fo-cuses on exploring diverse niches in the feature space and maximiz-ing quality within each niche [3, 5, 9, 17]. This approach maximizesdiversity via niches discovery, meaning the what constitutes a nichein the solution space needs to be well-defined beforehand. Otherstudies place more importance on diversity measured directly fromsolutions, applying evolutionary techniques to generate images withvarying features [2], or to compute diverse Traveling SalespersonProblem (TSP) instances [4, 8] useful for automated algorithm se-lection and configuration [10]. Different indicators for measuringthe diversity of sets of solutions in EDO algorithms such as the stardiscrepancy [14] or popular indicators from the area of evolutionarymulti-objective optimization [15] have been investigated to createhigh quality sets of solutions. The study [6] explores EDO for the TSP, the first study on solution diversification for a combinatorialoptimization problem.In this study, we contribute to the understanding of evolution-ary diversity optimization on combinatorial problems. Specifically,we focus on TSP and Quadratic Assignment Problem (QAP), twofundamental NP-hard problems where solutions are representedas permutations, and the latter of which has also been attemptedwith genetic algorithms [1, 13, 18, 19]. The structures of the solu-tion spaces associated with these problems are similar, yet differentenough to merit distinct diversity measures. We use two approachesto measuring diversity: one based on the representation frequenciesof “objects” (edges or assignments) in the population, and one basedon the minimum distance between each solution and the rest. Weconsider the simple evolutionary algorithm that only uses mutation,and examine its worst-case performances in diversity maximizationwhen various mutation operators are used. Our results reveal howproperties of a population influence the effectiveness of mutationsin equalizing objects’ representation frequencies. Additionally, wecarried out experimental benchmark on various QAPLIB instancesin unconstrained (no quality threshold) and constrained settings,using a simple mutation-only algorithm with 2-opt mutation. Theresults indicate optimistic run-time to maximize diversity on QAPsolutions, and show maximization behaviors when using differentdiversity measures in the algorithm. With this, we extend the in-vestigation in [6] theoretically, and experimentally with regard toQAP.The paper is structured as follows. In Section 2, we introduce theTSP and QAP in the context of evolutionary diversity optimizationand describe the algorithm that is the subject of our analysis. InSection 3, we introduce the diversity measures for both problems.Section 4 consists of the run-time analysis of the introduced algo-rithm. We report on our experimental investigations in Section 5and finish with some conclusions.

Throughout the paper, we use the shorthand [ 𝑛 ] = { , . . . , 𝑛 } . Thesymmetric TSP is formulated as follow. Given a complete undirectedgraph 𝐺 = ( 𝑉 , 𝐸 ) with 𝑛 = | 𝑉 | nodes, 𝑚 = 𝑛 ( 𝑛 − )/ = | 𝐸 | edgesand the distance function 𝑑 : 𝑉 × 𝑉 → R ≥ , the goal is to computea tour of minimal cost that visits each node exactly once and finallyreturns to the original node. Let 𝑉 = [ 𝑛 ] , the goal is to find a tourrepresented by the permutation 𝜋 : 𝑉 → 𝑉 that minimizes the tourcost 𝑐 ( 𝜋 ) = 𝑑 ( 𝜋 ( 𝑛 ) , 𝜋 ( )) + 𝑛 − ∑︁ 𝑖 = 𝑑 ( 𝜋 ( 𝑖 ) , 𝜋 ( 𝑖 + )) . The QAP is formulated as follow. Given facilities 𝐹 = { 𝑓 , . . . , 𝑓 𝑛 } ,locations 𝐿 = { 𝑙 , . . . , 𝑙 𝑛 } , weights 𝑤 : 𝐹 × 𝐹 → R ≥ , flows 𝑓 : 𝐿 × 𝐿 → R ≥ , find a 1-1 mapping 𝑎 : 𝐹 → 𝐿 that minimizes the costfunction 𝑐 ( 𝑎 ) = ∑︁ 𝑖,𝑗 ∈ 𝐹 𝑤 ( 𝑖, 𝑗 ) 𝑓 ( 𝑎 ( 𝑖 ) , 𝑎 ( 𝑗 )) . a r X i v : . [ c s . N E ] F e b lgorithm 1 ( 𝜇 + ) -EA for diversity optimization 𝑃 ← initial population while stopping criteria not met do 𝐼 ← 𝑟𝑎𝑛𝑑𝑜𝑚𝑆𝑒𝑙𝑒𝑐𝑡 ( 𝑃 ) 𝐼 ′ ← 𝑚𝑢𝑡𝑎𝑡𝑒 ( 𝐼 ) if 𝑐 ( 𝐼 ′ ) ≤ ( + 𝛼 ) 𝑂𝑃𝑇 then 𝑃 ← 𝑃 ∪ { 𝐼 ′ } 𝐼 ′′ ← argmin 𝐽 ∈ 𝑃 { 𝑑𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦 ( 𝑃 \ { 𝐽 })} 𝑃 ← 𝑃 \ { 𝐼 ′′ } end ifend whilereturn 𝑃 A problem instance is encoded with two 𝑛 × 𝑛 matrices: one for 𝑤 andone for 𝑓 . Similar to TSP, we can abstract 𝐹 and 𝐿 like we do 𝑉 : 𝐹 = [ 𝑛 ] and 𝐿 = [ 𝑛 ] . Therefore, each mapping is uniquely defined by a [ 𝑛 ] → [ 𝑛 ] permutation. Given that there is a 1-to-1 correspondencebetween all permutations and all mappings, the solution space is thepermutation space. This is an important distinction between TSPand QAP from which low-level differences between the diversitymeasures in each case emerge. On the other hand, the high levelstructure of a tour is identical to that of a mapping, so the notionslike distance or diversity are the same for both above a certain layerof abstraction.In this paper, we consider diversity optimization for the TSP andthe QAP. For each problem instance, we are to find a set 𝑃 of 𝜇 = | 𝑃 | solution that is diverse with respect to some diversity measure, whileeach solution meets a given quality threshold. Let 𝑂𝑃𝑇 is the valueof an optimal solution, a solution 𝐼 satisfies the quality threshold iff 𝑐 ( 𝐼 ) ≤ ( + 𝛼 ) 𝑂𝑃𝑇 , where 𝛼 > ( + 𝛼 ) approximationsfor a problem instance. We assume that the optimal tour is knownfor a given TSP or QAP instance.We consider ( 𝜇 + ) -EA algorithm which was used to diversifyTSP tours [6]. The algorithm is described in Algorithm 1. It uses onlymutation to introduce new genes, and tries to minimize duplicationin the gene pool with elitist survival selection. The algorithm slightlymodifies the population in each step by mutating a random solution,essentially performing random local search in the population space.As with many evolutionary algorithms, it can be customized fordifferent problems, in this case by modifying the mutation operatorand the diversity measure. In this work, we are interested in worst-case performances of the algorithm under the assumption that anyoffspring is acceptable. The structure of a TSP tour is similar to that of a QAP mapping inthe sense that they are both each defined by a set of objects: edges intours and assignments in mappings. In fact, the size of such a set isalways equal to the instance size. For this reason, diversity measuresfor populations of tours, and those for populations of mappingsshare many commonalities. In particular, we describe two measuresintroduced in [6], customized for TSP and QAP. For consistency,we use the same notations for the same concepts between the twoproblems unless told otherwise. We also refer to [6] for more in-depth discussion on the measures, and fast implementations of thesurvival selection for Algorithm 1 based on these measures, whichcan be customized for QAP solutions.

In this approach, we consider diversity in terms of equal representa-tions of edges/assignments in the population. It takes into account, for each object, the number of solutions containing it, among the 𝜇 solutions in the population.For TSP, given a population of tours 𝑃 and an edge 𝑒 ∈ 𝐸 , wedenote by 𝑛 ( 𝑒, 𝑃 ) its edge count, which is defined, 𝑛 ( 𝑒, 𝑃 ) = |{ 𝑇 ∈ 𝑃 | 𝑒 ∈ 𝐸 ( 𝑇 )}| ∈ { , . . . , 𝜇 } where 𝐸 ( 𝑇 ) ⊂ 𝐸 is the set of edges used by tour 𝑇 . Then in order tomaximize the edge diversity we aim to minimize the vector N ( 𝑃 ) = sort ( 𝑛 ( 𝑒 , 𝑃 ) , 𝑛 ( 𝑒 , 𝑃 ) , . . . , 𝑛 ( 𝑒 𝑚 , 𝑃 )) , in the lexicographic order where sorting is performed in descendingorder. As shown in [6], this maximizes the pairwise distances sum 𝐷 ( 𝑃 ) = ∑︁ 𝑇 ∈ 𝑃 ∑︁ 𝑇 ∈ 𝑃 | 𝐸 ( 𝑇 ) \ 𝐸 ( 𝑇 )| . Similarly for QAP, given a population of mappings 𝑃 , we denoteby 𝑛 ( 𝑖, 𝑗, 𝑃 ) its assignment count as follow, 𝑛 ( 𝑖, 𝑗, 𝑃 ) = |{ 𝑎 ∈ 𝑃 |( 𝑖, 𝑗 ) ∈ 𝐴 ( 𝑎 )}| ∈ { , . . . , 𝜇 } where 𝐴 ( 𝑎 ) ⊂ [ 𝑛 ] × [ 𝑛 ] is the set of assignments used by solution 𝑎 . The corresponding vector to be minimized in order to maximizeassignment diversity is then N ( 𝑃 ) = sort ( 𝑛 ( 𝑖, 𝑗, 𝑃 )) 𝑖,𝑗 ∈[ 𝑛 ] , in the lexicographic order where sorting is performed in descendingorder. Similar, this maximizes the following quantity 𝐷 ( 𝑃 ) = ∑︁ 𝑎 ∈ 𝑃 ∑︁ 𝑏 ∈ 𝑃 | 𝐴 ( 𝑎 ) \ 𝐴 ( 𝑏 )| . While this diversity measure is directly related to the notionof diversity, using it to optimize populations has its drawbacks.As mentioned in [6], populations containing clustering subsets ofsolutions can have high 𝐷 score, which is undesirable. For thisreason, we also consider another measure that circumvents thisissue. Instead of maximizing all pairwise distances at once, this approachfocuses on maximizing smallest distances, potentially reducing largerdistances as a result. Optimizing for this measure reduces clusteringphenomena, as well as tends to increase the distance sum. In thisapproach, we minimize the following vector lexicographically D( 𝑃 ) = sort (cid:16)(cid:0) 𝑜 𝑋,𝑌 (cid:1)

𝑋,𝑌 ∈ 𝑃 (cid:17) , where sorting is performed in descending order, and 𝑜 𝑋𝑌 = | 𝐸 ( 𝑋 ) ∩ 𝐸 ( 𝑌 )| if 𝑋 and 𝑌 are TSP tours, and 𝑜 𝑋𝑌 = | 𝐴 ( 𝑋 ) ∩ 𝐴 ( 𝑌 )| if theyare QAP mappings. Doing this would also maximize the followingquantity 𝐷 ( 𝑃 ) = ∑︁ 𝑇 ∈ 𝑃 min 𝑋 ∈ 𝑃 \{ 𝑇 } {| 𝐸 ( 𝑇 ) \ 𝐸 ( 𝑋 )|} , or 𝐷 ( 𝑃 ) = ∑︁ 𝑎 ∈ 𝑃 min 𝑏 ∈ 𝑃 \{ 𝑇 } {| 𝐴 ( 𝑎 ) \ 𝐴 ( 𝑏 )|} . We can see that for any TSP tour population 𝑃 of size at most (cid:4) 𝑛 − (cid:5) ,we have argmin 𝑃 {N ( 𝑃 )} = argmin 𝑃 {D( 𝑃 )} = argmin 𝑃 { 𝐷 ( 𝑃 )} = argmin 𝑃 { 𝐷 ( 𝑃 )} . One of the results in this study implies that the same is true forany QAP mapping population of size at most 𝑛 . On the other hand,when 𝜇 > 𝑛 , 𝑃 ∗ ∈ argmax 𝑃 { 𝐷 ( 𝑃 )} doesn’t necessarily imply 𝑃 ∗ ∈ argmin 𝑃 {D( 𝑃 )} , as shown by the following example. xample 1. For a QAP instance where 𝑛 = and 𝜇 = , let 𝑎 = ( , , , ) , 𝑎 = ( , , , ) , 𝑎 = ( , , , ) , 𝑎 = ( , , , ) , 𝑎 = ( , , , ) , 𝑎 = ( , , , ) , 𝑎 = ( , , , ) , 𝑃 = { 𝑎 , 𝑎 , 𝑎 , 𝑎 , 𝑎 } , 𝑃 ′ = { 𝑎 , 𝑎 , 𝑎 , 𝑎 , 𝑎 } , we have 𝐷 ( 𝑃 ) = 𝐷 ( 𝑃 ′ ) = which is themaximum. However, D( 𝑃 ) = ( , , , , , , , , , ) > D( 𝑃 ′ ) = ( , , , , , , , , , ) . Because of this, it is tricky to determine the maximum achievablediversity D in such cases. For now, we rely on the upper bound 𝜇𝑛 of 𝐷 , which is relevant to our experimentation in Section 5. We investigate the theoretical performance of Algorithm 1 in op-timizing for N without the quality criterion. For TSP, we considerthree mutation operators: 2-opt, 3-opt (insertion) and 4-opt (ex-change). For QAP, we consider the 2-opt mutation where two assign-ments are swapped. In particular, we are interested in the number ofiterations until a population with optimal diversity is reached. Ourderivation of results is predicated on the lack of local optima: it isalways possible to strictly improves diversity in a single step of thealgorithm. Let 𝑑 𝑃 = max 𝑒 ∈ 𝐸 { 𝑛 ( 𝑒, 𝑃 )} and 𝑐 𝑃 = | 𝑒 ∈ 𝐸 | 𝑛 ( 𝑒, 𝑃 ) = 𝑑 𝑃 | . For eachnode 𝑖 , let 𝑖𝑛 ( 𝑖 ) be the set of edges incident to 𝑖 , and 𝑠 ( 𝑖, 𝑃 ) = (cid:205) 𝑒 ∈ 𝑖𝑛 ( 𝑖 ) 𝑛 ( 𝑒, 𝑃 ) . For each tour 𝐼 , let 2 𝑜𝑝𝑡 ( 𝐼, 𝑖, 𝑗 ) be the tour resultedfrom applying 2-opt to 𝐼 at positions 𝑖 and 𝑗 in the permutation, and4 𝑜𝑝𝑡 ( 𝐼, 𝑖, 𝑗 ) be the tour from exchanging 𝑖 -th and 𝑗 -th elements in 𝐼 .We assume 𝑛 ≥ 𝑛 ≥ 𝑐 𝑃 or 𝑑 𝑃 , asaligned with the algorithm’s convergence path.Lemma 1. Given a population of tours 𝑃 such that ≤ 𝜇 ≤ (cid:4) 𝑛 + (cid:5) and 𝑑 𝑃 ≥ , there exist a tour 𝐼 ∈ 𝑃 and a pair ( 𝑖, 𝑗 ) , such that 𝑃 ′ = ( 𝑃 \ { 𝐼 }) ∪ { 𝑜𝑝𝑡 ( 𝐼, 𝑖, 𝑗 )} satisfies, ( 𝑐 𝑃 > 𝑐 𝑃 ′ ∧ 𝑑 𝑃 = 𝑑 𝑃 ′ ) ∨ 𝑑 𝑃 > 𝑑 𝑃 ′ . (1) Moreover, in each iteration, the Algorithm 1 with 2-opt mutationand 𝛼 = ∞ makes such an improvement with probability at least [( 𝑛 − ) ( 𝑑 𝑃 − )+ ] 𝜇𝑛 ( 𝑛 − ) . Proof. There must be 𝑑 𝑃 tours 𝐼 in 𝑃 such that ∃ 𝑒 ∈ 𝐸 ( 𝐼 ) , 𝑛 ( 𝑒, 𝑃 ) = 𝑑 𝑃 , let 𝐼 be one such tour. W.l.o.g, let 𝐼 be represented by a permuta-tion of nodes ( 𝑖 , 𝑖 , . . . , 𝑖 𝑛 ) where 𝑛 (( 𝑖 , 𝑖 ) , 𝑃 ) = 𝑑 𝑃 . The operation2 𝑜𝑝𝑡 ( 𝐼, , 𝑘 ) trades edges ( 𝑖 , 𝑖 ) and ( 𝑖 𝑘 , 𝑖 𝑘 + ) in 𝐼 for ( 𝑖 , 𝑖 𝑘 ) and ( 𝑖 , 𝑖 𝑘 + ) . If 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 − 𝑛 (( 𝑖 , 𝑖 𝑘 + ) , 𝑃 ) < 𝑑 𝑃 −

1, then 𝑃 ′ = ( 𝑃 \ { 𝐼 }) ∪ { 𝑜𝑝𝑡 ( 𝐼, , 𝑘 )} satisfies (1) since 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ′ ) and 𝑛 (( 𝑖 , 𝑖 𝑘 + ) , 𝑃 ′ ) cannot reach 𝑑 𝑃 . We show that there is always such aposition 𝑘 . Since 𝑘 can only go from 3 to 𝑛 −

1, there are 𝑛 − 𝑘 . It’s the case that 𝑠 ( 𝑖, 𝑃 ) = 𝜇 for any 𝑖 since each tour con-tributes 2 to 𝑠 ( 𝑖, 𝑃 ) , and that 𝑛 (( 𝑖 𝑛 , 𝑖 ) , 𝑃 ) ≥ 𝑛 (( 𝑖 , 𝑖 ) , 𝑃 ) ≥ 𝐼 uses them, thus 𝑛 − ∑︁ 𝑘 = 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) ≤ 𝜇 − 𝑑 𝑃 − ≤ (cid:106) 𝑛 (cid:107) − 𝑑 𝑃 , and (2) 𝑛 ∑︁ 𝑘 = 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) ≤ (cid:106) 𝑛 (cid:107) − 𝑑 𝑃 . According to the pigeonhole principle, (2) implies there are at least 𝛿 elements 𝑘 from 3 to 𝑛 − 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 −

1, where 𝛿 = 𝑛 − − (cid:22) ⌊ 𝑛 / ⌋ − 𝑑 𝑃 𝑑 𝑃 − (cid:23) . Likewise, there are at least 𝛿 elements 𝑘 from 4 to 𝑛 such that 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 −

1. This implies that there are at least 2 𝛿 − 𝑛 + 𝑘 from 3 to 𝑛 − 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 − 𝑛 (( 𝑖 , 𝑖 𝑘 + ) , 𝑃 ) < 𝑑 𝑃 −

1. We have2 𝛿 − 𝑛 + = 𝑛 − − (cid:22) ⌊ 𝑛 / ⌋ − 𝑑 𝑃 𝑑 𝑃 − (cid:23) ≥ ( 𝑛 − )( 𝑑 𝑃 − ) + 𝑑 𝑃 − ≥ , proving the first part of the lemma. In each iteration, the Algorithm1 selects a tour like 𝐼 with probability at least 𝑑 𝑃 / 𝜇 . There are at least [( 𝑛 − )( 𝑑 𝑃 − ) + ]/( 𝑑 𝑃 − ) different 2-opt operations on such atour to produce 𝑃 ′ . Since there are 𝑛 ( 𝑛 − )/ 𝑃 ′ from 𝑃 is atleast ( 𝑛 − )( 𝑑 𝑃 − ) + 𝑑 𝑃 − 𝑑 𝑃 𝜇𝑛 ( 𝑛 − ) ≥ [( 𝑛 − )( 𝑑 𝑃 − ) + ] 𝜇𝑛 ( 𝑛 − ) . □ In Lemma 1, only one favorable scenario is accounted for whereboth edges to be traded in have counts less than 𝑑 𝑃 −

1. However,there are other situations where strict improvements would be madeas well, such as when both swapped-out edges have count 𝑑 𝑃 . Fur-thermore, a tour to be mutated might contain more than 2 edgeswith such count, increasing the number of beneficial choices dramat-ically. Consequently, the derived probability bound is pessimistic,and the average success rate might be much higher. It also meansthat the bound of the range of 𝜇 is pessimistic and the lack of lo-cal optima is very probable at larger population sizes, albeit withreduced diversity improvement probability.Intuitively, larger population sizes present more complex searchspaces where local search approaches are more prone to reachingsub-optimal results. It is reasonable to infer that small populationsizes make diversity maximization easier for Algorithm 1. However,for 3-opt mutation, local optima can still exist even with popula-tion size being as small as 3. Next, we show a simple constructionof supposedly easy cases where 3-opt fails to produce any strictimprovement.Example 2. For any TSP instance of size 𝑛 ≥ where 𝑛 is a multipleof 4, we can always construct a population of 3 tours having sub-optimal diversity, such that no single 3-opt operation on any tour canimprove diversity. Let the first tour be 𝐼 = ( 𝑖 , 𝑖 , . . . , 𝑖 𝑛 ) , we derivethe second tour 𝐼 sharing only 2 edges with 𝐼 and containing edgesthat form a “crisscrossing” pattern on 𝐼 , 𝐼 = ( 𝑖 , 𝑖 𝑛 − , . . . , 𝑖 𝑘 + , 𝑖 𝑛 − 𝑘 − , . . . , 𝑖 𝑛 / − , 𝑖 𝑛 / + ,𝑖 𝑛 / , 𝑖 𝑛 / + , . . . , 𝑖 𝑛 / − 𝑘 , 𝑖 𝑛 / + 𝑘 , . . . , 𝑖 , 𝑖 𝑛 ) . The third tour 𝐼 shares no edge with 𝐼 or 𝐼 and contains many edgesthat “skip one node” on 𝐼 . 𝐼 = ( 𝑖 , . . . , 𝑖 𝑘 + , . . . , 𝑖 𝑛 / − , 𝑖 𝑛 / + , . . . , 𝑖 𝑛 / + 𝑘 , . . . , 𝑖 𝑛 𝑖 𝑛 / , . . . , 𝑖 𝑛 / − 𝑘 , . . . , 𝑖 , 𝑖 𝑛 − , . . . , 𝑖 𝑛 − 𝑘 − , 𝑖 𝑛 / + ) . In order to improve diversity, the operation must exchange, on eithertour, at least one edge with count 2. However, any 3-opt operation withsuch restriction ends up trading in at least another edge used by theother tours, nullifying any improvement it makes. This populationpresents a local optimum for algorithms that uses 3-opt as the onlysolution generating mechanism. Figure 2 illustrates two examples ofthe construction with 𝑛 = and 𝑛 = . We speculate that in many cases, the insertion 3-opt suffers fromits asymmetrical nature. Both 2-opt and 3-opt operations are eachdefined by two decisions. For 2-opt, the two decisions are which igure 1: Examples of constructed tours with 𝑛 = and 𝑛 = where no single 3-opt operation on any tour improves diver-sity. two edges to be exchanged, and only after both are made will thetwo new edges be fixed. For 3-opt, one decision determines whichset of two adjacent edges to exchanged, and the other defines thethird edge. Unlike 2-opt, after only one decision, one out of thethree new edges is already fixed. Such limited flexibility makesit difficult to guarantee diversity improvements via 3-opt withoutadditional assumptions about the population. In contrast, 4-opt isnot subjected to this drawback, as the two decisions associated withit are symmetric. For this reason, we can derive another result for4-opt similar to Lemma 1.Lemma 2. Given a population of tours 𝑃 such that ≤ 𝜇 ≤ (cid:4) 𝑛 + (cid:5) and 𝑑 𝑃 ≥ , there exist a tour 𝐼 ∈ 𝑃 and a pair ( 𝑖, 𝑗 ) , such that 𝑃 ′ = ( 𝑃 \ { 𝐼 }) ∪ { 𝑜𝑝𝑡 ( 𝐼, 𝑖, 𝑗 )} satisfies (1) . Moreover, in each iteration,the Algorithm 1 with 4-opt mutation and 𝛼 = ∞ makes such animprovement with probability at least [( 𝑛 − ) ( 𝑑 𝑃 − )+ ] 𝜇𝑛 ( 𝑛 − ) . Proof. There must be 𝑑 𝑃 tours 𝐼 in 𝑃 such that ∃ 𝑒 ∈ 𝐸 ( 𝐼 ) , 𝑛 ( 𝑒, 𝑃 ) = 𝑑 𝑃 , let 𝐼 be one such tour. W.l.o.g, let 𝐼 be represented by a permu-tation of nodes ( 𝑖 , 𝑖 , . . . , 𝑖 𝑛 ) where 𝑛 (( 𝑖 , 𝑖 ) , 𝑃 ) = 𝑑 𝑃 . The opera-tion 4 𝑜𝑝𝑡 ( 𝐼, , 𝑘 ) trades edges ( 𝑖 , 𝑖 ) , ( 𝑖 , 𝑖 ) , ( 𝑖 𝑘 − , 𝑖 𝑘 ) , ( 𝑖 𝑘 , 𝑖 𝑘 + ) in 𝐼 for ( 𝑖 , 𝑖 𝑘 ) , ( 𝑖 , 𝑖 𝑘 ) , ( 𝑖 , 𝑖 𝑘 − ) , ( 𝑖 , 𝑖 𝑘 + ) . If 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 − 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 − 𝑛 (( 𝑖 , 𝑖 𝑘 − ) , 𝑃 ) < 𝑑 𝑃 − 𝑛 (( 𝑖 , 𝑖 𝑘 + ) , 𝑃 ) < 𝑑 𝑃 −

1, then 𝑃 ′ = ( 𝑃 \ { 𝐼 }) ∪ { 𝑜𝑝𝑡 ( 𝐼, , 𝑘 )} satisfies (1) followingsimilar reasoning in the proof of Lemma 1. We show that there isalways such a position 𝑘 . Since 𝑘 can only go from 5 to 𝑛 −

1, thereare 𝑛 − 𝑘 . We use the fact that 𝑠 ( 𝑖, 𝑃 ) = 𝜇 for any 𝑖 , andthat 𝑛 (( 𝑖 , 𝑖 ) , 𝑃 ) ≥ 𝐼 uses them, thus 𝑛 − ∑︁ 𝑘 = 𝑛 (( 𝑖 , 𝑖 𝑘 − ) , 𝑃 ) ≤ 𝜇 − 𝑑 𝑃 − ≤ (cid:106) 𝑛 (cid:107) − 𝑑 𝑃 , and (3) 𝑛 − ∑︁ 𝑘 = 𝑛 (( 𝑖 , 𝑖 𝑘 + ) , 𝑃 ) ≤ (cid:106) 𝑛 (cid:107) − 𝑑 𝑃 . According to the pigeonhole principle, (3) implies there are at least 𝛿 elements 𝑘 from 5 to 𝑛 − 𝑛 (( 𝑖 , 𝑖 𝑘 − ) , 𝑃 ) < 𝑑 𝑃 −

1, where 𝛿 = 𝑛 − − (cid:22) ⌊ 𝑛 / ⌋ − 𝑑 𝑃 𝑑 𝑃 − (cid:23) . Likewise, there are at least 𝛿 elements 𝑘 from 5 to 𝑛 − 𝑛 (( 𝑖 , 𝑖 𝑘 + ) , 𝑃 ) < 𝑑 𝑃 −

1. This implies that there are at least 2 𝛿 − 𝑛 + 𝑘 from 5 to 𝑛 − 𝑛 (( 𝑖 , 𝑖 𝑘 − ) , 𝑃 ) < 𝑑 𝑃 − 𝑛 (( 𝑖 , 𝑖 𝑘 + ) , 𝑃 ) < 𝑑 𝑃 −

1, which we will call condition 1. We denotethe number by ΔΔ = 𝛿 − 𝑛 + = 𝑛 − − (cid:22) ⌊ 𝑛 / ⌋ − 𝑑 𝑃 𝑑 𝑃 − (cid:23) , Using 𝑛 (( 𝑖 , 𝑖 𝑛 ) , 𝑃 ) ≥

1, we similarly derive that there are at least 𝛿 element 𝑘 from 5 to 𝑛 − 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 −

1. However, we only have 𝑛 (( 𝑖 , 𝑖 ) , 𝑃 ) ≥

1, meaning there are at least 𝛿 ′ element 𝑘 from 5 to 𝑛 − 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 − 𝛿 ′ = 𝑛 − − (cid:22) ⌊ 𝑛 / ⌋ 𝑑 𝑃 − (cid:23) . From this, we have that there are at least 𝛿 + 𝛿 ′ − 𝑛 + 𝑘 from5 to 𝑛 − 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 − 𝑛 (( 𝑖 , 𝑖 𝑘 ) , 𝑃 ) < 𝑑 𝑃 − Δ ′ Δ ′ = 𝛿 + 𝛿 ′ − 𝑛 + = 𝑛 − − (cid:22) ⌊ 𝑛 / ⌋ − 𝑑 𝑃 𝑑 𝑃 − (cid:23) − (cid:22) ⌊ 𝑛 / ⌋ 𝑑 𝑃 − (cid:23) , Finally, we can infer that there are at least Δ + Δ ′ − 𝑛 + 𝑘 such that both condition 1 and condition 2 are met. We have Δ + Δ ′ − 𝑛 + ≥ 𝑛 + − ⌊ 𝑛 / ⌋ − 𝑑 𝑃 𝑑 𝑃 − ≥ ( 𝑛 − )( 𝑑 𝑃 − ) + 𝑑 𝑃 − ≥ , proving the first part of the lemma. In each iteration, the Algorithm1 selects a tour like 𝐼 with probability at least 𝑑 𝑃 / 𝜇 . There are at least [( 𝑛 − )( 𝑑 𝑃 − ) + ]/( 𝑑 𝑃 − ) different 4-opt operations on such atour to produce 𝑃 ′ . Since there are 𝑛 ( 𝑛 − )/ 𝑃 ′ from 𝑃 is atleast ( 𝑛 − )( 𝑑 𝑃 − ) + 𝑑 𝑃 − 𝑑 𝑃 𝜇𝑛 ( 𝑛 − ) ≥ [( 𝑛 − )( 𝑑 𝑃 − ) + ] 𝜇𝑛 ( 𝑛 − ) . □ Like in Lemma 1, only one out of many favorable scenarios isconsidered in Lemma 2, so the lower bound is strict. The range of thepopulation size is smaller to account for the fact that the conditionfor such a scenario is stronger than the one in Lemma 1. With theseresults, we derive run-time results for 2-opt and 4-opt, relying onthe longest possible path from zero diversity to the optimum.Theorem 1.

Given any TSP instance with 𝑛 ≥ nodes and 𝜇 ≥ ,the Algorithm 1 with 𝛼 = ∞ obtains a 𝜇 -population with maximumdiversity within expected time O( 𝜇 𝑛 ) if • it uses 2-opt mutation and 𝜇 ≤ (cid:4) 𝑛 + (cid:5) , • it uses 4-opt mutation and 𝜇 ≤ (cid:4) 𝑛 + (cid:5) . Proof. In the worst case, the algorithm begins with 𝑑 𝑃 = 𝜇 and 𝑐 𝑃 = 𝑛 . At any time, we have 𝑐 𝑃 ≤ 𝜇𝑛 / 𝑑 𝑃 . Moreover, in theworst case, each improvement either reduces 𝑐 𝑃 by 1, or reduces 𝑑 𝑃 by 1 and sets 𝑐 𝑃 to its maximum value. With 2 ≤ 𝜇 ≤ (cid:4) 𝑛 + (cid:5) , themaximum diversity is achieved iff 𝑑 𝑃 = 𝜇 ∑︁ 𝑗 = 𝜇𝑛𝑗 𝜇𝑛 ( 𝑛 − ) [( 𝑛 − )( 𝑗 − ) + ] = O( 𝜇 𝑛 ) . On the other hand, Lemma 2 implies that when 2 ≤ 𝜇 ≤ (cid:4) 𝑛 + (cid:5) , Al-gorithm 1 with 4-opt mutation needs at most the following expectedrun time 𝜇 ∑︁ 𝑗 = 𝜇𝑛𝑗 𝜇𝑛 ( 𝑛 − ) [( 𝑛 − )( 𝑗 − ) + ] = O( 𝜇 𝑛 ) . □ As expected, the simple algorithm requires only quadratic ex-pected run-time to achieve optimal diversity from any starting pop-ulation of sufficiently small size. The quadratic scaling with 𝜇 comesfrom two factors. One is the fact that Algorithm 1 needs to selectthe “correct” tour to mutate out of 𝜇 tours. The other is the fact thatup to 𝜇 − 𝑛 comesfrom the quadratic number of possible mutation operations, and thenumber of edges to modify in each tour. Additionally, most of the un-time is spent on the “last stretch” when reducing 𝑑 𝑃 from 2 to1, as the rest only takes up O( 𝜇 𝑛 ) expected number of steps. Let 𝑑 𝑃 = max 𝑖,𝑗 ∈[ 𝑛 ] { 𝑛 ( 𝑖, 𝑗, 𝑃 )} and 𝑐 𝑃 = | 𝑖, 𝑗 ∈ [ 𝑛 ]| 𝑛 ( 𝑖, 𝑗, 𝑃 ) = 𝑑 𝑃 | .We denote the 2-opt operation by 𝑠 𝑖,𝑗 (·) where 𝑖 and 𝑗 are twopositions in the permutation to be exchanged. For each 𝑖 ∈ [ 𝑛 ] , let 𝑠 ( 𝑖, 𝑃 ) = (cid:205) 𝑗 ∈[ 𝑛 ] 𝑛 ( 𝑖, 𝑗, 𝑃 ) and 𝑧 ( 𝑖, 𝑃 ) = (cid:205) 𝑗 ∈[ 𝑛 ] 𝑛 ( 𝑗, 𝑖, 𝑃 ) . Let 𝜙 be ashift operation such that for all permutation 𝑎 : [ 𝑛 ] → [ 𝑛 ] , 𝑏 = 𝜙 ( 𝑎 ) = ⇒ ∀ 𝑖 ∈ [ 𝑛 − ] , 𝑏 ( 𝑖 ) = 𝑎 ( 𝑖 + ) ∧ 𝑏 ( 𝑛 ) = 𝑎 ( ) . Also, for convenience, we use the notation 𝐴 ( 𝑃 ) = {( 𝑖, 𝑗 )|∃ 𝑎 ∈ 𝑃, 𝑎 ( 𝑖 ) = 𝑗 } . We first show the achievable maximum diversity for anypositive 𝑛 , which will be the foundation for our run-time analysis.Theorem 2. Given 𝑛, 𝜇 ≥ , there exists a 𝜇 -size population 𝑃 ofpermutations of [ 𝑛 ] such that max 𝑖,𝑗 ∈[ 𝑛 ] 𝑛 ( 𝑖, 𝑗, 𝑃 ) − min 𝑖,𝑗 ∈[ 𝑛 ] 𝑛 ( 𝑖, 𝑗, 𝑃 ) ≤ . (4)Proof. We prove by constructing such a 𝑃 . Let 𝑎 : [ 𝑛 ] → [ 𝑛 ] be some arbitrary permutation and 𝑄 = { 𝜙 𝑖 ( 𝑎 )| 𝑖 ∈ [ 𝑛 ]} where 𝜙 𝑖 is 𝜙 applied 𝑖 times. Note that 𝜙 𝑛 ( 𝑎 ) = 𝑎 . It is the case that notwo solutions in 𝑄 share assignments, so for all 𝑖, 𝑗 ∈ [ 𝑛 ] , we have 𝑛 ( 𝑖, 𝑗, 𝑄 ) =

1, and 𝐴 ( 𝑄 ) = [ 𝑛 ] × [ 𝑛 ] . Let 𝜇 = 𝑘𝑛 + 𝑟 where 𝑘, 𝑟 ∈ N and 𝑟 < 𝑛 , and 𝐵 ⊂ 𝑄 where | 𝐵 | = 𝑟 , we include in 𝑃 𝑘 + 𝐵 and 𝑘 copies of each solution in 𝑄 \ 𝐵 . Then 𝑃 satisfies (4) since ∀( 𝑖, 𝑗 ) ∈ 𝐴 ( 𝐵 ) , 𝑛 ( 𝑖, 𝑗, 𝑃 ) = 𝑘 + , and ∀( 𝑖, 𝑗 ) ∈ 𝐴 ( 𝑄 \ 𝐵 ) , 𝑛 ( 𝑖, 𝑗, 𝑃 ) = 𝑘. □ With maximum diversity well-defined, we can determine whetherit is reached with population 𝑃 using only information from N ( 𝑃 ) .Therefore, we can show the guarantee of strict diversity improve-ment with a single 2-opt on some sub-optimal population, and theprobability that Algorithm 1 makes such an improvement, similarto Lemma 1. For brevity’s sake, we reuse the expression (1) withnotations defined in the QAP context.Lemma 3. Given a population of permutations 𝑃 such that ≤ 𝜇 ≤ (cid:4) 𝑛 + (cid:5) and 𝑑 𝑃 ≥ , there exist a permutation 𝑎 ∈ 𝑃 and pair ( 𝑖, 𝑗 ) where ≤ 𝑖 < 𝑗 ≤ 𝑛 , such that 𝑃 ′ = ( 𝑃 \ { 𝑎 }) ∪ { 𝑠 𝑖,𝑗 ( 𝑎 )} satisfies (1) . Moreover, in each iteration, the Algorithm 1 with 2-opt mutationand 𝛼 = ∞ makes such an improvement with probability at least [( 𝑛 + ) ( 𝑑 𝑃 − )+ ] 𝜇𝑛 ( 𝑛 − ) . Proof. There must be 𝑑 𝑃 permutations 𝑎 in 𝑃 such that ∃ 𝑖 ∈[ 𝑛 ] , 𝑛 ( 𝑖, 𝑎 ( 𝑖 ) , 𝑃 ) = 𝑑 𝑃 , let 𝑎 be one such permutation, and 𝑖 ∈ [ 𝑛 ] such that 𝑛 ( 𝑖, 𝑎 ( 𝑖 ) , 𝑃 ) = 𝑑 𝑃 . The operation 𝑠 𝑖,𝑘 ( 𝑎 ) trades assignments 𝑖 → 𝑎 ( 𝑖 ) and 𝑘 → 𝑎 ( 𝑘 ) in 𝑎 for 𝑖 → 𝑎 ( 𝑘 ) and 𝑘 → 𝑎 ( 𝑖 ) . Regardlessof 𝑛 ( 𝑘, 𝑎 ( 𝑘 ) , 𝑃 ) , if 𝑛 ( 𝑖, 𝑎 ( 𝑘 ) , 𝑃 ) < 𝑑 𝑃 − 𝑛 ( 𝑘, 𝑎 ( 𝑖 ) , 𝑃 ) < 𝑑 𝑃 − 𝑃 ′ = ( 𝑃 \ { 𝑎 }) ∪ { 𝑠 𝑖,𝑘 ( 𝑎 )} satisfies (1) since 𝑛 ( 𝑖, 𝑎 ( 𝑘 ) , 𝑃 ′ ) and 𝑛 ( 𝑘, 𝑎 ( 𝑖 ) , 𝑃 ′ ) cannot reach 𝑑 𝑃 . We show that there is always such aposition 𝑘 . There are 𝑛 − 𝑘 since 𝑘 ≠ 𝑖 . It’s the case that 𝑠 ( 𝑖, 𝑃 ) = 𝑧 ( 𝑖, 𝑃 ) = 𝜇 , thus ∑︁ 𝑗 ≠ 𝑎 ( 𝑖 ) 𝑛 ( 𝑖, 𝑗, 𝑃 ) ≤ 𝜇 − 𝑑 𝑃 ≤ (cid:22) 𝑛 + (cid:23) − 𝑑 𝑃 , and (5) ∑︁ 𝑗 ≠ 𝑖 𝑛 ( 𝑗, 𝑎 ( 𝑖 ) , 𝑃 ) ≤ (cid:22) 𝑛 + (cid:23) − 𝑑 𝑃 . According to the pigeonhole principle, (5) implies there are at least 𝛿 elements 𝑘 ≠ 𝑖 such that 𝑛 ( 𝑖, 𝑎 ( 𝑘 ) , 𝑃 ) < 𝑑 𝑃 −

1, where 𝛿 = 𝑛 − − (cid:22) ⌊( 𝑛 + )/ ⌋ − 𝑑 𝑃 𝑑 𝑃 − (cid:23) . Likewise, there are at least 𝛿 elements 𝑘 ≠ 𝑖 such that 𝑛 ( 𝑘, 𝑎 ( 𝑖 ) , 𝑃 ) < 𝑑 𝑃 −

1. This implies that there are at least 2 𝛿 − 𝑛 + 𝑘 ≠ 𝑖 where 𝑛 ( 𝑖, 𝑎 ( 𝑘 ) , 𝑃 ) < 𝑑 𝑃 − 𝑛 ( 𝑘, 𝑎 ( 𝑖 ) , 𝑃 ) < 𝑑 𝑃 −

1. We have2 𝛿 − 𝑛 + = 𝑛 − − (cid:22) ⌊( 𝑛 + )/ ⌋ − 𝑑 𝑃 𝑑 𝑃 − (cid:23) ≥ ( 𝑛 + )( 𝑑 𝑃 − ) + 𝑑 𝑃 − ≥ , proving the first part of the lemma. In each iteration, the Algorithm1 selects a tour like 𝐼 with probability at least 𝑑 𝑃 / 𝜇 . There are atleast [( 𝑛 + )( 𝑑 𝑃 − ) + ]/( 𝑑 𝑃 − ) different 2-opt operations on sucha tour to produce 𝑃 ′ . Since there are 𝑛 ( 𝑛 − )/ 𝑃 ′ from 𝑃 is atleast ( 𝑛 + )( 𝑑 𝑃 − ) + 𝑑 𝑃 − 𝑑 𝑃 𝜇𝑛 ( 𝑛 − ) ≥ [( 𝑛 + )( 𝑑 𝑃 − ) + ] 𝜇𝑛 ( 𝑛 − ) . □ Compared to Lemma 1, the range of 𝜇 in Lemma 3 is about twiceas large, which coincides with the fact that the maximum numberof disjoint solutions (sharing no assignment/edge) for any giveninstance size is also twice as large in QAP than it is in TSP. Theresult lends itself to the following run-time bound for Algorithm 1,similar to Theorem 1.Theorem 3. Given any QAP instance with 𝑛 ≥ and ≤ 𝜇 ≤ (cid:4) 𝑛 + (cid:5) , the Algorithm 1 with 2-opt mutation and 𝛼 = ∞ obtains a 𝜇 -population with maximum diversity within expected time O( 𝜇 𝑛 ) . Proof. In the worst case, the algorithm begins with 𝑑 𝑃 = 𝜇 and 𝑐 𝑃 = 𝑛 . At any time, we have 𝑐 𝑃 ≤ 𝜇𝑛 / 𝑑 𝑃 . Moreover, in the worstcase, each improvement either reduces 𝑐 𝑃 by 1, or reduces 𝑑 𝑃 by1 and sets 𝑐 𝑃 to its maximum value. With 2 ≤ 𝜇 ≤ (cid:4) 𝑛 + (cid:5) , themaximum diversity is achieved iff 𝑑 𝑃 = 𝜇 ∑︁ 𝑗 = 𝜇𝑛𝑗 𝜇𝑛 ( 𝑛 − ) [( 𝑛 + )( 𝑗 − ) + ] = O( 𝜇 𝑛 ) . □ The results in Theorem 1 and 3 are identical due to similaritiesbetween structures of TSP tours and QAP mappings, and the sameintuition applies. Of note is that according to the proofs, the proba-bility of making improvements drops as the population is closer tomaximum diversity. This is a common phenomenon for randomizedheuristics in general, which we expect to see replicated in experi-mentation.

We perform two sets of experiments to establish baseline results forevolving diverse QAP mappings. These involve running Algorithm1 separately using two described measures: N and D . We denotethese two variants by 𝐷 and 𝐷 . The mutation operator used is2-opt. Firstly, we consider the unconstrained case where no qualityconstraint is applied. Then, we impose constraints with varyingquality thresholds 𝛼 on the solutions.For our experiments, we use three QAPLIB instances: Nug30 [16],Lipa90b [12], Esc128 [7]. The optimal solutions for these instancesare known. We vary the population size among 3, 10, 20, 50. Werun each variant of the algorithm 30 times on each instance, andeach run is allotted 𝜇𝑛 maximum iterations. It is important tonote that any reported diversity score is normalized with the upperbound appropriate to the instance. For 𝐷 , the bound is derivedfrom Theorem 2, while it is 𝜇𝑛 for 𝐷 as mentioned. We specify thedifferences in settings between unconstrained case and constrainedcase in the following sections. .1 Unconstrained diversity optimization In the unconstrained case, we are interested in how optimizing forone measure affect the other, and how many iterations are neededto reach maximum diversity from zero diversity. To this end, we setthe initial population to contain only duplicates of some randomtour. Furthermore, we apply a stopping criterion that holds whenthe measure being optimized for reaches its upper bound. However,for 𝑛 > 𝜇 , the bound is unreachable, so we expect that the algorithmdoes not terminate prematurely while minimizing D .Figure 2 shows the mean diversity scores and their standard devi-ations throughout the runs, and the average numbers of iterationstill termination. Overall, when 𝜇 ≤ 𝑛 , Algorithm 1 maximizes both 𝐷 and 𝐷 well within the run time limit. The ratios between neededrun-times and corresponding total run-times seem to correlate withthe ratio 𝜇 / 𝑛 . Additionally, the algorithm seems to require similarrun-time to optimize for both measures, as no consistent differencesare visible.The figure also shows a notable difference in the evolutionary tra-jectories resulted from using N and D for survival selection. When D is used, Algorithm 1 improves 𝐷 about as efficiently as when N is used. On the other hand, when N is used, it increases 𝐷 poorlyduring the early stages, and even noticeably decreases it in shortperiods. In fact, in many cases, 𝐷 only starts to increase quicklywhen 𝐷 reaches a certain threshold. That said, this particular dif-ference is not observable for 𝜇 =

3. Nevertheless, it indicates thateven in easy cases ( 𝜇 ≤ 𝑛 ), highly even distributions of assignmentsin the population are unlikely to preclude clustering. In contrast,separating each solution from the rest of the population tends toimprove overall diversity effectively. In the constrained case, we look for the final diversity scores acrossvarying 𝛼 and the extent to which optimizing for 𝐷 mitigate clus-tering, especially at small 𝛼 . Therefore, we consider 𝛼 values 0.05,0.2, 0.5, 1, and run the algorithm for 𝜇𝑛 steps with no additionalstopping criterion. Furthermore, we initiate the population withduplicates of the optimal solution to allow flexibility for meaningfulbehaviors. Aside from diversity scores, we also record the percent-age of assignments belonging to exactly one solution (unique) outof 𝜇𝑛 assignments in each final population.Table 1 shows a comparison in terms of 𝐷 and 𝐷 scores aswell as unique assignment percentages. Overall, maximum diver-sity is achieved reliably in most cases when 𝛼 = . ,

1. For Lipa90b,there are tremendous gaps in final diversity scores when 𝛼 changesfrom 0 .

05 to 0 .

2. The differences are much smaller in other QAPLIBinstances. Also, at 𝛼 = .

5, maximum diversity is not reached asfrequently for Esc128 as for other instances. These suggest signifi-cantly different cost distributions in the solution spaces associatedwith these QAPLIB instances.Comparing the diversity scores from the two approaches, we cansee trends consistent with those in the unconstrained case. Eachapproach predictably excels at maximizing the its own measure overthe other. That said, the 𝐷 approach does not fall far behind in 𝐷 scores, even in cases where statistical significance is observed (atmost 7% difference). Meanwhile, the 𝐷 approach’s 𝐷 scores aremuch lower than those of the other, especially in hard cases (small 𝛼 and large 𝜇 ). The same differences can be seen in the percentages ofunique assignments, which seem to strongly correlate with 𝐷 . Thisindicates that using the measure D , Algorithm 1 significantly re-duces clustering, and equalizes assignments’ representations almostas effectively as when using the measure N . We studied evolutionary diversity optimization in the TravelingSalesperson Problem and Quadratic Assignment Problem. In thistype of optimization problem, the goal is to maximize diversity asquantified by some metric, and the constraint involves the solutions’qualities. We described the similarity and difference between thestructure of a TSP tour and that of a QAP mapping, and customizedtwo diversity measures to each problem. We considered a baseline ( 𝜇 + ) evolutionary algorithm that incrementally modifies the pop-ulation using traditional mutation operators on one solution at atime. We showed that for any sufficiently small 𝜇 , the algorithmguarantees maximum diversity in TSP within using 2-opt and 4-optwithin O( 𝜇 𝑛 ) expected iterations, while 3-opt suffers from localoptima even with very small 𝜇 . We derived the same result in QAPwith 2-opt, where the upper bound of 𝜇 is more generous. Additionalexperiments on QAPLIB instances shed light on differences on evolu-tionary trajectories when optimizing for the two diversity measures.Our results show heterogeneity in the correlation between the qual-ity constraint threshold and the achieved diversity across differentinstances, and that the average practical performance is much moreoptimistic than the worst-case suggests. ACKNOWLEDGMENTS

This work was supported by the Phoenix HPC service at the Univer-sity of Adelaide, and by the Australian Research Council throughgrant DP190103894.

REFERENCES [1] Ravindra K. Ahuja, James B. Orlin, and Ashish Tiwari. 2000. A greedy geneticalgorithm for the quadratic assignment problem.

Computers & Operations Research

27, 10 (Sept. 2000), 917–934. https://doi.org/10.1016/s0305-0548(99)00067-2[2] Brad Alexander, James Kortman, and Aneta Neumann. 2017. Evolution of artisticimage variants through feature based diversity optimisation. In

Proceedings ofthe Genetic and Evolutionary Computation Conference . ACM, 171–178. https://doi.org/10.1145/3071178.3071342[3] Alberto Alvarez, Steve Dahlskog, Jose Font, and Julian Togelius. 2019. EmpoweringQuality Diversity in Dungeon Design with Interactive Constrained MAP-Elites.In . IEEE, 1–8. https://doi.org/10.1109/cig.2019.8848022[4] Jakob Bossek, Pascal Kerschke, Aneta Neumann, Markus Wagner, Frank Neumann,and Heike Trautmann. 2019. Evolving diverse TSP instances by means of novel andcreative mutation operators. In

Proceedings of the 15th ACM/SIGEVO Conferenceon Foundations of Genetic Algorithms - FOGA '19 . ACM Press, 58–71. https://doi.org/10.1145/3299904.3340307[5] Antoine Cully and Yiannis Demiris. 2018. Quality and Diversity Optimization: AUnifying Modular Framework.

IEEE Transactions on Evolutionary Computation

Parallel Problem Solving from Nature – PPSN XVI . Springer Interna-tional Publishing, 588–603. https://doi.org/10.1007/978-3-030-58115-2_41[7] B. Eschermann and H.-J. Wunderlich. 1990. Optimized synthesis of self-testablefinite state machines. In [1990] Digest of Papers. Fault-Tolerant Computing: 20thInternational Symposium . IEEE Comput. Soc. Press, 390–397. https://doi.org/10.1109/ftcs.1990.89393[8] Wanru Gao, Samadhi Nallaperuma, and Frank Neumann. 2020. Feature-BasedDiversity Optimization for Problem Instance Classification.

Evolutionary Compu-tation (June 2020), 1–22. https://doi.org/10.1162/evco_a_00274[9] Daniele Gravina, Ahmed Khalifa, Antonios Liapis, Julian Togelius, and Georgios N.Yannakakis. 2019. Procedural Content Generation through Quality Diversity. In . IEEE, 1–8. https://doi.org/10.1109/cig.2019.8848053[10] Pascal Kerschke, Holger H. Hoos, Frank Neumann, and Heike Trautmann. 2019.Automated Algorithm Selection: Survey and Perspectives.

Evolutionary Computa-tion

27, 1 (March 2019), 3–45. https://doi.org/10.1162/evco_a_00242[11] Joel Lehman and Kenneth O. Stanley. 2013. Evolvability Is Inevitable: IncreasingEvolvability without the Pressure to Adapt.

PLoS ONE

8, 4 (April 2013), 1–9.https://doi.org/10.1371/journal.pone.0062186[12] Yong Li and Panos M. Pardalos. 1992. Generating quadratic assignment testproblems with known optimal permutations.

Computational Optimization andApplications

1, 2 (Nov. 1992), 163–184. https://doi.org/10.1007/bf00253805[13] Alfonsas Misevicius. 2004. An improved hybrid genetic algorithm: new results forthe quadratic assignment problem.

Knowledge-Based Systems

17, 2-4 (May 2004),65–73. https://doi.org/10.1016/j.knosys.2004.03.001[14] Aneta Neumann, Wanru Gao, Carola Doerr, Frank Neumann, and Markus Wagner.2018. Discrepancy-based evolutionary diversity optimization. In

Proceedingsof the Genetic and Evolutionary Computation Conference . ACM, 991–998. https://doi.org/10.1145/3205455.3205532

00 1000 1500 2000 250000.51

200 400 60000.51

200 400 600 800 100000.51

20 40 60 8000.51

100 200 300 40000.51 N o m a li z e d d i ve r s i t y sc o r e D : D scoreD : D scoreD : D scoreD : D scoreD stop timeD stop time Nug30Lipa90bEsc128 = 10 = 3 Steps = 50 = 20

Figure 2: Means and standard deviations of normalized 𝐷 and 𝐷 scores from both approaches over time. For visibility, theX-axis range is scaled to the maximum number of steps till termination from all runs, missing data points are extrapolatedfrom the final scores. The total run-time is 𝜇𝑛 . The dashed lines denote the average numbers of steps till termination. [15] Aneta Neumann, Wanru Gao, Markus Wagner, and Frank Neumann. 2019. Evo-lutionary diversity optimization using multi-objective indicators. In Proceed-ings of the Genetic and Evolutionary Computation Conference . ACM, 837–845.https://doi.org/10.1145/3321707.3321796[16] Christopher E. Nugent, Thomas E. Vollmann, and John Ruml. 1968. An Experi-mental Comparison of Techniques for the Assignment of Facilities to Locations.

Operations Research

16, 1 (Feb. 1968), 150–173. https://doi.org/10.1287/opre.16.1.150[17] Justin K. Pugh, Lisa B. Soros, and Kenneth O. Stanley. 2016. Quality Diversity: ANew Frontier for Evolutionary Computation.

Frontiers in Robotics and AI

Computers & Operations Research

22, 1 (Jan. 1995), 73–83.https://doi.org/10.1016/0305-0548(93)e0020-t [19] Umut Tosun. 2014. A New Recombination Operator for the Genetic AlgorithmSolution of the Quadratic Assignment Problem.

Procedia Computer Science

Proceedings ofthe 12th annual conference on Genetic and evolutionary computation - GECCO '10 .ACM Press, 455–462. https://doi.org/10.1145/1830483.1830569[21] Tamara Ulrich and Lothar Thiele. 2011. Maximizing population diversity insingle-objective optimization. In

Proceedings of the 13th annual conference onGenetic and evolutionary computation - GECCO '11 . ACM Press, 641–648. https://doi.org/10.1145/2001576.2001665 able 1: Diversity scores and the ratios of unique assignments in the output populations. The highlights denote greater valuesbetween the two approaches with statistical significance, based on Wilcoxon rank sum tests with 95% confidence level. 𝜇 𝛼 Optimizing 𝐷 Optimizing 𝐷 𝐷 𝐷 unique percentage 𝐷 𝐷 unique percentage mean std mean std mean std mean std mean std mean std N u g L i p a b E s c0.37%1 100.00% 0.00% 100.00% 0.01% 100.00% 0.01% 100.00% 0.00% 99.99% 0.02% 99.99% 0.02%