[PDF] Strong Valid Inequalities Identification for Mixed Integer Programming Problems

Abstract

The characterization of strong valid inequalities for integer and mixed-integer programs is more of an artistic task than a systematic methodology, requiring inspiration that can sometimes be elusive. Frequently, this task is facilitated by somehow exploiting the structure of problems for devising strong valid inequalities. Subsequently, various mathematical techniques are utilized for proving that those inequalities, which are often easily shown to be valid, are indeed strong in the sense that they represent facets or other high dimensional faces. This paper develops a method to assist modelers in the challenge to devise strong valid inequalities. In each iteration, the proposed algorithm generates a valid inequality by solving a suitably constructed linear mixed integer program and applies some quality criteria in order to determine if it is a new strong valid inequality. To illustrate the proposed algorithm, a new Traveling Salesman Problem (TSP) formulation is developed based on a set of constraints already constructed in the context of the Hamiltonian Cycle Problem (HCP), and then the proposed algorithm is employed to derive a set of strong inequalities to tighten this TSP formulation. Finally, a comparison study between the relaxation of the new TSP formulation and that of a state-of-the-art TSP formulation is conducted. The computational study confirms the effectiveness of the devised inequalities due to the better quality of the relaxation provided by the new formulation.

Full PDF

11 Strong Valid Inequalities Identification for Mixed Integer Programming Problems

Asghar Moeini , Kate Smith-Miles School of Mathematics & Statistics, The University of Melbourne, Parkville 3010, Australia

Abstract

Keywords : Integer programming, Linear programming, Combinatorial optimization, Facet-defining inequality, Valid inequality, Convex hull problem, Traveling salesman problem Introduction

A natural approach to study a broad range of optimization problems arising in operations research is to express them as Integer Programming (IP) formulations of the form

Min 𝑐 𝑇 𝑥 s.t. 𝐴𝑥 = 𝑏

𝐵𝑥 ≥ 𝑑 𝑥 , … , 𝑥 𝑞 ∈ ℤ 𝑥 𝑞+1 , … , 𝑥 𝑛 ∈ ℝ (IP) where 𝑥 is a vector representing the decision variables, 𝑐 ∈ ℝ 𝑛 is a vector representing objective function coefficients, 𝐴 ∈ ℝ 𝑙×𝑛 and

𝐵 ∈ ℝ 𝑚×𝑛 are matrices representing the coefficients for equality and inequality constraints respectively, and 𝑏 ∈ ℝ 𝑙 and 𝑑 ∈ ℝ 𝑚 are vectors representing the right hand side values for those constraints. IP problems are known to Corresponding Author Email: [email protected] be NP-hard [18], because of their integral variables. Therefore, it is logical to start their analysis by first investigating their LP-relaxation, namely the problems

Min 𝑐 𝑇 𝑥 s.t. 𝐴𝑥 = 𝑏

𝐵𝑥 ≥ 𝑑 𝑥 ∈ ℝ 𝑛 (LP) A critical question that arises is whether we can further reduce 𝒳 , the feasible space of LP, without removing any solutions lying in S , the feasible space of the IP problem. To tackle such a reduction task, we need to study the properties of polytopes associated with these problems. It is well known that a polytope can be specified in two ways [33]. The V-representation defines a polytope with the set of its extreme points (vertices), while the H-representation defines a polytope through a set of valid inequalities. An important question in the integer programming context is how we can compute an H-representation of a polytope from its V-representation. That is, if 𝑆 is a finite set, then the convex hull of 𝑆 , denoted by conv ( 𝑆 ), is a well-defined polytope and according to the results in [30], there is at least a finite set of valid inequalities that characterizes conv ( 𝑆 ). Such a linear description is called the H-representation of the set 𝑆 (see also [31]). Computing the H-representation for a given set of vertices 𝑆 is called the facet enumeration problem and has been studied by numerous researchers, see for example [3], [4], and [14]. There are also some software packages such as Polymake [15] and Porta [6] for polytope specification purposes. The performance and implementation aspects of the state-of-the-art facet enumeration algorithms is summarized in [1]. Note that in this manuscript, we utilize the term “H-representation” for the polyhedral description of any given instance of the IP problem, and we use the term “convex hull formulation” for the polyhedral description in the general form of the underlying problem. Characterizing the convex hull formulation of an NP-hard problem is always heuristic in nature since if there existed such a description, which was also computationally tractable (i.e., there was a polynomial separation algorithm to recognize a violated inequality for any given point outside the polytope), then the original NP-hard problem could be solved in polynomial time [17]. Therefore, one may ask why we are still interested in finding valid inequalities to refine the convex hull formulations if they cannot be fully characterized? This is because in practice, even an insufficient description with a few computationally tractable families of inequalities is adequate to solve many problems. For example, see [13], in which branch-and-cut algorithms are utilized to solve 35 large-sized real-world instances of Asymmetric Traveling Salesman Problem. The above discussion reveals the importance of identifying strong valid inequalities to approximate convex hull formulations. To the authors’ knowledge, there is no formal method of doing so, and it is often achieved by using knowledge of the structure of the underlying problem. In particular, this means trying to guess a family of valid inequalities and then proving that they hold for all instances of the underlying problem. However, sometimes even a complicated structural analysis of a problem may not result in a useful constraint (equality/inequality). For example, in [12], an analysis of properties of Hamiltonian cycle matrices led to a set of constraints which were acknowledged as redundant constraints. One popular approach for guessing valid inequalities is to obtain the H-representation of very small instances, and then try to recognize generalizable patterns. As an example of this approach we refer the reader to [5] in which a few new facets of a TSP formulation derived. This guessing process to devise strong valid inequalities often involves dealing with several issues. The most important issue with methods that are based on the H-representation is their dimensional limitations [2], because even H-representation of small instances often include an extremely large number of inequalities. For example, constructing the H-representation of 10-node TSP polytope requires about fifty billion facet-defining inequalities [21]. This limitation does not allow consideration of large enough instances and, therefore, the generated inequalities (H-representations) might not be generalizable, especially due to the fact that the structures of small polytope instances could be different from the structures of large polytope instances. Thus, the small H-representations are often inadequate to develop an intuition as to how the behaviours would extend for the general case (convex hull formulation). This implies that in order to study the polytope of an IP problem, we need to consider the H-representation of sufficiently large instances, and enough of them, to increase the chance of detecting generalizable inequalities. Another issue with the guessing process is that the H-representation of a polytope can be variant in the sense that there could be several sets of constraints (equalities/inequalities) to represent the same polytope. This means the H-representation methods can sometimes result in a set of constraints in which the underlying patterns are hidden and beyond easy recognition, making generalization difficult. Motivated by these issues, the present paper proposes a potential method to identify strong valid inequalities for IP models. The proposed method can also be employed in the context of extended formulations (e.g., see [7]) to strengthen an IP model by involving new decision variables and then devising new inequalities according to the relationships between the original and new decision variables. The remainder of the paper is arranged as follows. Definitions and preliminaries are introduced in Section 2. The proposed method for generating strong valid inequalities is presented in Section 3. To illustrate the advantages of the proposed method, a new formulation for the TSP is developed in Section 4 and then in Section 5, the proposed method is employed to devise a set of strong inequalities for the new TSP formulation. Furthermore, a numerical comparison is given between the LP-relaxations of our tightened TSP formulation and one of the state-of-the-art TSP formulations to show its competitiveness. Given that the new TSP formulation was initially uncompetitive due to its loose LP-relaxation, the ability of the proposed method to tighten the LP-relaxation to the point of being competitive with the LP-relaxation of a state-of-the-art model shows the promising direction established by the proposed approach. Finally, conclusions are stated in Section 6.

2. Definitions and Preliminaries

This section sets up required definitions and preliminaries and offers a brief review of our previously published

Equality Constraint Augmenting (ECA-method) [25] which plays a prominent role in establishing the proposed method for strong valid inequality identification.

A set ℊ ⊆ ℝ 𝑛 is affine if the line through any two points in ℊ is entirely contained in ℊ (i.e., 𝛼𝑠 + (1 − 𝛼)𝑠 ∈ ℊ when 𝑥 , 𝑥 ∈ ℊ and 𝛼 ∈ ℝ ). Generalizing this idea, we refer to a point of the form 𝛼 𝑠 + ⋯ + 𝛼 𝑁 𝑠 𝑁 , where 𝛼 + ⋯ + 𝛼 𝑁 = 1 , as an affine combination of the points 𝑠 , … 𝑠 𝑁 . It can easily be seen that an affine set contains every affine combination of its points. The set of all affine combinations of points in some set 𝑆 ∈ ℝ 𝑛 is called the affine hull of S , and is denoted by 𝐚𝐟𝐟(𝑆) : 𝐚𝐟𝐟(𝑆) = {𝛼 𝑠 + ⋯ + 𝛼 𝑁 𝑠 𝑁 |𝑠 , … , 𝑠 𝑁 ∈ 𝑆, 𝛼 , … , 𝛼 𝑁 ∈ ℝ, 𝛼 + ⋯ + 𝛼 𝑁 = 1}. The points 𝑠 , … , 𝑠 𝑁 ∈ ℝ 𝑛 are affinely independent if the unique solution to 𝛼 𝑠 + ⋯ +𝛼 𝑁 𝑠 𝑁 = 0 is 𝛼 , … , 𝛼 𝑁 = 0 . The dimension of an affine set ℊ , denoted by dim ( ℊ ), is the maximum number of affinely independent points in ℊ minus one. Analogously, the dimension of 𝐜𝐨𝐧𝐯(𝑆) , the polytope defined as the convex hull of points 𝑠 , … , 𝑠 𝑁 , is equal to the dimension of the affine hull of these points. It can easily be shown that (e.g. see [31]) if a polytope P is constructed as the convex hull of points 𝑠 , . . . , 𝑠 𝑁 then its dimension can be obtained as dim ( P ) = rank ( [𝑣 |𝑣 |. . . |𝑣 𝑁−1 ] ), where 𝑣 𝑖−1 ∶= 𝑠 𝑖 − 𝑠 , 𝑖 = 2, . . . , 𝑁 . Furthermore, if the polytope

𝑃 ⊆ ℝ 𝑛 is given as a set of constraints, namely 𝑃 = {𝐴𝑥 = 𝑏, 𝐵𝑥 ≥ 𝑑} then it follows that dim ( P ) = n − rank ( 𝐴 ), providing that P is nonempty and the inequalities 𝐵𝑥 ≥ 𝑑 do not imply any additional equality constraints. We need the following definitions to formally continue our exposition.

Definition 1.

The inequality 𝜋 𝑇 𝑥 ≤ 𝜋 is a valid inequality for polytope 𝑃 if it is satisfied by all points in 𝑃 , i.e., if and only if max{𝜋 𝑇 𝑥 | 𝑥 ∈ 𝑃} ≤ 𝜋 . Definition 2. If 𝜋 𝑇 𝑥 ≤ 𝜋 is a valid inequality for a polytope 𝑃 , then 𝑓 = {𝑥 ∈ 𝑃|𝜋 𝑇 𝑥 = 𝜋 } , 𝑓 is called a face of 𝑃 , and we say that (𝜋, 𝜋 ) represents 𝑓 . Definition 3.

A face 𝐹 of polytope 𝑃 is called a facet of 𝑃 if dim ( 𝐹 ) = dim ( 𝑃 ) – 1. Definition 4.

A polytope 𝑃 is called full dimensional if there is no equality constraint which can be satisfied by all points 𝑥 ∈ 𝑃 . We also use the following known results from linear algebra (e.g. see [32]). Theorem 1. If 𝐹 is a facet of 𝑃 , there is some inequality that represents 𝐹 . Theorem 2.

If polytope

𝑃 ⊆ ℝ 𝑛 and dim ( 𝑃 ) = 𝑛 − 𝑑 , where 𝑑 > 0 , then the maximal number of independent equality constraints that can be satisfied by points in P is 𝑑 . Theorem 3. If 𝑓 is a ( 𝑛 − 𝑑 )-dimensional face of polytope 𝑃 ⊆ ℝ 𝑛 , where 𝑑 > 1 , then 𝑓 lies in intersections of d facets of 𝑃 . Theorem 4. If 𝑃 is a full dimensional polytope, it has a unique minimal description 𝑃 ={𝑥 ∈ ℝ 𝑛 | 𝐵𝑥 ≤ 𝑑} where each inequality is unique to within a positive multiple. Suppose 𝑆 denotes the solution set of (IP). According to Theorem 2, if the dimension of conv ( S ) is 𝑑 units less than the dimension of 𝒳 , this implies that there exist 𝑑 unidentified equality constraints. These new constraints can be extracted using the ECA-method [25]. The following section provides a review of this method. Throughout this paper, we assume that the set of solution of (IP) is available. However, in practice we might need to use Lattice point enumeration methods to obtain such a set. We refer to [1] in which the performance of the state-of-the-art Lattice point enumeration algorithms is reported. Let

𝑆 = {𝑠 𝑖 ∈ ℝ 𝑛 , 𝑖 = 1, . . . , 𝑁} denote the set of solutions of (IP) and define: 𝑣 𝑖−1 ∶= 𝑠 𝑖 − 𝑠 , 𝑖 = 2, . . . , 𝑁 , and 𝑉 ∶= [𝑣 |𝑣 |. . . |𝑣 𝑁−1 ] the 𝑛 × 𝑁 matrix whose columns are v i ’s. Ignoring the existing equality constraints, 𝐴 𝑥 = 𝑏 , the ECA-method finds a set of all equality constraints, 𝐴̃𝑥 = 𝑏̃ and then removes the ones captured by the already known equality constraints. Thus, only the equalities that refine the feasible space further are retained. To extract all equality constraints, one needs to find a linear equation system,

𝐴̃𝑥 = 𝑏̃ , which represents the affine hull of 𝑆 ( aff ( 𝑆 )), that is, {𝐴̃𝑥 = 𝑏̃ | 𝑥 ∈ 𝐚𝐟𝐟(𝑆)} . It is shown in [24] and [25] that a matrix 𝐴̃ and vector 𝑏̃ can be obtained as follows: 𝐴̃: = 𝐧𝐮𝐥𝐥(𝑉 𝑇 ) 𝑇 , 𝑏̃: = 𝐴̃𝑠 , (*) where null ( 𝐾 ) denotes a matrix, whose columns are a basis of the null space of matrix 𝐾 , and 𝑠 is one of the solutions in the set 𝑆 . The generated set of equalities do not depend on the choice of 𝑠 , thus 𝑠 can be chosen as any point in 𝑆 . Note that the matrix 𝑉 𝑇 is often large and sparse which allows exploitation of some efficient methods to obtain the basis of the null space [16]. Also, Matlab software is available [20] to find the basis of the null space of such matrices. The generated equalities 𝐴̃𝑥 = 𝑏̃ provide a set of all (existing and unidentified) equality constraints. Therefore, these generated equalities must be consecutively checked to see if they are unidentified. That is, the first generated equality is appended to the existing equality constraints to see whether it increases the rank of the coefficient matrix by one. If so, the constraint is new (non-redundant), and the existing equalities will be updated by including this new equality. Repeating this process guarantees all newly added equalities are new. Note that, the order in which the new constraints are added effects which ones are retained. Thus, we sort the generated equalities (𝐴̃𝑥 = 𝑏̃) based on the number of their non-zero coefficients, increasing the chance that the constraints to retain are easier for generalization. Defining

𝐾(𝑖. ) and

𝐾(. 𝑖) as the 𝑖 th row and 𝑖 th column of matrix 𝐾 respectively, the ECA-method can be represented as Algorithm 1. Algorithm 1 (ECA-method):

Step 0.

Receive the inputs including existing equality constraints, 𝐴 𝑥 = 𝑏 and set of solutions, 𝑆 = {𝑠 𝑖 ∈ ℝ 𝑛 , 𝑖 = 1, . . . , 𝑁} . Construct 𝑉 based on 𝑆 . Set 𝑖 = 0 , 𝐸 = 𝐴 , and j = 1. Step 1.

Obtain 𝑚 = 𝐑𝐚𝐧𝐤(𝐴 ) , and 𝑚 = 𝐑𝐚𝐧𝐤(𝑉) . Set 𝑑 = 𝑚 − 𝑚 . Step 2.

Stop if 𝑑 = 0 and return “There are no unidentified equality constraints”, otherwise, return “There are 𝑑 unidentified equality constraints”. Step 3.

Find all equality constraints,

𝐴̃𝑥 = 𝑏̃ based on the equations in (*) and rearrange the rows of the matrix 𝐴̃ based on sorting the number of their non-zero coefficients to construct the matrix 𝐴 . Step 4.

Construct matrix

𝐸 = [ 𝐸𝐴 (𝑗.) ] . If 𝐑𝐚𝐧𝐤(𝐸) = 𝑚 + 1 , then update 𝑖 = 𝑖 + 1 , and 𝑄(𝑖. ) =𝐴 (𝑗. ) . Otherwise, update 𝐸 by eliminating its last row. Step 5.

Stop if 𝑖 = 𝑑 , and return the set of new equality constraints,

𝑄𝑥 = 𝑏 𝑄 , where 𝑏 𝑄 = 𝑄𝑠 . Otherwise, update 𝑗 = 𝑗 + 1 and go back to Step 4.

3. The proposed method to generate strong valid inequalities

In this section, we propose a potential approach to identify new strong valid inequalities for IP problems that can be included already with a partial set of (equality/inequality) constraints. We call this method

Inequality Constraint Augmenting method , or ICA-method, for short. Note that this method can be employed to extract strong valid inequalities including faces of lower dimension than that of facets. However, we explain it here only for finding facet-defining inequalities. The main idea of the ICA-method is intuitively derived from the well-known fact that faces of polytopes are also polytopes themselves. According to Definition 3, the dimension of facets of a polytope is one less than the dimension of that polytope. That is, if the maximal number of independent equalities that the points of a given polytope can satisfy is 𝑚 , then the points on a facet of that polytope can satisfy exactly 𝑚 + 1 independent equalities. Let 𝜋 𝑇 𝑥 = 𝜋 be the equality that points lying in a facet can satisfy but other points of the polytope cannot. Suppose 𝑠 is a point of the polytope that does not lie in the facet. Therefore, according to Theorem 1, if 𝜋 𝑇 𝑠 < 𝜋 then 𝜋 𝑇 𝑥 ≤ 𝜋 is the facet-defining inequality, and otherwise if 𝜋 𝑇 𝑥 > 𝜋 , then −𝜋 𝑇 𝑥 ≤ −𝜋 represents the facet-defining inequality. If the set of solutions lying on a facet, 𝑆 𝐹 is known, we can exploit the aforementioned ECA-method to find (𝜋, 𝜋 ) . More precisely, we first generate all 𝑚 equalities (if they are not already identified) that the solutions in 𝑆 satisfy, called Set 1 and then generate all 𝑚 + 1 equalities that the solutions in 𝑆 𝐹 satisfy, called Set 2. Subsequently, we can check which equality in Set 2 cannot be written as a linear combination of those 𝑚 equalities in Set 1. In order to achieve [ 𝐸𝐴 (𝑗.) ] : Embedding 𝐴 (𝑗. ) to the end of 𝐸 . this, equalities in Set 2 can be added one at a time to the 𝑚 equalities of Set 1 to check if the rank of the newly constructed coefficient matrix is increased by one. Note that only one equality among 𝑚 + 1 equalities of Set 2 increases the rank and therefore that equality provides (𝜋, 𝜋 ) . A natural question that arises is how we can find a set of solutions (vertices) that lie in a facet. This is a very important question since if we can find such sets of solutions then the ECA-method can find the associated facets. From basic linear algebra, if a polytope with 𝑁 extreme points (vertices) has dimension 𝑘 , then at least 𝑘 vertices lie in each facet of that polytope. Hence, for an exhaustive search, we need to take 𝑘 vertices out of 𝑁 and check if they provide a facet. This implies that all (𝑁𝑘 ) possible ways need to be checked for extracting all facets, and therefore it is impractical. Note that facets are valid inequalities which are often satisfied by numerous solutions. Therefore, to find a facet of a polytope constructed as the convex hull of vertices 𝑠 , … , 𝑠 𝑁 , we can design a mixed integer program (§) to find a valid inequality, 𝜋 𝑇 𝑥 ≤ 𝜋 with the maximal number of solutions satisfying it at equality. Max 𝑧 = ∑ 𝜃 𝑖𝑁𝑖=1 s. t. − 𝜋 𝑇 𝑠 𝑖 + 𝜋 ≤ (1 − 𝜃 𝑖 )𝑀 𝑖 = 1, … 𝑁 (1) 𝜋 𝑇 𝑠 𝑖 − 𝜋 ≤ (𝜃 𝑖 − 1)𝜀 𝑖 = 1, … 𝑁 (2) ∑ 𝜃 𝑖𝑁𝑖=1 ≤ 𝑁 − 1 (3) ∑ 𝜃 𝑖𝑁𝑖=1 ≥ 𝑛 − 𝑚 (4) 𝜃 ∈ {0,1} 𝑖 = 1, … 𝑁 (5) (§) where 𝑆 = {𝑠 𝑖 ∈ ℝ 𝑛 , 𝑖 = 1, . . . , 𝑁} is the set of solutions of (IP). 𝜃 𝑖 ’s are binary decision variables associated with solutions such that 𝜃 𝑖 is one if 𝑠 𝑖 satisfies the valid inequality at equality ( 𝜋 𝑇 𝑠 𝑖 = 𝜋 ) and otherwise is zero. 𝑚 is the number of all independent equality constraints ( 𝑚 = 𝐑𝐚𝐧𝐤(𝐴̃) ), and 𝜀 and 𝑀 are a small and large positive value respectively. In (§), the objective function counts the number of solutions that satisfy the valid inequality at equality. Constraints (1) and (2) together ensure that if the solution 𝑠 𝑖 lies in the inequality (i.e., 𝜋 𝑇 𝑠 𝑖 = 𝜋 ) then 𝜃 𝑖 = 1 and otherwise if 𝜋 𝑇 𝑠 𝑖 ≤ 𝜋 − 𝜀 then 𝜃 𝑖 = 0 . Constraint (3) prevents that all solutions can satisfy the valid inequality at the equality form, because it then will be an equality constraint rather than a valid inequality. Constraint (4) is written as each facet of a ( 𝑛 − 𝑚 )-dimensional polytope can be satisfied by at least 𝑛 − 𝑚 solutions. Proposition 1 . Solving (§) provides a facet of the polytope conv ( 𝑆 ) . Proof . By contradiction suppose solving (§) gives a face, 𝑓 which has a lower dimension than that of facets. According to Theorem 3, 𝑓 is a subset of some facets and each of those facets not only are satisfied at equality by all the solutions that lie in 𝑓 but they also can be satisfied by some other solutions at equality. Therefore, the number of solutions that lie in f are less than the number of solutions lying on those facets and this is obviously a contradiction with our assumption ( 𝑓 is the optimal solution of (§)). This implies that the solution of (§) cannot be anything but a facet and it completes the proof. ■ Note that the facet found by solving (§) might not be a new valid inequality and could be implied by existing constraints. To check whether it is a new or redundant inequality, one can solve the following linear program.

Max 𝑧 = 𝜋 𝑇 𝑥 s. t. 𝐴̃𝑥 = 𝑏̃ 𝐵 𝑥 ≥ 𝑑 (†) where 𝐴̃𝑥 = 𝑏̃ and 𝐵 𝑥 ≥ 𝑑 represent the set of all equality constraints and existing inequality constraints respectively. Suppose the optimal objective value of (†) is 𝑧 ∗ and 𝑧 ∗ ≤ 𝜋 , then valid inequality 𝜋 𝑇 𝑥 ≤ 𝜋 is redundant. Regardless of whether solving (§) leads to a new or redundant inequality, one might be interested to re-solve (§) to identify another valid inequality. So, how we can prevent revisiting the former generated valid inequality when re-solving (§)? Before answering this question in Proposition 2, let define 𝜃 ∗ = (𝜃 , … , 𝜃 𝑁∗ ) 𝑇 to be values in an optimal solution of (§). Proposition 2 . Updating the set of constraints of (§) by adding the constraint (‡) , eliminates the former optimal solution (found valid inequality) from the feasible space. ∑ 𝜃 𝑖 𝑖∈{𝑗|𝜃 𝑗∗ =1} − ∑ 𝜃 𝑖 𝑖∈{𝑗|𝜃 𝑗∗ =0} ≤ −1 + ∑ 𝜃 𝑖∗𝑁 𝑖=1 (‡) Proof . Constraint (‡) ensures that either at least one of the 𝜃 𝑖 ’s variables with value one in 𝜃 ∗ must switch to zero or at least one of 𝜃 𝑖 ’s with value zero in 𝜃 ∗ must switch to one and therefore the model cannot deliver the former optimal solution as a new optimal solution. ■ Remark 1.

Note that solving (§) gives a facet-defining inequality which is often in a form that is hard to interpret and generalize for higher dimensional instances of the underlying problem. Moreover, we already studied the process of using algorithm 1 for a given set of solutions that lie in a facet to find the associated facet-defining inequality. We can utilize that process to reobtain the complicated appearance facet-defining inequalities in a simpler form. More precisely, we can consider the set of solutions lying in this facet, 𝑆 𝐹 = {𝑠 𝑖 |𝜃 𝑖 = 1, 𝑖 = 1, … 𝑁} as the input for the that process to regenerate an inequality which is likely to be simpler for generalization. We shall demonstrate this approach in example 1. Note that constraint (‡) can also be applied to avoid regenerating any of the existing inequalities ( 𝐴 𝑥 = 𝑏 ) as follows. Remark 2.

Suppose 𝜋 𝑇 𝑥 ≤ 𝜋 is an existing valid inequality. We then construct its associated 𝜃 ∗ vector, a vector whose 𝑖 th entry has value one if 𝜋 𝑇 𝑠 𝑖 = 𝜋 and otherwise has value zero. Subsequently, we write the constraint (‡) for 𝜃 ∗ and add it to (§) which ensures we do not regenerate 𝜋 𝑇 𝑥 ≤ 𝜋 as a valid inequality. We shall refer to this constraint as deduplication constraint for 𝜋 𝑇 𝑥 ≤ 𝜋 . If the deduplication constraints are not added to (§), we risk revisiting existing facets in further iterations. The ICA-method is now summarized in Algorithm 2 as follows.

Algorithm 2 (ICA-method):

Step 0.

Receive the inputs including existing equality constraints, 𝐴 𝑥 = 𝑏 and set of solutions, 𝑆 = {𝑠 𝑖 ∈ ℝ 𝑛 , 𝑖 = 1, . . . , 𝑁} . Generate a set of all equality constraints, 𝐴̃𝑥 = 𝑏̃ (if they are not already identified) by using Algorithm 1.

Step 1.

Construct the model (§). Update (§) by embedding the deduplication constraints written for existing inequalities based on the instruction in Remark 2.

Step 2.

Solve (§) strop if it is infeasible. Otherwise, let 𝜃 𝑖∗ denote the 𝜃 𝑖 value in an optimal solution, and find 𝑆 𝐹 = {𝑠 𝑖 | 𝜃 𝑖∗ = 1, 𝑖 = 1, … 𝑁} . Step 3 . If 𝑆 𝐹 is a facet (i.e., 𝐝𝐢𝐦(𝐜𝐨𝐧𝐯(𝑆 𝐹 )) = 𝑛 − 𝐫𝐚𝐧𝐤(𝐴̃) − 1 ), then go to the next step, otherwise go to Step 6. Step 4 . Utilize (†) to check whether it is a new facet. If ‘yes’ go to the next step, otherwise go to Step 6.

Step 5 . Regenerate a simpler form of this inequality by the instruction explained in Remark 1.

Step 6.

Update the model (§) by adding the deduplication constraint written for 𝜃 ∗ . Also, update Constraint (3) of (§) to ∑ 𝜃 𝑖𝑁𝑖=1 ≤ 𝑧 ∗ , where 𝑧 ∗ is the optimal value of the objective function of (§) and go back to step 2. Note that the optimal value of the objective function of (§) will be either equal to or less than the optimal objective of the former iteration. Hence, we can update the right-hand side of Constraint (3) of (§) as mentioned in step 6 of the above algorithm.

Remark 3.

Note that there are polytopes with facets such that a small number of extreme points can satisfy those facets at equality, while the same polytopes have faces of lower dimensions with larger numbers of vertices lying on these faces. Therefore, in some iterations of Algorithm 2, we might reach a face rather than a facet, and we can retain the faces with sufficiently large dimensions as strong valid inequalities. It follows that Algorithm 2, which is designed for identification of facet-defining inequalities, can be modified to extract strong valid inequalities. For doing so, one needs to define a threshold dimension for considering the inequalities associated with faces of higher dimension than that threshold as strong valid inequalities.

We now present a simple example to illustrate the implementation of the proposed ICA-method.

Example 1.

Suppose we are given the three-dimensional Stasheff polytope in ℝ whose extreme points are labelled as shown in Figure 1, and whose 4-dimensional coordinates for the labelled extreme points of the polytope are defined in Table 1. Figure 1.

The three-dimensional Stasheff polytope

Suppose we are given the facet-defining inequalities shown in Table 2 for describing the feasible space of the above polytope and asked to find unidentified facet-defining inequalities (if any) for this polytope.

Table 1.

The vertices of the Stasheff polytope.

Labels Coordinates 𝑠 = [1 6 2 1] T 𝑠 = [1 6 1 2] T 𝑠 = [1 4 1 4] T 𝑠 = [4 3 1 2] T 𝑠 = [4 3 2 1] T 𝑠 = [4 2 1 3] T 𝑠 = [3 2 1 4] T 𝑠 = [1 2 3 4] T 𝑠 = [4 1 2 3] T 𝑠 = [3 1 2 4] T 𝑠 = [1 2 6 1] T 𝑠 = [4 1 4 1] T 𝑠 = [2 1 3 4] T 𝑠 = [2 1 6 1] T Table 2.

Given facets of the Stasheff polytope.

ID Given Inequalities Vertices (Labels) lying in the facets 𝑥 ≥ 1

2, 3, 4, 6, 7 𝑥 + 𝑥 ≥ 3

6, 7, 9, 10 𝑥 ≥ 1

9, 10, 12, 13, 14 𝑥 ≤ 4

4, 5, 6, 9, 12 𝑥 ≥ 1

1, 2, 3, 8, 11 𝑥 + 𝑥 ≥ 3

8, 11, 13, 14 𝑥 + 𝑥 ≤ 7

1, 2, 4, 5

The first step of Algorithm 2 requires us to find all equality constraints (if they are not already identified). Thus, we use the ECA-method (Algorithm 1) to find any new equality constraints as follows: We have 𝑣 𝑖−1 ∶= 𝑠 𝑖 − 𝑠 , 𝑖 = 2, . . . , 14 , 𝑉 ∶= [𝑣 |𝑣 |. . . |𝑣 ] = [ −0 − 0 − 3 − 3 − 3 − 2 − 0 − 3 − 2 − 0 − 3 − 1 − 1−0 − 2 − 3 − 3 − 4 − 4 − 4 − 5 − 5 − 4 − 5 − 5 − 5−1 − 1 − 1 − 0 − 1 − 1 − 1 − 0 − 0 − 4 − 2 − 1 − 4−1 − 3 − 1 − 0 − 2 − 3 − 3 − 2 − 3 − 0 − 0 − 3 − 0 ] Then, a set of all equality constraints,

𝐴̃𝑥 = 𝑏̃ can be obtained by the equations in (*) as follows:

𝐴̃ = 𝐧𝐮𝐥𝐥(𝑉 𝑇 ) 𝑇 = [1 1 1 1], 𝑏̃: = 𝐴𝑠 = 10. Therefore, the only equality constraint for this polytope is 𝑥 + 𝑥 + 𝑥 + 𝑥 = 10 . Subsequently, we need to construct the model (§) and then update it by embedding the following deduplication constraints: −𝜃 + 𝜃 + 𝜃 + 𝜃 − 𝜃 + 𝜃 + 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 ≤ 4, −𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 + 𝜃 + 𝜃 − 𝜃 + 𝜃 + 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 ≤ 3, −𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 + 𝜃 + 𝜃 − 𝜃 + 𝜃 + 𝜃 + 𝜃 ≤ 4, −𝜃 − 𝜃 − 𝜃 + 𝜃 + 𝜃 + 𝜃 − 𝜃 − 𝜃 + 𝜃 − 𝜃 − 𝜃 + 𝜃 − 𝜃 − 𝜃 ≤ 4, +𝜃 + 𝜃 + 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 + 𝜃 − 𝜃 − 𝜃 + 𝜃 − 𝜃 − 𝜃 − 𝜃 ≤ 4, −𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 + 𝜃 − 𝜃 − 𝜃 + 𝜃 − 𝜃 + 𝜃 + 𝜃 ≤ 3, +𝜃 + 𝜃 − 𝜃 + 𝜃 + 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 ≤ 3. Note that deduplication constraints are obtained by writing the constraint (‡) for each of the existing inequalities given in Table 2. This set of constraints ensures that we do not generate any of existing inequalities.

Iteration 1.

Solving (§) by setting

𝑀 = 100 and 𝜀 = 0.01 , we arrive at the following solution: 𝜋 =[0.001 0.001 0.001 − 0.009] 𝑇 , 𝜋 = 0 , 𝜃 = [1 0 0 0 1 0 0 0 0 0 1 1 0 1] 𝑇 . Checking the dimension of the convex hull of the extreme points lying on +0.001𝑥 + 0.001𝑥 − 0.009𝑥 ≤ 0 implies that it is a facet-defining inequality. Furthermore, checking (†) verifies that it is associated with a new facet. As mentioned in Remark 1, when a polytope is not full-dimension, a facet can be represented by several inequalities, and one might be interested in finding a simple version of the inequality to represent that facet. For doing so, one can utilize the instruction in Remark 1 and reach 𝑥 ≤ 1 . Now we need to add the deduplication constraint to eliminate the currently found facet-defining inequality from the solution space of (§). This constraint is of the form: 𝜃 − 𝜃 − 𝜃 − 𝜃 + 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 − 𝜃 + 𝜃 + 𝜃 − 𝜃 + 𝜃 ≤ 4, and, the Constraint (3) of (§) must be updated from ∑ 𝜃 𝑖𝑁𝑖=1 ≤ 13 to ∑ 𝜃 𝑖𝑁𝑖=1 ≤ 5 . Iteration 2.

Solving the updated (§), we find the following solution: 𝜋 = [−0.004 − 0.004 − 0.004 0.006] 𝑇 , 𝜋 = 0 , 𝜃 = [0 0 1 0 0 0 1 1 0 1 0 0 1 0] 𝑇 . Dimension and non-redundancy checking reveals that the found inequality is a new facet-defining inequality and by implementing the Remark 1 instruction the simpler version of this inequality can be found in the form of 𝑥 + 𝑥 + 𝑥 ≥ 6 . Hence, this is the second new facet-defining inequality. The deduplication constraint for this iteration is: −𝜃 − 𝜃 + 𝜃 − 𝜃 − 𝜃 − 𝜃 + 𝜃 + 𝜃 − 𝜃 + 𝜃 − 𝜃 − 𝜃 + 𝜃 − 𝜃 ≤ 4, and Constraint (3) of (§) remain intact as this facet-defining constraint is also satisfied by 5 extreme points. Iteration 3.

We terminate Algorithm 2 since the updated (§) model is now infeasible. Therefore, the found facet-defining inequalities are the ones presented in Table 3.

Table 3.

New identified facets of the Stasheff polytope.

Vertices (Labels) lying in the facet Facet-defining Inequalities

1, 5, 11, 12, 14 𝑥 ≤ 1

3, 7, 8, 10, 13 𝑥 + 𝑥 + 𝑥 ≥ 6 Note that facets represented in Table 2 and Table 3 fully describe the Stasheff polytope. Having illustrated the ICA-method with a simple example, we now show how it can be used to strengthen formulations of interesting integer programming problems, such as the Traveling Salesman Problem.

4. A New TSP Formulation

In this section, we will first develop a new TSP formulation and then utilize the proposed method in the preceding section to strengthen the formulation by devising some strong inequalities. The now classical Traveling Salesman Problem (TSP) constitutes a famous challenge for operations researchers, mathematicians, and computers scientists. In particular, suppose there are 𝑛 cities, and a traveling salesman starts from a home city, passing through all other cities exactly once before returning to the home city. Such a travel path is called a tour or a Hamiltonian cycle (HC). The distance between each pair of cities 𝑖 and 𝑗 is given as 𝐶 𝑖𝑗 , and so for any tour, the tour length is the sum of distances traveled. Hence, the TSP can be simply thought of as the optimization problem of identifying the tour of shortest length. When 𝐶 𝑖𝑗 ≠𝐶 𝑗𝑖 the problem is referred to as the Asymmetric Traveling Salesman Problem (ATSP). We propose a new model for the TSP based on its relationship to the Hamiltonian Cycle Problem (HCP) and corresponding polytope constructions. Feinberg [11] investigated the relationship between the HCP and discounted Markov Decision Processes. He constructed a new polytope corresponding to a given graph 𝐺 , which we shall refer to as ℋ β . The constraints of ℋ β are of the form: ∑ 𝑥 − 𝛽 ∑ 𝑥 𝑗1𝑛𝑗=2 = 1 − 𝛽 𝑛 ∑ 𝑥 𝑖𝑗𝑛𝑗=1𝑗≠𝑖 − 𝛽 ∑ 𝑥 𝑗𝑖𝑛𝑗=1𝑗≠𝑖 = 0, 𝑖 = 2, … , 𝑛 ∑ 𝑥 = 1 𝑥 𝑖𝑗 ≥ 0, 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 𝑥 𝑖𝑗 = 0, arc(𝑖, 𝑗) ∉ 𝐺 ( ℋ β ) where β is a constant value in (0,1). Feinberg showed that, if the graph 𝐺 is Hamiltonian, then the polytope ℋ β has an extreme point for each of its Hamiltonian cycles. This result is elaborated in Theorem 5 below. Theorem 5.

If in a feasible solution of ℋ β , for each node 𝑖 ∈ {1, … , 𝑛} , exactly one variable 𝑥 𝑖𝑗 takes a positive value (i.e., from each node only one flow emanates), then that solution is an extreme point of ℋ β and the arcs associated with the positive valued variables in that solution trace out a tour or Hamiltonian cycle of the graph 𝐺 . Figure 2.

A Hamiltonian graph

Example 2.

The ℋ β set of constraints for the graph given in Figure 2 can be written as: 𝑥 + 𝑥 + 𝑥 − 𝛽(𝑥 + 𝑥 + 𝑥 ) = 1 − 𝛽 𝑥 + 𝑥 − 𝛽(𝑥 + 𝑥 ) = 0 𝑥 + 𝑥 − 𝛽(𝑥 + 𝑥 ) = 0 𝑥 + 𝑥 + 𝑥 − 𝛽(𝑥 + 𝑥 + 𝑥 ) = 0 𝑥 + 𝑥 − 𝛽(𝑥 + 𝑥 ) = 0 𝑥 + 𝑥 + 𝑥 = 1 𝑥 , 𝑥 , 𝑥 , 𝑥 , 𝑥 , 𝑥 , 𝑥 , 𝑥 , 𝑥 , 𝑥 , 𝑥 , 𝑥 ≥ 0. It can be seen by inspection that {𝑥 = 1, 𝑥 = 𝛽, 𝑥 = 𝛽 , 𝑥 = 𝛽 , 𝑥 = 𝛽 𝑥 = 𝑥 = 𝑥 = 𝑥 = 𝑥 = 𝑥 = 𝑥 = 0 is an extreme point of ℋ β , and the set of arcs {(1, 2), (2, 3), (3, 4), (4, 5), (5, 1)}, corresponding to the non-zero variables, traces out a Hamiltonian cycle in that graph. Exploiting the result in Theorem 5, one can utilize binary variables to develop a new TSP formulation. In particular, let 𝑧 𝑖𝑗 > 0 if variable 𝑥 𝑖𝑗 > 0 and otherwise 𝑧 𝑖𝑗 = 0 .Then it is easy to write a set of constraints to ensure that only one flow emanates from each node and this guarantees that the feasible space of the model reduces to only Hamiltonian Cycles (tours). Therefore, the new TSP formulation, which we shall refer to as the TSP H (𝛽) formulation, can be written as follows. Min ∑ ∑ 𝑐 𝑖𝑗 𝑧 𝑖𝑗𝑛𝑗=1𝑗≠𝑖 𝑛𝑖=1 s. t. ∑ 𝑥 − 𝛽 ∑ 𝑥 𝑗1𝑛𝑗=2 = 1 − 𝛽 𝑛 (1) ∑ 𝑥 𝑖𝑗𝑛𝑗=1𝑗≠𝑖 − 𝛽 ∑ 𝑥 𝑗𝑖𝑛𝑗=1𝑗≠𝑖 = 0, 𝑖 = 2, … , 𝑛 (2) ∑ 𝑥 = 1 (3) 𝑥 = 𝑧 , 𝑗 = 2, … , 𝑛 (4) 𝑥 𝑗1 = 𝛽 𝑛−1 𝑧 𝑗1 , 𝑗 = 2, … , 𝑛 (5) (TSP H (𝛽) ) 𝛽 𝑛−2 𝑧 𝑖𝑗 ≤ 𝑥 𝑖𝑗 ≤ 𝛽𝑧 𝑖𝑗 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (6) ∑ 𝑧 𝑖𝑗𝑛𝑗=1𝑗≠𝑖 = 1, 𝑖 = 1, … , 𝑛 (7) 𝑥 𝑖𝑗 ≥ 0, 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (8) 𝑧 𝑖𝑗 ∈ {0,1}, 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (9) where 𝑧 𝑖𝑗 = 1 if arc (𝑖, 𝑗) belongs to the tour and otherwise 𝑧 𝑖𝑗 = 0 . In TSP H (𝛽) formulation, constraints (1)-(3) are ℋ β constraints for complete graph and constraints (4)-(7) provide the required condition in theorem 5. Consequently, the only feasible solutions of TSP H (𝛽) are Hamiltonian cycles and, therefore, TSP H (𝛽) is a formulation for solving the ATSP. This formulation can be considered as a node-oriented TSP formulation since it contains a set of variables ( 𝑥 𝑖𝑗 ’s) which show the rank order that each node is visited in a tour. For instance, in the tour of example 2, we have 𝑥 = 𝛽 , and node 3 is visited in the third place with node 1 being assigned a rank of one. Generalizing, if 𝑥 𝑖𝑗 = 𝛽 𝑘−1 , 𝑘 = 1, … , 𝑛 , it implies that if the tour starts from node 1, we will visit node 𝑖 in 𝑘 th order in that tour. To the authors’ knowledge, there are a few node-oriented TSP models with (at most) quadratic order ( 𝒪(𝑛 ) ) in the number of variables and constraints. The first such model is well-known as the MTZ formulation, which was introduced by Miller, Tucker, and Zemlin in 1960 [23]. The MTZ formulation was improved by Desrochers and Laporte in 1991 [8] through lifting some of its inequality constraints into facets of the underling polytope. In 2002, Sherali and Driscoll [28] proposed a reformulation of the MTZ constraints called the SD model and proved that this model provides an even tighter LP-relaxation than of that the DL model. More recently, Oncan et al. showed that the LP-relaxation of SD is also stronger than that of the well-known single commodity flow model [29]. Furthermore, based on the analysis in [29] and [27], the SD formulation possesses the tightest relaxation among all known TSP formulations with at most ( 𝒪(𝑛 ) ) variables and constraints. This summary of existing literature motivates us to select the SD model for comparison with our newly developed TSP formulation. We shall compare the lower bounds generated by solving the LP-relaxation of both models (SD and TSP H (𝛽) ) over a set of ATSP instances from TSPLIB [26]. We will then tighten the TSP H (𝛽) formulation using the ICA-method developed in earlier sections of this paper to show its effectiveness. The SD model [28] is defined as: Min ∑ ∑ 𝑐 𝑖𝑗 𝑧 𝑖𝑗𝑛𝑗=1𝑗≠𝑖 𝑛𝑖=1 s. t. ∑ 𝑥 𝑖𝑗𝑛𝑗=2 + (𝑛 − 1)𝑥 𝑖1 = 𝑢 𝑖 , 𝑖 = 2, … , 𝑛 (1) ∑ 𝑥 𝑖𝑗𝑛𝑖=2 + 1 = 𝑢 𝑗 , 𝑗 = 2, … , 𝑛 (2) 𝑧 𝑖𝑗 ≤ 𝑥 𝑖𝑗 ≤ (𝑛 − 2)𝑥 𝑖𝑗 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (3) 𝑢 𝑗 + (𝑛 − 2)𝑧 𝑖𝑗 − (𝑛 − 1)(1 − 𝑧 𝑗𝑖 ) ≤ 𝑥 𝑖𝑗 + 𝑥 𝑗𝑖 ≤ 𝑢 𝑗 − (1 − 𝑧 𝑗𝑖 ), 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (4) + (𝑛 − 3)𝑥 𝑗1 ≤ 𝑢 𝑗 ≤ (𝑛 − 2) − (𝑛 − 3)𝑥 + 𝑥 𝑗1 , 𝑗 = 2, … , 𝑛 (5) (SD) ∑ 𝑧 𝑖𝑗𝑛𝑗=1𝑗≠𝑖 = 1, 𝑖 = 1, … , 𝑛 (6) ∑ 𝑧 𝑖𝑗𝑛𝑖=1𝑖≠𝑗 = 1, 𝑗 = 1, … , 𝑛 (7) 𝑥 𝑖𝑗 ≥ 0, 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (8) 𝑧 𝑖𝑗 ∈ {0,1}, 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (9) where 𝑢 𝑗 denotes the rank order that node 𝑗 is visited with the initial city being assigned a rank of zero. Also, 𝑧 𝑖𝑗 takes on a value of 1 if the tour transitions from node 𝑖 to node 𝑗 , and 0 otherwise. Table 4 presents the result of a computational comparison of the LP-relaxation of SD and TSP H (𝛽) formulations. Note that the TSP H (𝛽) formulation depends on parameter β . Interestingly, the structure of the polytope constructed as the LP-relaxation of TSP H (𝛽) could vary based β . Thus, in our numerical study, we consider this parameter very close to one as such values provide greater lower bounds than that of smaller β’s . This observation is consistent with the results in [9] and [10] in which the extreme points of ℋ β and another relevant polytope are analytically and computationaly studied. Table 4.

LP-relaxation comparison study between

TSP H (𝛽) and SD formulation over ATSP instances of TSPLIB. Problem TSP H (0.999) TSP H (0.9999) SD br17 ft_53 ft_70 ftv_33 ftv_35 ftv_38 ftv_44 ftv_47 ftv_53 ftv_55 ftv_64 ftv_70 ftv_170 Kro124p

P43

Rbg323

Rbg358

Rbg403

Rbg443

Ry48p

Although the numerical results in Table 4 reveal that the lower bound found by TSP H (0.9999) are greater than of that TSP H (0.999), the performance of TSP H (𝛽) is far weaker than of that SD in all tested instances. Thus, one might be interested in employing ICA-method to strengthen TSP H (𝛽) formulation by generating some double index facet-defining inequalities. Note that we limited ourselves to find only double index inequalities as we do not wish to exceed using

𝒪(𝑛 ) variables and constraints.

5. Identification of Strong Valid Inequalities for TSP H (𝜷) Before implementing the ICA-method, it is beneficial to conduct symmetry break preparations which can dramatically reduce the required computations. In particular, we set the zero value for many decision variables of the model (§), and this makes Algorithm 2 much more efficient as this model must be solved in each iteration of the algorithm. The symmetry that arises in (§) is due to the indistinguishability of nodes

2, … , 𝑛 in the TSP H (𝛽) formulation. Therefore, for any solution of (§), there exist equivalent symmetric reflections of this solution obtained by relabelling the nodes through mapping labels

2, … , 𝑛 to any reordering of these nodes (in (𝑛 − 1)! ways). Note that not necessarily every map result in a new solution. To elaborate what symmetry means here, consider a double index inequality such as constraint (6) in TSP H (𝛽) which can be written for (𝑛 − 12 ) different combinations of 𝑖 and 𝑗 . However, if these inequalities were unknown, we would only need to find one such an inequality to discover that family of inequalities. To prevent finding different members of an inequality family several times, we can permit only one (or a few of them) to be in the feasible space of (§). More precisely, we could assume that the inequality 𝜋 𝑇 𝑥 ≤ 𝜋 that we are trying to find through solving (§) is of the form: ∑ 𝜋̅ 𝑧 + ∑ 𝜋̅ 𝑘1 𝑧 𝑘1𝑛𝑘=2 + ∑ 𝜋̅ 𝑧 + ∑ 𝜋̅ 𝑘2 𝑧 𝑘2𝑛𝑘=1𝑘≠2 + ∑ 𝜋̅ 𝑧 + ∑ 𝜋̅ 𝑘3 𝑧 𝑘3𝑛𝑘=1𝑘≠3 + ∑ 𝜋 𝑥 + ∑ 𝜋 𝑘2 𝑥 𝑘2𝑛𝑘=1𝑘≠2 + ∑ 𝜋 𝑥 + ∑ 𝜋 𝑘3 𝑥 𝑘3𝑛𝑘=1𝑘≠3 ≤ 𝜋 (**) This is due to distinguishability of node 1 from other nodes in terms of the TSP H (𝛽) formulation viewpoint which allows considering this node as a solo member group. On the other hand, nodes

2, … , 𝑛 are indistinguishable in the TSP H (𝛽) formulation and constitute the second group. That means, if we swap the labels of two distinct nodes from the node set {2, … , 𝑛} in the TSP H (𝛽) formulation, then the lower bound generated by the LP-relaxation of TSP H (𝛽) remains intact, however, if we exchange the label of node 1 with any other nodes, the lower bound generated by LP-relaxation could vary. It follows that the most general form of a double index inequality comprises three indices, namely 1, 𝑖 , and 𝑗 where indices 𝑖 and 𝑗 are two distinct arbitrary indices from the node set {2, … , 𝑛} . Thus, we just need to allow the variables associated with arcs connected to nodes 1, 𝑖 , and 𝑗 be involved in 𝜋 𝑇 𝑥 ≤ 𝜋 . Without loss of generality, 𝑖 and 𝑗 are set to 2 and 3 respectively in (**) and the coefficients of all other variables set to zero. One might think that the terms ∑ 𝜋 𝑥 + ∑ 𝜋 𝑘1 𝑥 𝑘1𝑛𝑘=2 must also be added to the left-hand side of (**). However, it should be noted that the constraints (4) and (5) in TSP H (𝛽) allow us to remove those terms. More precisely, variables 𝑥 and 𝑥 𝑘1 , 𝑘 = 2, … , 𝑛 can be substituted by 𝑧 and 𝛽 𝑛−1 𝑧 𝑘1 , 𝑘 = 2, … , 𝑛, respectively, and as latter variables are involved in 𝜋 𝑇 𝑥 ≤ 𝜋 , the former ones can be removed. The elimination of these terms in (**) reduce the symmetries in the solution space of (§) and consequently make it easier to solve. We refer the readers to [19] and [22] for further discussion about symmetry breaking techniques in integer linear programs. Before any attempt to find strong valid inequalities, we first implemented the ECA-method (Algorithm 1) and found ∑ 𝑧 𝑖𝑗𝑛𝑖=1𝑖≠𝑗 = 1, 𝑗 = 1, … , 𝑛 as the only set of unidentified non-redundant equality constraints. Obviously, this set of constraints along with constraints (7) in TSP H (𝛽) constitute the assignment constraints which appear in most TSP formulations. In order to strengthen the TSP H (𝛽) formulation by inequalities, we employed Matlab R2017b and CPLEX 12.8 and implemented the ICA-method (Algorithm 2) with parameters 𝛽 = 0.999,𝜀 = 0.1, 𝑀 = 100 , for 𝑛 = 6 and 𝑛 = 7 , to determine the unknown coefficients of inequality (**). All computational experiments are carried out on a HP Z840 workstation with Intel Xeon E5-2640 2.4 GHz 10-Core Processors, 128GB RAM, running Windows 7 operating system and using a single thread. After completion of this process, we found 𝜃 vectors for facet-defining inequalities. Recall that the 𝜃 vector for an inequality determines which solutions lie on that inequality. Utilizing the instruction in Remark 1, we used 𝜃 vectors and reobtained the inequalities but this time we considered 𝛽 as a parameter and used symbolic Matlab. This leads to finding expressions for the inequalities based on parameter 𝛽 which allows us to explore whether there are any patterns that are generalizable as 𝑛 grows. By recognizing generalizable patterns in the generated inequalities, we could find a rather large number of valid inequalities, that are facet-defining for those dimensions (𝑛 = 6, 7) . These inequalities, which we shall refer to as Set 3, are listed below: 𝛽 𝑛−1 (1 − 𝑧 𝑖𝑗 − 𝑧 𝑗𝑖 ) ≤ ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 − 𝛽𝑥 𝑖𝑗 ≤ 𝛽(1 − 𝑧 𝑖𝑗 − 𝑧 𝑗𝑖 ), 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (1) 𝛽 𝑛−2 (1 − 𝑧 𝑗1 − 𝑧 ) + 𝛽𝑧 + 𝐵 𝑛−1 𝑧 𝑗1 ≤ ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑗 ≤ 𝛽 (1 − 𝑧 𝑗1 − 𝑧 ) + 𝛽𝑧 + 𝛽 𝑛−1 𝑧 𝑗1 , 𝑖 = 2, … , 𝑛 (2) 𝑥 𝑗𝑖 ≤ (𝛽 − 𝛽 )𝑧 + 𝛽 𝑧 𝑗𝑖 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (3) (𝛽 𝑛−2 − 𝛽)𝑧 + −𝛽 + 𝛽 𝑛−3 ∑ 𝛽 𝑎𝑛−5𝑎=0 𝑧 𝑖1 + 𝛽 𝑛−2 𝑧 𝑗1 − 𝛽 𝑛−3 ∑ 𝛽 𝑎𝑛−5𝑎=0 𝑧 𝑗𝑖 + ∑ 𝛽 𝑎𝑛−4𝑎=0 ∑ 𝛽 𝑎𝑛−5𝑎=0 𝑥 𝑗𝑖 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 ≥ 𝐵 𝑛−2 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (4) (𝛽 𝑛−3 − 𝛽)𝑧 + (𝛽 𝑛−2 − 𝛽 𝑛−3 )𝑧 𝑖1 + 𝛽 𝑛−2 𝑧 𝑗1 + 𝛽 + 𝛽 𝑛−3 ∑ 𝛽 𝑎𝑛−4𝑎=0 𝑧 𝑗𝑖 + −𝛽 𝑛−4 + ∑ 𝛽 𝑎𝑛−5𝑎=0 ∑ 𝛽 𝑎𝑛−4𝑎=0 𝑥 𝑗𝑖 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 ≥ 𝛽 𝑛−2 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (5) (Set 3) (𝛽 𝑛−4 − 𝛽)(𝑧 + 𝑧 ) + (𝛽 𝑛−2 − 𝛽 𝑛−3 + 𝛽 𝑛−4 )(𝑧 𝑖1 + 𝑧 𝑗1 ) + (𝑥 𝑖𝑗 + 𝑥 𝑗𝑖 ) + ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 ≥ 𝛽 𝑛−2 + 𝐵 𝑛−4 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (6) (𝛽 𝑛−4 − 𝛽)𝑧 + (𝛽 𝑛−2 − 𝛽 𝑛−3 + 𝛽 𝑛−4 )(𝑧 𝑖1 + 𝑧 𝑗1 ) + (𝛽 𝑛−4 + 𝛽 𝑛−2 )𝑧 𝑗𝑖 + 𝑥 𝑖𝑗 − 𝛽𝑥 𝑗𝑖 + ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 ≥ 𝛽 𝑛−2 + 𝛽 𝑛−4 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (7) (𝛽 𝑛−4 − 𝛽)(𝑧 + 𝑧 ) + (𝛽 𝑛−2 − 𝛽 𝑛−3 + 𝛽 𝑛−4 )(𝑧 𝑖1 + 𝑧 𝑗1 ) +𝛽 𝑛−4 ( 𝑎𝑛−3𝑎=2 ∑ 𝛽 𝑎𝑛−5𝑎=0 ) 𝑧 𝑗𝑖 + 𝑥 𝑖𝑗 + 𝑛−4 −𝛽 𝑛−5 ∑ 𝛽 𝑎𝑛−5𝑎=0 𝑥 𝑗𝑖 + ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 ≥ 𝛽 𝑛−2 + 𝛽 𝑛−4 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (8) (𝛽 𝑛−3 − 𝛽 )𝑧 + 𝐵 𝑛 𝑧 𝑖1 + (𝛽 𝑛 + 𝛽 𝑛−1 + 𝛽 𝑛−3 )𝑧 𝑗1 + (𝛽 𝑛 + 𝛽 𝑛−1 +𝛽 𝑛−2 + 𝛽 𝑛−3 )𝑧 𝑗𝑖 + 𝑥 𝑖𝑗 + 𝛽 ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + (𝛽 + 𝛽 + 1) ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 −𝛽 𝑥 𝑗𝑖 ≥ 𝛽 𝑛 + 𝛽 𝑛−1 + 𝛽 𝑛−2 + 𝛽 𝑛−3 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (9) 𝛽 𝑛−3 − 𝛽 𝛽 + 1 𝑧 + (𝛽 𝑛−3 − 𝛽 )𝑧 + 𝛽 𝑛−1 𝛽 + 1 𝑧 𝑖1 + 𝛽 𝑛−1 + 𝛽 𝑛−3 𝛽 + 1 𝑧 𝑗1 +𝛽 𝑛−3 ( 𝑎𝑛−2𝑎=3 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 ) 𝑧 𝑗𝑖 + 𝑥 𝑖𝑗 + 𝑛−3 +2𝛽+∑ 𝛽 𝑎𝑛−5𝑎=2 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 𝑥 𝑗𝑖 + 𝛽𝛽+1 ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 ≥ 𝛽 𝑛−1 + 𝛽 𝑛−2 + 𝛽 𝑛−3 𝛽 + 1 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (10) The following set of inequalities hold only for even 𝑛 : 𝛽 𝑛−3 − 𝛽 𝛽 + 1 𝑧 + (𝛽 𝑛−3 − 𝛽 )𝑧 + 𝛽 𝑛−1 𝛽 + 1 𝑧 𝑖1 + 𝛽 𝑛−1 + 𝛽 𝑛−3 𝛽 + 1 𝑧 𝑗1 +𝛽 𝑛−3 ( ∑ 𝛽 𝑎𝑛−4𝑎=0 ) 𝑧 𝑗𝑖 + 𝑥 𝑖𝑗 + 𝑛−4 +∑ 𝛽 ∑ 𝛽 𝑎𝑛−4𝑎=0 𝑥 𝑗𝑖 + 𝛽𝛽+1 ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 ≥ 𝛽 𝑛−1 + 𝛽 𝑛−2 + 𝛽 𝑛−3 𝛽 + 1 , 𝑖, 𝑗 = 2, … , 𝑛, 𝑖 ≠ 𝑗 (11) Checking the above inequalities for larger 𝑛 ’s , namely 𝑛 = 8, 9, 10, 11, we found them not only valid but also facet-defining for those dimensions as well. These results support our following conjecture. Conjecture 1.

There exists a constant 𝛽 such that for 𝑛 ≥ 6 ,and 𝛽 ∈ (𝛽 , 1) , the inequalities in Set 3, are facet-defining for the TSP H (𝛽) formulation. Conjecture 1 states that when 𝛽 is large enough, the inequalities in Set 3 are facet-defining for TSP H (𝛽) formulation. Note that while it is not difficult to show that these inequalities are valid for any 𝑛 ≥ 6 , proving that they are facet-defining needs further effort and is beyond the purpose of this paper. For the sake of brevity, we supply explanations for the validity of just three inequalities from Set 3, namely, inequalities (1), (7) and (10) in Appendix 1. The others can be explained by derivations that are conceptually similar. We are now able to conduct a comparison study between the strengthened TSP H (𝛽) and SD formulations. For strengthening the TSP H (𝛽) formulation, inequalities (1)-(3) of Set 3 are appended to the former set of constraints of TSP H (𝛽) , and we shall refer to this model as TSP H * (𝛽) . Note that the remainder of the newly developed inequalities in Set 3 were not added to the model as we wish to keep the number of constraints almost equivalent to that of the SD formulation. We conducted the computation for TSP H * (𝛽) (for β = 0.999 and 0.9999), and the results are summarized in Table 8. The boldly written values show the instances where TSP H * (𝛽) could not beat the SD formulation. Table 8.

LP-relaxation comparison study between

TSP H * (𝛽) and SD formulation over ATSP instances of TSPLIB. Problem TSP H (0.999) TSP H (0.9999) SD br17 ft_53 ft_70 ftv_33 ftv_35 ftv_38 ftv_44 ftv_47 ftv_53 ftv_55 ftv_64 ftv_70 ftv_170 Kro124p

P43

Rbg323

Rbg358

Rbg403

Rbg443

Ry48p

Naturally, TSP H *(0.9999) performed better than TSP H *(0.999) in all instances. Furthermore, TSP H *(0.9999) outperforms SD in twelve out of twenty test problems, exactly matches SD in five instances and is outperformed slightly by SD in only three instances. Comparing these results to the results in Table 4 also reveals that inequalities (1)-(3) has considerably strengthened the TSP H * (𝛽) formulation.

6. Conclusion

In this paper, we propose a generic method to assist modelers in identifying strong linear valid inequalities that can be used to refine feasible spaces of LP-relaxations of mixed integer programmes. The main purpose of this method is to make the devising of strong valid inequalities for IP problems closer to a methodological process rather than an artistic task. Although the proposed method is still far from a fully automated process, it can be considered as an initial step for this notoriously difficult task. To illustrate the proposed method, a new TSP formulation is introduced and then the ICA-method is employed to devise strong valid inequalities to strengthen the formulation. Comparison between the relaxation of this formulation and a widely used TSP formulation confirms the effectiveness of the devised strong valid inequalities. Finally, as a naturally arising topic for future research, one can explore whether there are more properties of facet-defining inequalities that can be expressed as new constraints for (§). This can refine the feasible space of (§) from many valid inequalities that are not facets, and therefore the algorithm will be more efficient and more likely to find strong valid inequalities. Another natural line of research is to exploit the ICA-method to extract strong valid inequalities for tightening IP formulations of many other problems.

References [1] Assarf, B., Gawrilow, E., Herr, K., Joswig, M., Lorenz, B., Paffenholz, A., & Rehn, T. (2017). Computing convex hulls and counting integer points with polymake.

Mathematical Programming Computation , (1), 1-38. [2] Avis, D., Bremner, D., & Seidel, R. (1997). How good are convex hull algorithms?. Computational Geometry , (5-6), 265-301. [3] Avis, D., & Fukuda, K. (1992). A pivoting algorithm for convex hulls and vertex enumeration of arrangements and polyhedra. Discrete & Computational Geometry , (3), 295-313. [4] Avis, D., & Fukuda, K. (1996). Reverse search for enumeration. Discrete Applied Mathematics , (1-3), 21-46. [5] Bektaş, T., & Gouveia, L. (2014). Requiem for the Miller–Tucker–Zemlin subtour elimination constraints?. European Journal of Operational Research , (3), 820-832. [6] Christof, T., Löbel, A., & Stoer, M. (1997). Porta-polyhedron representation transformation algorithm. . [7] Conforti, M., Cornuéjols, G., & Zambelli, G. (2014). Integer programming (Vol. 271). Berlin: Springer. [8] Desrochers, M., & Laporte, G. (1991). Improvements and extensions to the Miller-Tucker-Zemlin subtour elimination constraints.

Operations Research Letters , (1), 27-36. [9] Eshragh, A., & Filar, J. (2011). Hamiltonian cycles, random walks, and discounted occupational measures. Mathematics of Operations Research , (2), 258-270. [10] Eshragh, A., Filar, J. A., Kalinowski, T., & Mohammadian, S. (2018). Hamiltonian cycles and subsets of discounted occupational measures. arXiv preprint arXiv:1805.04725 . [11] Feinberg, E. A. (2000). Constrained discounted Markov decision processes and Hamiltonian cycles. Mathematics of Operations Research , (1), 130-140. [12] Filar, J. A., & Moeini, A. (2015). Hamiltonian cycle curves in the space of discounted occupational measures. Annals of Operations Research , 1-18. [13] Fischetti, M., Lodi, A., & Toth, P. (2003). Solving real-world ATSP instances by branch-and-cut. In

Combinatorial Optimization—Eureka, You Shrink! (pp. 64-77). Springer, Berlin, Heidelberg. [14] Fukuda, K., Liebling, T. M., & Margot, F. (1997). Analysis of backtrack algorithms for listing all vertices and all faces of a convex polyhedron.

Computational Geometry , (1), 1-12. [15] Gawrilow, E., & Joswig, M. (2000). Polymake: a framework for analyzing convex polytopes. In Polytopes—combinatorics and computation (pp. 43-73). Birkhäuser, Basel. [16] Golub, G. H., & Van Loan, C. F. (2012).

Matrix computations (Vol. 3). JHU press. [17] Grötschel, M., Lovász, L., & Schrijver, A. (2012).

Geometric algorithms and combinatorial optimization (Vol. 2). Springer Science & Business Media. [18] Karp, R. M. (1972). Reducibility among combinatorial problems. In Complexity of computer computations (pp. 85-103). Springer, Boston, MA. [19] Liberti, L. (2012). Reformulations in mathematical programming: automatic symmetry detection and exploitation.

Mathematical Programming , (1-2), 273-304. [20] lusolZ (2016). lusolZ: Nullspace of sparse matrix via LUSOL or SPQR. available for download at http://stanford.edu/group/SOL/software/lusolZ. [21] Bernhard, K., & Vygen, J. (2008). Combinatorial optimization: Theory and algorithms. Springer, Third Edition, 2005. [22] Margot, F. (2010). Symmetry in integer linear programming. In

50 Years of Integer Programming 1958-2008 (pp. 647-686). Springer, Berlin, Heidelberg. [23] Miller, C. E., Tucker, A. W., & Zemlin, R. A. (1960). Integer programming formulation of traveling salesman problems.

Journal of the ACM (JACM) , (4), 326-329. [24] Moeini, A. (2016). Approximations of the Convex Hull of Hamiltonian Cycles for Cubic Graphs (Doctoral dissertation, Flinders University). [25] Moeini, A. (2017). Identification of unidentified equality constraints for integer programming problems. European Journal of Operational Research , (2), 460-467. [26] Reinelt, G. (1991). TSPLIB—A traveling salesman problem library. ORSA journal on computing , (4), 376-384. [27] Roberti, R., & Toth, P. (2012). Models and algorithms for the asymmetric traveling salesman problem: an experimental comparison. EURO Journal on Transportation and Logistics , (1-2), 113-133. [28] Sherali, H. D., & Driscoll, P. J. (2002). On tightening the relaxations of Miller-Tucker-Zemlin formulations for asymmetric traveling salesman problems. Operations Research , (4), 656-669. [29] Öncan, T., Altınel, İ. K., & Laporte, G. (2009). A comparative analysis of several asymmetric traveling salesman problem formulations. Computers & Operations Research , (3), 637-654. [30] Weyl, H. (1934). Elementare theorie der konvexen polyeder. Commentarii Mathematici Helvetici , (1), 290-306.; Kuhn, H. W., & Tucker, A. W. (Eds.). (1950). Contributions to the Theory of Games (Vol. 1). Princeton University Press. [31] Wolsey, L. A. (1989). Strong formulations for mixed integer programming: a survey.

Mathematical Programming , (1), 173-191. [32] Wolsey, L. A., & Nemhauser, G. L. (2014). Integer and combinatorial optimization . John Wiley & Sons. [33] Ziegler, G. M. (2012).

Lectures on polytopes (Vol. 152). Springer Science & Business Media. Appendix 1

A generic and easy approach to show the validity of inequalities in Set 3 is to consider all the possibilities that binary variables on those inequalities can create. That is, if an inequality has k binary variables, we need to prove that the inequality is valid for all 𝑘 possibilities. For example, a proof for inequality (1) of Set 3 is as follows: Proof.

This constraint has two binary variables which leads to four possibilities. These four possible cases are analyzed in Table 5.

Table 5.

Analysis of proving Inequality (1) from Set 3.

Case 𝑧 𝑖𝑗 𝑧 𝑗𝑖 𝜋 𝑇 𝑥|𝑧 𝑖𝑗 , 𝑧 𝑗𝑖 min {𝜋 𝑇 𝑥|𝑧 𝑖𝑗 , 𝑧 𝑗𝑖 } max {𝜋 𝑇 𝑥|𝑧 𝑖𝑗 , 𝑧 𝑗𝑖 }

1 1 NA NA NA

1 0 ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 − 𝛽𝑥 𝑖𝑗

0 0

0 1 ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 − 𝛽𝑥 𝑖𝑗

0 0

0 0 ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 − 𝛽𝑥 𝑖𝑗 𝛽 𝑛−1 𝛽 In case 1, we have 𝑧 𝑖𝑗 = 𝑧 𝑗𝑖 = 1 , which is not applicable (NA) as TSP H (𝛽) is a TSP formulation and therefore its constraints never allow arcs (𝑖, 𝑗) and (𝑗, 𝑖) to appear simultaneously in a tour. In Case 2, we have 𝑧 𝑖𝑗 = 1 and 𝑧 𝑗𝑖 = 0 . Without loss of generality, suppose node 𝑖 is the 𝑘 th position in a tour, so 𝑥 𝑖𝑗 = 𝛽 𝑘−1 and according to the constraint (2) in TSP H (𝛽) we have ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 = 𝛽 𝑘 which results in ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 − 𝛽𝑥 𝑖𝑗 = 0 . In case 3, 𝑧 𝑖𝑗 =0 and constraints (4)-(6) of TSP H (𝛽) imply 𝑥 𝑖𝑗 = 0 . Also, 𝑧 𝑗𝑖 = 1 which implies 𝑥 𝑗𝑖 > 0 and 𝑥 𝑗𝑎 = 0, 𝑎 = 1, … , 𝑛, 𝑎 ≠ 𝑖, 𝑗 and therefore ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 = 0 . This leads to ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 − 𝛽𝑥 𝑖𝑗 = 0 . In case 4 , 𝑥 𝑖𝑗 = 0 as 𝑧 𝑖𝑗 = 0 and obviously 𝛽 𝑛−1 ≤ ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 ≤ 𝛽 so 𝛽 𝑛−1 ≤ ∑ 𝑥 𝑗𝑎𝑛𝑎=1𝑎≠𝑖,𝑗 −𝛽𝑥 𝑖𝑗 ≤ 𝛽 and this completes the proof. ■ Similarly, constraint (7) of Set 3 can be proved as follows:

Proof.

Note that this inequality is of the form 𝜋 𝑇 𝑥 ≥ 𝜋 . Hence, one way to prove the validity of this inequality is to show min {𝜋 𝑇 𝑥} ≥ 𝜋 . For doing so, we evaluated the inequality (7) based on the values its binary variables could take (𝜋 𝑇 𝑥|𝑧 , 𝑧 , 𝑧 𝑖1 , 𝑧 𝑗1 ) in column 5 of Table 6, and in column 6 of this table, we set the remaining decision variables so that the minimum for 𝜋 𝑇 𝑥 is achieved (min{𝜋 𝑇 𝑥} |𝑧 , 𝑧 , 𝑧 𝑖1 , 𝑧 𝑗1 ) . Table 6.

Analysis of proving Inequality (6) from Set 3.

Case 𝑧 𝑧 𝑧 𝑖1 𝑧 𝑗1 𝜋 𝑇 𝑥|𝑧 , 𝑧 , 𝑧 𝑖1 , 𝑧 𝑗1 min{𝜋 𝑇 𝑥|𝑧 , 𝑧 , 𝑧 𝑖1 , 𝑧 𝑗1 }

1 1 1 1 NA NA

1 1 1 0 NA NA

1 1 0 1 NA NA

1 1 0 0 NA NA

1 0 1 1 NA NA

1 0 1 0 NA NA

1 0 0 1 𝛽 𝑛−4 + 𝛽 𝑛−2 − 𝛽 𝑛−3 + 𝛽 𝑛−4 + ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝑛−4 + 𝛽 𝑛−2 − 𝛽 𝑛−3 + 𝛽

1 0 0 0 𝛽 𝑛−4 − 𝛽 + 𝑥 𝑖𝑗 + ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−4 + 𝛽 𝑛−2

0 1 1 1 NA NA

0 1 1 0 𝛽 𝑛−4 + 𝛽 𝑛−2 − 𝛽 𝑛−3 + 𝛽 𝑛−4 + ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝑛−4 + 𝛽 𝑛−2 − 𝛽 𝑛−3 + 𝛽

0 1 0 1 NA NA

0 1 0 0 𝛽 𝑛−4 − 𝛽 + 𝑥 𝑖𝑗 + ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−4 + 𝛽 𝑛−2

0 0 1 1 NA NA

0 0 1 0 𝛽 𝑛−2 − 𝛽 𝑛−3 + 𝛽 𝑛−4 + (𝑥 𝑖𝑗 + 𝑥 𝑗𝑖 ) +∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−2 + 𝛽 𝑛−4 + 𝛽 𝑛−1

0 0 0 1 𝛽 𝑛−2 − 𝛽 𝑛−3 + 𝛽 𝑛−4 + (𝑥 𝑖𝑗 + 𝑥 𝑗𝑖 ) +∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−2 + 𝛽 𝑛−4 + 𝛽 𝑛−1

0 0 0 0 (𝑥 𝑖𝑗 + 𝑥 𝑗𝑖 ) + ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−4 + 𝛽 𝑛−2 It can be seen by inspection that all values in column 6 are greater than (or equal to) 𝛽 𝑛−2 +𝛽 𝑛−4 for any 𝛽 ∈ (0,1) which shows that this inequality is valid in general form. ■ As a more complicated inequality, we prove the validity of inequality (10) of Set 3, below:

Proof.

For this inequality we have:

Table 7.

Analysis of proving Inequality (10) from Set 3.

Case 𝑧 𝑧 𝑧 𝑖1 𝑧 𝑗1 𝑧 𝑗𝑖 𝜋 𝑇 𝑥|𝑧 , 𝑧 , 𝑧 𝑖1 , 𝑧 𝑗1 , 𝑧 𝑗𝑖 min{𝜋 𝑇 𝑥|𝑧 , 𝑧 , 𝑧 𝑖1 , 𝑧 𝑗1 , 𝑧 𝑗𝑖 }

1 1 1 1 1 NA NA

1 1 1 1 0 NA NA

1 1 1 0 1 NA NA

1 1 1 0 0 NA NA

1 1 0 1 1 NA NA

1 1 0 1 0 NA NA

1 1 0 0 1 NA NA

1 1 0 0 0 NA NA

1 0 1 1 1 NA NA

1 0 1 1 0 NA NA

1 0 1 0 1 NA NA

1 0 1 0 0 NA NA

1 0 0 1 1 NA NA

1 0 0 1 0 𝛽 𝑛−3 − 𝛽 𝛽 + 1 + 𝛽 𝑛−1 + 𝛽 𝑛−3 𝛽 + 1 + 𝛽𝛽+1 ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝑛−3 +𝛽 𝑛−1 𝛽 + 1

1 0 0 0 1 NA NA

1 0 0 0 0 𝛽 𝑛−3 − 𝛽 𝛽 + 1 + 𝑥 𝑖𝑗 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−1 +𝛽 𝑛−2 +𝛽 𝑛−3 𝛽 + 1

0 1 1 1 1 NA NA

0 1 1 1 0 NA NA

0 1 1 0 1 NA NA

0 1 1 0 0 𝛽 𝑛−3 − 𝛽 + 𝛽 𝑛−1 𝛽 + 1 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−1 +𝛽 𝑛−2 +𝛽 𝑛−3 𝛽 + 1

0 1 0 1 1 NA NA

0 1 0 1 0 NA NA

0 1 0 0 1 𝛽 𝑛−3 − 𝛽 +𝛽 𝑛−3 ( 𝑎𝑛−2𝑎=3 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 ) + 𝑛−3 +2𝐵+∑ 𝛽 𝑎𝑛−5𝑎=2 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 𝑥 𝑗𝑖 + 𝛽𝛽+1 ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−3 − 𝛽 +𝛽 𝑛−3 ( 𝑎𝑛−2𝑎=3 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 ) + 𝑛−3 +2𝐵+∑ 𝛽 𝑎𝑛−5𝑎=2 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 𝛽 + 𝛽 𝛽+1

0 1 0 0 0 𝛽 𝑛−3 − 𝛽 + 𝛽𝛽+1 ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 +∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−1 +𝛽 𝑛−2 +𝛽 𝑛−3 𝛽 + 1

0 0 1 1 1 NA NA

0 0 1 1 0 NA NA

0 0 1 0 1 𝛽 𝑛−1 𝐵 + 1 +𝛽 𝑛−3 ( 𝑎𝑛−2𝑎=3 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 ) + 𝑛−3 +2𝛽+∑ 𝛽 𝑎𝑛−5𝑎=2 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 𝑥 𝑗𝑖 𝛽 𝑛−1 𝛽 + 1 + 𝛽 𝑛−3 ( 𝑎𝑛−2𝑎=3 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 ) + 𝑛−3 +2𝛽+∑ 𝛽 𝑎𝑛−5𝑎=2 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 𝛽 𝑛−2

0 0 1 0 0 𝛽 𝑛−1 𝛽 + 1 + ∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−1 +𝛽 𝑛−2 +𝛽 𝑛−3 𝛽 + 1

0 0 0 1 1 NA NA

0 0 0 1 0 𝛽 𝑛−1 + 𝛽 𝑛−3 𝛽 + 1 + 𝑥 𝑖𝑗 + 𝛽𝛽+1 ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−1 +𝛽 𝑛−2 +𝛽 𝑛−3 𝛽 + 1

0 0 0 0 1 𝛽 𝑛−3 ( 𝑎𝑛−2𝑎=3 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 ) + 𝑛−3 +2𝛽+∑ 𝛽 𝑎𝑛−5𝑎=2 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 𝑥 𝑗𝑖 + 𝛽𝛽+1 ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−3 ( 𝑎𝑛−2𝑎=3 𝑛−3 +2 ∑ 𝐵 𝑎𝑛−4𝑎=1 ) + 𝑛−3 +2𝛽+∑ 𝑣 𝑎𝑛−5𝑎=2 𝑛−3 +2 ∑ 𝛽 𝑎𝑛−4𝑎=1 𝛽 𝑛−3 + 𝛽 𝑛−1 𝛽+1

0 0 0 0 0 𝑥 𝑖𝑗 + 𝛽𝛽+1 ∑ 𝑥 𝑖𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 +∑ 𝑥 𝑗𝑎𝑛𝑎=2𝑎≠𝑖,𝑗 𝛽 𝑛−1 +𝛽 𝑛−2 +𝛽 𝑛−3 𝛽 + 1 As is shown in the last column of the above table, it is easy to see min{𝜋 𝑇 𝑥} |𝑧 , 𝑧 , 𝑧 𝑖1 , 𝑧 𝑗1 , 𝑧 𝑗𝑖 ≥ 𝛽 𝑛−1 + 𝛽 𝑛−2 + 𝛽 𝑛−3 𝛽 + 1 for all cases except cases 23, 27 and 31. So we further simplified the inequality for these three cases and obtained 𝛽 𝑛−1 ≥ 𝛽 , , and 𝛽 𝑛−2 + 1 + 𝛽 ≥ 𝛽 𝑛−3 + 𝛽 𝑛−4 + 𝛽 respectively. All these three inequalities are obviously correct for any 𝛽 ∈ (0,1)𝛽 ∈ (0,1)