[PDF] Fixpoint Theory -- Upside Down

Abstract

Knaster-Tarski's theorem, characterising the greatest fixpoint of a monotone function over a complete lattice as the largest post-fixpoint, naturally leads to the so-called coinduction proof principle for showing that some element is below the greatest fixpoint (e.g., for providing bisimilarity witnesses). The dual principle, used for showing that an element is above the least fixpoint, is related to inductive invariants. In this paper we provide proof rules which are similar in spirit but for showing that an element is above the greatest fixpoint or, dually, below the least fixpoint. The theory is developed for non-expansive monotone functions on suitable lattices of the form \mathbb{M}^Y, where Y is a finite set and \mathbb{M} an MV-algebra, and it is based on the construction of (finitary) approximations of the original functions. We show that our theory applies to a wide range of examples, including termination probabilities, behavioural distances for probabilistic automata and bisimilarity. Moreover, quite interestingly, it allows us to determine original algorithms for solving simple stochastic games.

Full PDF

aa r X i v : . [ c s . L O ] J a n Fixpoint Theory – Upside Down

Paolo Baldan , Richard Eggert , Barbara K¨onig , and Tommaso Padoan Universit`a di Padova, Italy Universit¨at Duisburg-Essen, Germany

Abstract.

Knaster-Tarski’s theorem, characterising the greatest ﬁxpointof a monotone function over a complete lattice as the largest post-ﬁxpoint, naturally leads to the so-called coinduction proof principle forshowing that some element is below the greatest ﬁxpoint (e.g., for provid-ing bisimilarity witnesses). The dual principle, used for showing that anelement is above the least ﬁxpoint, is related to inductive invariants. Inthis paper we provide proof rules which are similar in spirit but for show-ing that an element is above the greatest ﬁxpoint or, dually, below theleast ﬁxpoint. The theory is developed for non-expansive monotone func-tions on suitable lattices of the form M Y , where Y is a ﬁnite set and M an MV-algebra, and it is based on the construction of (ﬁnitary) approx-imations of the original functions. We show that our theory applies to awide range of examples, including termination probabilities, behaviouraldistances for probabilistic automata and bisimilarity. Moreover it allowsus to determine original algorithms for solving simple stochastic games. Fixpoints are ubiquitous in computer science as they allow to provide a meaningto inductive and coinductive deﬁnitions (see, e.g., [25,22]). A monotone function f : L → L over a complete lattice ( L, ⊑ ), by Knaster-Tarski’s theorem [27],admits a least ﬁxpoint µf and greatest ﬁxpoint νf which are characterised as theleast pre-ﬁxpoint and the greatest post-ﬁxpoint, respectively. This immediatelygives well-known proof principles for showing that a lattice element l ∈ L is below νf or above µf l ⊑ f ( l ) l ⊑ νf f ( l ) ⊑ lµf ⊑ l On the other hand, showing that a given element l is above νf or below µf is more diﬃcult. One can think of using the characterisation of least and largestﬁxpoints via Kleene’s iteration. E.g., the largest ﬁxpoint is the least elementof the (possibly transﬁnite) descending chain obtained by iterating f from ⊤ .Then showing that f i ( ⊤ ) ⊑ l for some i , one concludes that νf ⊑ l . This proofprinciple is related to the notion of ranking functions. However, this is a lesssatisfying notion of witness since f has to be applied i times, and this can beineﬃcient or unfeasible when i is an inﬁnite ordinal.The aim of this paper is to present an alternative proof rule for this purposefor functions over lattices of the form L = M Y where Y is a ﬁnite set and M s an MV-chain, i.e., a totally ordered complete lattice endowed with suitableoperations of sum and complement. This allows us to capture several exam-ples, ranging from ordinary relations, for dealing with bisimilarity, behaviouralmetrics, termination probabilities and simple stochastic games.Assume f : M Y → M Y monotone and consider the question of proving thatsome ﬁxpoint a : Y → M is the largest ﬁxpoint νf . The idea is to show thatthere is no “slack” or “wiggle room” in the ﬁxpoint a that would allow us tofurther increase it. This is done by associating with every a : Y → M a function f a on Y whose greatest ﬁxpoint gives us the elements of Y where we havea potential for increasing a by adding a constant. If no such potential exists,i.e. νf a is empty, we conclude that a is νf . A similar function f a (specifyingdecrease instead of increase) exists for the case of least ﬁxpoints. Note that thepremise is νf a = ∅ , i.e. the witness remains coinductive. The proof rules are: f ( a ) = a νf a = ∅ νf = a f ( a ) = a νf a = ∅ µf = a For applying the rule we compute a greatest ﬁxpoint on Y , which is ﬁnite,instead of working on the potentially inﬁnite M Y . The rule does not work forall monotone functions f : M Y → M Y , but we show that whenever f is non-expansive the rule is valid. Actually, it is not only sound, but also reversible, i.e.,if a = νf then νf a = ∅ , providing an if-and-only-if characterisation of whethera given ﬁxpoint corresponds to the greatest ﬁxpoint.Quite interestingly, under the same assumptions on f , using a restrictedfunction f ∗ a , the rule can be used, more generally, when a is just a pre-ﬁxpoint ( f ( a ) ⊑ a ) and it allows to conclude that νf ⊑ a . A dual result holds for post-ﬁxpoints in the case of least ﬁxpoints. f ( a ) ⊑ a νf ∗ a = ∅ νf ⊑ a a ⊑ f ( a ) νf a ∗ = ∅ a ⊑ µf As already mentioned, the theory above applies to many interesting scenarios:witnesses for non-bisimilarity, algorithms for simple stochastic games [10] andlower bounds for termination probabilities and behavioural metrics in the settingof probabilistic systems [1] and probabilistic automata [2]. In particular we wereinspired by, and generalise, the self-closed relations of Fu [15], also used in [2].

Motivating example.

Consider a Markov chain (

S, T, η ) with a ﬁnite set of states S , where T ⊆ S are the terminal states and every state s ∈ S \ T is associatedwith a probability distribution η ( s ) ∈ D ( S ). Intuitively, η ( s )( s ′ ) denotes theprobability of state s choosing s ′ as its successor. Assume that, given a ﬁxedstate s ∈ S , we want to determine the termination probability of s , i.e. theprobability of reaching any terminal state from s . As a concrete example, takethe Markov chain given in Fig. 1, where u is the only terminal state. D ( S ) is the set of all maps p : S → [0 ,

1] such that P s ∈ S p ( s ) = 1. : [0 , S → [0 , S T ( t )( s ) = ( v ∈ T P s ′ ∈ S η ( s )( s ′ ) · t ( s ′ ) otherwise x /1 u y z

13 1313 Fig. 1: Function T (left) and a Markov chain with two ﬁxpoints of T (right)The termination probability arises as the least ﬁxpoint of a function T deﬁnedas in Fig. 1. The values of µ T are indicated in green (left value).Now consider the function t assigning to each state the termination probabil-ity written in red (right value). It is not diﬃcult to see that t is another ﬁxpointof T , in which states y and z convince each other incorrectly that they terminatewith probability 1, resulting in a vicious cycle that gives “wrong” results. Wewant to show that µ T 6 = t without knowing µ T . Our idea is to compute the setof states that still has some “wiggle room”, i.e., those states which could reducetheir termination probability by δ if all their successors did the same. This def-inition has a coinductive ﬂavour and it can be computed as a greatest ﬁxpointon the ﬁnite powerset S of states, instead of on the inﬁnite lattice S [0 , .We hence consider a function T t : [ S ] t → [ S ] t , dependent on t , deﬁned asfollows. Let [ S ] t be the set of all states s where t ( s ) >

0, i.e., a reduction is inprinciple possible. Then a state s ∈ [ S ] t is in T t ( S ′ ) iﬀ s T and for all s ′ forwhich η ( s )( s ′ ) > s ′ ∈ S ′ , i.e. all successors of s are in S ′ .The greatest ﬁxpoint of T t is { y, z } . The fact that it is not empty means thatthere is some “wiggle room”, i.e., the value of t can be reduced on the elements { y, z } and thus t cannot be the least ﬁxpoint of f . Moreover, the intuition that t can be improved on { y, z } can be made precise, leading to the possibility ofperforming the improvement and search for the least ﬁxpoint from there. Contributions.

In the paper we formalise the theory outlined above, showingthat the proof rules work for non-expansive monotone functions f on lattices ofthe form M Y , where Y is a ﬁnite set and M a (potentially inﬁnite) MV-algebra( § § f we show how to obtainthe corresponding approximation compositionally ( § §

6) and simplestochastic games ( § In this section, we review some basic notions used in the paper, concerningcomplete lattices and MV-algebras [20].3 preordered or partially ordered set ( P, ⊑ ) is often denoted simply as P ,omitting the order relation. Given x, y ∈ P , with x ⊑ y , we denote by [ x, y ] theinterval { z ∈ P | x ⊑ z ⊑ y } . The join and the meet of a subset X ⊆ P (if theyexist) are denoted F X and d X , respectively.A complete lattice is a partially ordered set ( L, ⊑ ) such that each subset X ⊆ L admits a join F X and a meet d X . A complete lattice ( L, ⊑ ) always hasa least element ⊥ = F ∅ and a greatest element ⊤ = d ∅ .A function f : L → L is monotone if for all l, l ′ ∈ L , if l ⊑ l ′ then f ( l ) ⊑ f ( l ′ ). By Knaster-Tarski’s theorem [27, Thm. 1], any monotone function on acomplete lattice has a least and a greatest ﬁxpoint, denoted respectively µf and νf , characterised as the meet of all pre-ﬁxpoints respectively the join of allpost-ﬁxpoints: µf = d { l | f ( l ) ⊑ l } and νf = F { l | l ⊑ f ( l ) } .Let ( C, ⊑ ), ( A, ≤ ) be complete lattices. A Galois connection is a pair ofmonotone functions h α, γ i such that α : C → A , γ : A → C and for all a ∈ A and c ∈ C : α ( c ) ≤ a iﬀ c ⊑ γ ( a ).Equivalently, for all a ∈ A and c ∈ C , (i) c ⊑ γ ( α ( c )) and (ii) α ( γ ( a )) ≤ a . Inthis case we will write h α, γ i : C → A . For a Galois connection h α, γ i : C → A ,the function α is called the left (or lower) adjoint and γ the right (or upper)adjoint.Galois connections are at the heart of abstract interpretation [12,13]. In par-ticular, when h α, γ i is a Galois connection, given f C : C → C and f A : A → A ,monotone functions, if f C ◦ γ ⊑ γ ◦ f A , then νf C ⊑ γ ( νf A ). If equality holds,i.e., f C ◦ γ = γ ◦ f A , then greatest ﬁxpoints are preserved along the connection,i.e., νf C = γ ( νf A ).Given a set Y and a complete lattice L , the set of functions L Y = { f | f : Y → L } , endowed with pointwise order, i.e., for a, b ∈ L Y , a ⊑ b if a ( y ) ⊑ b ( y )for all y ∈ Y , is a complete lattice.In the paper we will mostly work with lattices of the kind M Y where M is aspecial kind of lattice with a rich algebraic structure, i.e. an MV-algebra [20]. Deﬁnition 2.1 (MV-algebra). An MV-algebra is a tuple M = ( M, ⊕ , , ( · )) where ( M, ⊕ , is a commutative monoid and ( · ) : M → M maps each elementto its complement , such that for all x, y ∈ M x = x x ⊕ ( x ⊕ y ) ⊕ y = ( y ⊕ x ) ⊕ x .We denote , multiplication x ⊗ y = x ⊕ y and subtraction x ⊖ y = x ⊗ y . MV-algebras are endowed with a natural order.

Deﬁnition 2.2 (natural order).

Let M = ( M, ⊕ , , ( · )) be an MV-algebra.The natural order on M is deﬁned, for x, y ∈ M , by x ⊑ y if x ⊕ z = y for some z ∈ M . When ⊑ is total M is called an MV-chain . ⊥ = 0, ⊤ = 1, x ⊔ y = ( x ⊖ y ) ⊕ y and x ⊓ y = x ⊔ y = x ⊗ ( x ⊕ y ). We call theMV-algebra complete , if it is a complete lattice, which is not true in general,e.g., ([0 , ∩ Q , ≤ ). Example 2.3.

A prototypical example of an MV-algebra is ([0 , , ⊕ , , ( · )) where x ⊕ y = min { x + y, } and x = 1 − x for x, y ∈ [0 , . This means that x ⊗ y = max { x + y − , } and x ⊖ y = max { , x − y } (truncated subtraction).The operators ⊕ and ⊗ are also known as strong disjunction and conjunction in Lukasiewicz logic [21]. The natural order is ≤ (less or equal) on the reals.Another example is ( { , . . . , k } , ⊕ , , ( · )) where n ⊕ m = min { n + m, k } and n = k − n for n, m ∈ { , . . . , k } . We are in particular interested in the case k = 1 . Both MV-algebras are complete and MV-chains.Boolean algebras (with disjunction and complement) also form MV-algebrasthat are complete, but in general not MV-chains. MV-algebras are the algebraic semantics of Lukasiewicz logic. They can beshown to correspond to intervals of the kind [0 , u ] in suitable groups, i.e., abelianlattice-ordered groups with a strong unit u [20]. As mentioned in the introduction, our interest is for ﬁxpoints of monotone func-tions f : M Y → M Y , where M is an MV-chain and Y is a ﬁnite set. We willsee that for non-expansive functions we can over-approximate the sets of pointsin which a given a ∈ M Y can be increased in a way that is preserved by theapplication of f . This will be the core of the proof rules outlined earlier. Non-expansive functions on MV-algebras.

For deﬁning non-expansiveness it isconvenient to introduce a norm.

Deﬁnition 3.1 (norm).

Let M be an MV-chain and let Y be a ﬁnite set. Given a ∈ M Y we deﬁne its norm as || a || = max { a ( y ) | y ∈ Y } . Given a ﬁnite set Y we extend ⊕ and ⊗ to M Y pointwise. E.g. if a, b ∈ M Y ,we write a ⊕ b for the function deﬁned by ( a ⊕ b )( y ) = a ( y ) ⊕ b ( y ) for all y ∈ Y .Given Y ′ ⊆ Y and δ ∈ M , we write δ Y ′ for the function deﬁned by δ Y ′ ( y ) = δ if y ∈ Y ′ and δ Y ′ ( y ) = 0, otherwise. Whenever this does not generate confusion,we write δ instead of δ Y . It can be seen that ||·|| has the properties of a norm,i.e., for all a, b ∈ M Y and δ ∈ M , it holds that (1) || a ⊕ b || ⊑ || a || ⊕ || b || , (2) || δ ⊗ a || = δ ⊗ || a || and and || a || = 0 implies that a is the constant 0 (see Lem. B.1in the appendix). Moreover, it is clearly monotonic, i.e., if a ⊑ b then || a || ⊑ || b || .We next introduce non-expansiveness. Despite the fact that we will ﬁnally beinterested in endo-functions f : M Y → M Y , in order to allow for a compositionalreasoning we work with functions where domain and codomain can be diﬀerent.5 eﬁnition 3.2 (non-expansiveness). Let f : M Y → M Z be a function, where M is an MV-chain and Y, Z are ﬁnite sets. We say that it is non-expansive iffor all a, b ∈ M Y it holds || f ( b ) ⊖ f ( a ) || ⊑ || b ⊖ a || . Note that ( a, b )

7→ || a ⊖ b || is the supremum lifting of a directed version ofChang’s distance [20]. It is easy to see that all non-expansive functions on MV-chains are monotone and in M = { , } such notions coincide. Approximating the propagation of increases.

Let f : M Y → M Z be a monotonefunction and take a, b ∈ M Y with a ⊑ b . We are interested in the diﬀerence b ( y ) ⊖ a ( y ) for some y ∈ Y and on how the application of f “propagates” thisincrease. The reason is that, understanding that no increase can be propagatedwill be crucial to establish when a ﬁxpoint of a non-expansive function f isactually the largest one, and, more generally, when a (pre-)ﬁxpoint of f is abovethe largest ﬁxpoint.In order to formalise the above intuition, we rely on tools from abstract inter-pretation. In particular, the following pair of functions, which, under a suitablecondition, form a Galois connection, will play a major role. The left adjoint α a,δ takes as input a set Y ′ and, for y ∈ Y ′ , it increases the values a ( y ) by δ , whilethe right adjoint γ a,δ takes as input a function b ∈ M Y , b ∈ [ a, a ⊕ δ ] and checksfor which parameters y ∈ Y the value b ( y ) exceeds a ( y ) by δ .We also deﬁne [ Y ] a , the subset of elements in Y where a ( y ) is not 1 and thusthere is a potential to increase, and δ a , which gives us the minimal such increase. Deﬁnition 3.3 (functions to sets, and vice versa).

Let M be an MV-algebraand let Y be a ﬁnite set. Deﬁne the set [ Y ] a = { y ∈ Y | a ( y ) = 1 } (support of a )and δ a = min { a ( y ) | y ∈ [ Y ] a } with min ∅ = 1 .For ⊏ δ ∈ M we consider the functions α a,δ : [ Y ] a → [ a, a ⊕ δ ] and γ a,δ : [ a, a ⊕ δ ] → [ Y ] a , deﬁned, for Y ′ ∈ [ Y ] a and b ∈ [ a, a ⊕ δ ] , by α a,δ ( Y ′ ) = a ⊕ δ Y ′ γ a,δ ( b ) = { y ∈ [ Y ] a | b ( y ) ⊖ a ( y ) ⊒ δ } . When δ is suﬃciently small, the pair h α a,δ , γ a,δ i is a Galois connection. [ Y ] a [ a, a ⊕ δ ] α a,δ γ a,δ Lemma 3.4 (Galois connection).

Let M be anMV-algebra and Y be a ﬁnite set. For = δ ⊑ δ a ,the pair h α a,δ , γ a,δ i : [ Y ] a → [ a, a ⊕ δ ] is a Galoisconnection. Whenever f is non-expansive, it is easy to see that it restricts to a function f : [ a, a ⊕ δ ] → [ f ( a ) , f ( a ) ⊕ δ ] for all δ ∈ M .As mentioned before, a crucial result shows that for all non-expansive func-tions, under the assumption that Y, Z are ﬁnite and the order on M is total,we can suitably approximate the propagation of increases. In order to state thisresult, a useful tool is a notion of approximation of a function. Deﬁnition 3.5 ( ( δ, a ) -approximation). Let M be an MV-chain, let Y , Z beﬁnite sets and let f : M Y → M Z be a non-expansive function. For a ∈ M Y andany δ ∈ M we deﬁne f a,δ : [ Y ] a → [ Z ] f ( a ) as f a,δ = γ f ( a ) ,δ ◦ f ◦ α a,δ . Y ′ ⊆ [ Y ] a , its image f a,δ ( Y ′ ) ⊆ [ Z ] f ( a ) is the set of points z ∈ [ Z ] f ( a ) such that δ ⊑ f ( a ⊕ δ Y ′ )( z ) ⊖ f ( a )( z ), i.e., the points to which f propagates anincrease of the function a with value δ on the subset Y ′ .We ﬁrst show that f a,δ is antitone in the parameter δ , a non-trivial result. Lemma 3.6 (anti-monotonicity).

Let M be a MV-chain, let Y , Z be ﬁnitesets, let f : M Y → M Z be a non-expansive function and let a ∈ M Y . For θ, δ ∈ M , if θ ⊑ δ then f a,δ ⊆ f a,θ . Since f a,δ increases when δ decreases and there are ﬁnitely many such func-tions, there must be a value ι fa such that all functions f a,δ for 0 ⊏ δ ⊑ ι fa areequal. This function is denoted by f a and is called the a -approximation of f .We next show that indeed, for all non-expansive functions, the a -approximationproperly approximates the propagation of increases. Theorem 3.7 (approximation of non-expansive functions).

Let M bea complete MV-chain, let Y, Z be ﬁnite sets and let f : M Y → M Z be a non-expansive function. Then there exists ι fa ∈ M , the largest value below or equal to δ a such that f a,δ = f a,δ ′ for all ⊏ δ, δ ′ ⊑ ι fa .We denote this function by f a and call it the a -approximation of f . Then for all ⊏ δ ∈ M :a. γ f ( a ) ,δ ◦ f ⊆ f a ◦ γ a,δ b. for δ ⊑ δ a : δ ⊑ ι fa iﬀ γ f ( a ) ,δ ◦ f = f a ◦ γ a,δ [ a, a ⊕ δ ] f (cid:15) (cid:15) γ a,δ / / ⊑ [ Y ] a f a (cid:15) (cid:15) [ f ( a ) , f ( a ) ⊕ δ ] γ f ( a ) ,δ / / [ Z ] f ( a ) Note that if Y = Z and a is a ﬁxpoint of f , i.e., a = f ( a ), condition (a) abovecorresponds exactly to soundness in the sense of abstract interpretation [12],while condition (b) corresponds to ( γ -)completeness (see also § In this section we formalise the proof technique outlined in the introduction forshowing that a ﬁxpoint is the largest and, more generally, for checking over-approximations of greatest ﬁxpoints of non-expansive functions.Consider a monotone function f : M Y → M Y for some ﬁnite set Y . We ﬁrstfocus on the problem of establishing whether some given ﬁxpoint a of f coincideswith νf (without explicitly knowing νf ), and, in case it does not, ﬁnding an“improvement”, i.e., a post-ﬁxpoint of f , larger than a . Observe that when a is a ﬁxpoint, [ Y ] a = [ Y ] f ( a ) and thus the a -approximation of f (Thm. 3.7) is anendofunction f a : [ Y ] a → [ Y ] a . We have the following result, which relies on thefact that due to Thm. 3.7 γ a,δ preserves ﬁxpoints (of f and f a ). Theorem 4.1 (soundness and completeness for ﬁxpoints).

Let M bea complete MV-chain, Y a ﬁnite set and f : M Y → M Y be a non-expansivefunction. Let a ∈ M Y be a ﬁxpoint of f . Then νf a = ∅ if and only if a = νf . a is a ﬁxpoint, but not yet the largest ﬁxpoint of f , we can increaseit and obtain a post-ﬁxpoint. Lemma 4.2.

Let M be a complete MV-chain, f : M Y → M Y a non-expansivefunction, a ∈ M a ﬁxpoint of f , and let f a be the corresponding a -approximationand ι fa as in Thm. 3.7. Then α a,ι fa ( νf a ) = a ⊕ ( ι fa ) νf a is a post-ﬁxpoint of f . Using these results one can perform an alternative ﬁxpoint iteration where weiterate to the largest ﬁxpoint from below: start with a post-ﬁxpoint a ⊑ f ( a )(which is clearly below νf ) and obtain, by (possibly transﬁnite) iteration, anascending chain that converges to a , the least ﬁxpoint above a . Now checkwith Thm. 4.1 whether Y ′ = νf a = ∅ . If yes, we have reached νf = a . If not, α a,ι fa ( Y ′ ) = a ⊕ ( ι fa ) Y ′ is again a post-ﬁxpoint (cf. Lem. 4.2) and we continuethis procedure until – for some ordinal – we reach the largest ﬁxpoint νf , forwhich we have νf νf = ∅ .Interestingly, the soundness result in Thm. 4.1 can be generalised to the casein which a is a pre-ﬁxpoint instead of a ﬁxpoint. In this case, the a -approximationfor a function f : M Y → M Y is a function f a : [ Y ] a → [ Y ] f ( a ) where domain andcodomain are diﬀerent, hence it would not be meaningful to look for ﬁxpoints.However, as explained below, it can be restricted to an endofunction. Theorem 4.3 (soundness for pre-ﬁxpoints).

Let M be a complete MV-chain, Y a ﬁnite set and f : M Y → M Y be a non-expansive function. Givena pre-ﬁxpoint a ∈ M Y of f , let [ Y ] a = f ( a ) = { y ∈ [ Y ] a | a ( y ) = f ( a )( y ) } . Letus deﬁne f ∗ a : [ Y ] a = f ( a ) → [ Y ] a = f ( a ) as f ∗ a ( Y ′ ) = f a ( Y ′ ) ∩ [ Y ] a = f ( a ) , where f a : [ Y ] a → [ Y ] f ( a ) is the a -approximation of f . If νf ∗ a = ∅ then νf ⊑ a . Roughly, the intuition for the above result is the following: the value of f ( a )on some y might or might not depend “circularly” on the value of a on y itself.In a purely inductive setting, without such circular dependencies, µf = νf andhence a being a pre-ﬁxpoint means that we over-approximate νf . However, wemight have vicious cycles, as explained in the introduction, that destroy theover-approximation since the values are too low. Now, since we restrict to non-expansive functions, it must be the case that there is a cycle, such that allelements on this cycle are points where a and f ( a ) coincide. It is hence suﬃcientto check whether a given pre-ﬁxpoint could be increased on its subpart whichcorresponds to a ﬁxpoint, i.e., the idea is to restrict to [ Y ] a = f ( a ) . We detect suchsituations by looking for “wiggle room” as for ﬁxpoints.Completeness does not generalise to pre-ﬁxpoints, i.e., it is not true that if a is a pre-ﬁxpoint of f and νf ⊑ a then νf ∗ a = ∅ . A pre-ﬁxpoint might contain slackeven though it is above the greatest ﬁxpoint. A counterexample is in Ex. 6.11. The dual view for least ﬁxpoints.

The theory developed so far can be easilydualised to check under-approximations of least ﬁxpoints. Given a complete MV-algebra M = ( M, ⊕ , , ( · )) and a monotone function f : M Y → M Y , in order toshow that a post-ﬁxpoint a ∈ M Y satisﬁes a ⊑ µf , we can in fact simply workin the dual MV-algebra, M op = ( M, ⊒ , ⊗ , ( · ) , ⊖ and the original order.8 [ Y ] a [ a ⊖ θ, a ] α a,θ γ a,θ We next outline the dualised setting (for details onhow it arises see Appendix C.1). The notation for thedual case is obtained from that of the original (primal)case, exchanging subscripts and superscripts.Given a ∈ M Y , deﬁne [ Y ] a = { y ∈ Y | a ( y ) = 0 } and δ a = min { a ( y ) | y ∈ [ Y ] a } . For θ ∈ M , we consider the pair of functions h α a,θ , γ a,θ i : [ Y ] a → [ a ⊖ θ, a ]where, for Y ′ ∈ [ Y ] a , we let α a,θ ( Y ′ ) = a ⊖ θ Y ′ and, for b ∈ [ a ⊖ θ, a ], we let γ a,θ ( b ) = { y ∈ Y | a ( y ) ⊖ b ( y ) ⊒ θ } .A function f : M Y → M Z is non-expansive in the dual MV-algebra when itis in the primal one. Its approximation in the sense of Thm. 3.7 is denoted f a .Then the dualisations of Thm. 4.1 and 4.3 hold, i.e., if a is a ﬁxpoint of f , then νf a = ∅ iﬀ µf = a , and whenever a is a post-ﬁxpoint, νf a ∗ = ∅ implies a ⊑ µf . Given a non-expansive function f and a (pre/post-)ﬁxpoint a , it is often non-trivial to determine the corresponding approximations. However, non-expansivefunctions enjoy good closure properties (closure under composition, and closureunder disjoint union) and we will see that the same holds for the correspondingapproximations. Furthermore it turns out that the functions needed in the ap-plications can be obtained from just a few templates. This gives us a toolbox forassembling approximations with relative ease. Theorem 5.1.

All basic functions listed in Table 1 are non-expansive. Further-more non-expansive functions are closed under composition and disjoint union.The approximations are the ones listed in the third column of the table.

We start by making the example from the introduction ( §

1) more formal. Con-sider a Markov chain (

S, T, η ), as deﬁned in the introduction (Fig. 1), where werestrict the codomain of η : S \ T → D ( S ) to D ⊆ D ( S ), where D is ﬁnite (toensure that all involved sets are ﬁnite). Furthermore let T : [0 , S → [0 , S bethe function from the introduction whose least ﬁxpoint µ T assigns to each stateits termination probability. Lemma 6.1.

The function T can be written as T = ( η ∗ ◦ av D ) ⊎ c k where k : T → [0 , is the constant function deﬁned only on terminal states. From this representation and Thm. 5.1 it is obvious that T is non-expansive.9able 1: Basic functions f : M Y → M Z (constant, reindexing, minimum, maxi-mum, average), function composition, disjoint union and the corresponding ap-proximations f a : [ Y ] a → [ Z ] f ( a ) , f a : [ Y ] a → [ Z ] f ( a ) . Notation: R − ( z ) = { y ∈ Y | y R z } , supp ( p ) = { y ∈ Y | p ( y ) > } for p ∈ D ( Y ), Min a = { y ∈ Y | a ( y ) minimal } , Max a = { y ∈ Y | a ( y ) maximal } , a : Y → M function f deﬁnition of f f a ( Y ′ ) (above), f a ( Y ′ ) (below) c k f ( a ) = k ∅ ( k ∈ M Z ) ∅ u ∗ f ( a ) = a ◦ u u − ( Y ′ )( u : Z → Y ) u − ( Y ′ )min R f ( a )( z ) = min y R z a ( y ) { z ∈ [ Z ] f ( a ) | Min a | R− z ) ⊆ Y ′ } ( R ⊆ Y × Z ) { z ∈ [ Z ] f ( a ) | Min a | R− z ) ∩ Y ′ = ∅} max R f ( a )( z ) = max y R z a ( y ) { z ∈ [ Z ] f ( a ) | Max a | R− z ) ∩ Y ′ = ∅} ( R ⊆ Y × Z ) { z ∈ [ Z ] f ( a ) | Max a | R− z ) ⊆ Y ′ } av D ( M = [0 , f ( a )( p ) = P y ∈ Y p ( y ) · a ( y ) { p ∈ [ D ] f ( a ) | supp ( p ) ⊆ Y ′ } Z = D ⊆ D ( Y )) { p ∈ [ D ] f ( a ) | supp ( p ) ⊆ Y ′ } h ◦ g f ( a ) = h ( g ( a )) h g ( a ) ◦ g a ( Y ′ )( g : M Y → M W , h g ( a ) ◦ g a ( Y ′ ) h : M W → M Z ) U i ∈ I f i I ﬁnite f ( a )( z ) = f i ( a | Y i )( z ) U i ∈ I ( f i ) a | Yi ( Y ′ ∩ Y i )( f i : M Y i → M Z i , ( z ∈ Z i ) U i ∈ I ( f i ) a | Yi ( Y ′ ∩ Y i ) Y = S i ∈ I Y i , Z = U i ∈ I Z i ) Lemma 6.2.

Let t : S → [0 , . The approximation for T in the dual sense is T t : [ S ] t → [ S ] T ( t ) with T t ( S ′ ) = { s ∈ [ S ] T ( t ) | s / ∈ T ∧ supp ( η ( s )) ⊆ S ′ } . It is well-known that the function T can be tweaked in such a way that it hasa unique ﬁxpoint, coinciding with µ T , by determining all states which cannotreach a terminal state and setting their value to zero [3]. Hence ﬁxpoint iterationfrom above does not bring us any added value here. It does however make senseto use the proof rule in order to guarantee lower bounds via post-ﬁxpoints.Furthermore, termination probability is a special case of the considerablymore complex stochastic games that will be studied in §

7, where the trick ofmodifying the function is not applicable.10 .2 Behavioural metrics for probabilistic automata

Before we start discussing probabilistic automata, we ﬁrst consider the Hausdorﬀand the Kantorovich lifting and the corresponding approximations.

Hausdorﬀ lifting.

Given a metric on a set X , the Hausdorﬀ metric is obtainedby lifting the original metric to X . Here we deﬁne this for general distancefunctions on M , not restricting to metrics. In particular the Hausdorﬀ lifting isgiven by a function H : M X × X → M X × X where H ( d )( X , X ) = max { max x ∈ X min x ∈ X d ( x , x ) , max x ∈ X min x ∈ X d ( x , x ) } . An alternative characterisation due to M´emoli [19], also in [4], is more convenientfor our purposes. If we let u : X × X → X × X with u ( C ) = ( π [ C ] , π [ C ]),where π , π are the projections π i : X × X → X and π i [ C ] = { π i ( c ) | c ∈ C } .Then H ( d )( X , X ) = min { max ( x ,x ) ∈ C d ( x , x ) | C ⊆ X × X ∧ u ( C ) =( X , X ) } . Relying on this, we can obtain the result below, from which we deducethat H is non-expansive and construct its approximation as the composition ofthe corresponding functions from Table 1. Lemma 6.3. H = min u ◦ max ∈ where max ∈ : M X × X → M X × X ( ∈ ⊆ ( X × X ) × X × X is the “is-element-of ”-relation on X × X ), min u : M X × X → M X × X .Kantorovich lifting. The Kantorovich (also known as Wasserstein) lifting con-verts a metric on X to a metric on probability distributions over X . As for theHausdorﬀ lifting, we lift distance functions that are not necessarily metrics.Furthermore, in order to ensure ﬁniteness of all the sets involved, we re-strict to D ⊆ D ( X ), some ﬁnite set of probability distributions over X . A coupling of p, q ∈ D is a probability distribution c ∈ D ( X × X ) whose leftand right marginals are p, q , i.e., p ( x ) = m Lc ( x ) := P x ∈ X c ( x , x ) and q ( x ) = m Rc ( x ) := P x ∈ X c ( x , x ). The set of all couplings of p, q , denotedby Ω ( p, q ), forms a polytope with ﬁnitely many vertices [23]. The set of all poly-tope vertices that are obtained by coupling any p, q ∈ D is also ﬁnite and isdenoted by VP D ⊆ D ( X × X ).The Kantorovich lifting is given by K : [0 , X × X → [0 , D × D where K ( d )( p, q ) = min c ∈ Ω ( p,q ) X ( x ,x ) ∈ X × X c ( x , x ) · d ( x , x ) . The coupling c can be interpreted as the optimal transport plan to move goodsfrom suppliers to customers [29]. Again there is an alternative characterisation,which shows non-expansiveness of K : Lemma 6.4.

Let u : VP D → D × D , u ( c ) = ( m Lc , m Rc ) . Then K = min u ◦ av VP D where av VP D : [0 , X × X → [0 , VP D , min u : [0 , VP D → [0 , D × D . robabilistic automata. We now compare our approach with [2], which describesthe ﬁrst method for computing behavioural distances for probabilistic automata.Although the behavioural distance arises as a least ﬁxpoint, it is in fact better,even the only known method, to iterate from above, in order to reach this leastﬁxpoint. This is done by guessing and improving couplings, similar to strategyiteration discussed later in §

7. A major complication, faced in [2], is that theprocedure can get stuck at a ﬁxpoint which is not the least and one has todetermine that this is the case and decrease the current candidate. In fact thispaper was our inspiration to generalise this technique to a more general setting.A probabilistic automaton is a tuple A = ( S, L, η, ℓ ), where S is a non-emptyﬁnite set of states, L is a ﬁnite set of labels, η : S → D ( S ) assigns ﬁnite sets ofprobability distributions to states and ℓ : S → L is a labelling function. (In thefollowing we again replace D ( S ) by a ﬁnite subset D .)The probabilistic bisimilarity pseudometrics is the least ﬁxpoint of the func-tion M : [0 , S × S → [0 , S × S where for d : S × S → [0 , s, t ∈ S : M ( d )( s, t ) = ( ℓ ( s ) = ℓ ( t ) H ( K ( d ))( η ( s ) , η ( t )) otherwisewhere H is the Hausdorﬀ lifting (for M = [0 , K is the Kantorovich liftingdeﬁned earlier. Now assume that d is a ﬁxpoint of M , i.e., d = M ( d ). In orderto check whether d = µf , [2] adapts the notion of a self-closed relation from [15]. Deﬁnition 6.5 ([2]).

A relation M ⊆ S × S is self-closed wrt. d = M ( d ) if,whenever s M t , then – ℓ ( s ) = ℓ ( t ) and d ( s, t ) > , – if p ∈ η ( s ) and d ( s, t ) = min q ′ ∈ η ( t ) K ( d )( p, q ′ ) , then there exists q ∈ η ( t ) and c ∈ Ω ( p, q ) such that d ( s, t ) = P u,v ∈ S d ( u, v ) · c ( u, v ) and supp ( c ) ⊆ M , – if q ∈ η ( t ) and d ( s, t ) = min p ′ ∈ η ( s ) K ( d )( p ′ , q ) , then there exists p ∈ η ( s ) and c ∈ Ω ( p, q ) such that d ( s, t ) = P u,v ∈ S d ( u, v ) · c ( u, v ) and supp ( c ) ⊆ M . The largest self-closed relation, denoted by ≈ d is empty if and only if d = µf [2]. We now investigate the relation between self-closed relations and post-ﬁxpoints of approximations. For this we will ﬁrst show that M can be composedfrom non-expansive functions, which proves that it is indeed non-expansive. Fur-thermore, this decomposition will help in the comparison. Lemma 6.6.

The ﬁxpoint function M characterizing probabilistic bisimilaritypseudometrics can be written as: M = max ρ ◦ ((( η × η ) ∗ ◦ H ◦ K ) ⊎ c l ) where ρ : ( S × S ) ⊎ ( S × S ) → ( S × S ) with ρ (( s, t ) , i ) = ( s, t ) . Furthermore l : S × S → [0 , is deﬁned as l ( s, t ) = 0 if ℓ ( s ) = ℓ ( t ) and l ( s, t ) = 1 if ℓ ( s ) = ℓ ( t ) . Here we use i ∈ { , } as indices to distinguish the elements in the disjoint union. M is a composition of non-expansive functions and thus non-expansiveitself. We do not spell out M d explicitly, but instead show how it is related toself-closed relations. Proposition 6.7.

Let d : S × S → [0 , where d = M ( d ) . Then M d : [ S × S ] d → [ S × S ] d , where [ S × S ] d = { ( s, t ) ∈ S × S | d ( s, t ) > } .Then M is a self-closed relation wrt. d if and only if M ⊆ [ S × S ] d and M is a post-ﬁxpoint of M d . In order to deﬁne standard bisimilarity we use a variant G of the Hausdorﬀ lifting H from § G .Now we can deﬁne the ﬁxpoint function for bisimilarity and its correspondingapproximation. For simplicity we consider unlabelled transition systems, but itwould be straightforward to handle labelled transitions.Let X be a ﬁnite set of states and η : X → X a function that assigns a setof successors η ( x ) to a state x ∈ X . For the ﬁxpoint function for bisimilarity B : { , } X × X → { , } X × X we use the Hausdorﬀ lifting G with M = { , } . Lemma 6.8.

Bisimilarity on η is the greatest ﬁxpoint of B = ( η × η ) ∗ ◦ G . Since we are interested in the greatest ﬁxpoint, we are working in the primalsense. Bisimulation relations are represented by their characteristic functions d : X × X → { , } , in fact the corresponding relation can be obtained by takingthe complement of [ X × X ] d = { ( x , x ) ∈ X × X | d ( x , x ) = 0 } . Lemma 6.9.

Let d : X × X → { , } . The approximation for the bisimilarityfunction B in the primal sense is B d : [ X × X ] d → [ X × X ] B ( d ) with B d ( R ) = { ( x , x ) ∈ [ X × X ] B ( d ) |∀ y ∈ η ( x ) ∃ y ∈ η ( x ) (cid:0) ( y , y ) [ X × X ] d ∨ ( y , y ) ∈ R ) (cid:1) ∧∀ y ∈ η ( x ) ∃ y ∈ η ( x ) (cid:0) ( y , y ) [ X × X ] d ∨ ( y , y ) ∈ R ) (cid:9) We conclude this section by discussing how this view on bisimilarity canbe useful: ﬁrst, it again opens up the possibility to compute bisimilarity – agreatest ﬁxpoint – by iterating from below, through smaller ﬁxpoints. This couldpotentially be useful if it is easy to compute the least ﬁxpoint of B inductivelyand continue from there.Furthermore, we obtain a technique for witnessing non-bisimilarity of states.While this can also be done by exhibiting a distinguishing modal formula [16,8]or by a winning strategy for the spoiler in the bisimulation game [26], to ourknowledge there is no known method that does this directly, based on the deﬁ-nition of bisimilarity.With our technique however, we can witness non-bisimilarity of two states x , x ∈ X by presenting a pre-ﬁxpoint d (i.e., B ( d ) ≤ d ) such that d ( x , x ) = 013equivalent to ( x , x ) ∈ [ X × X ] d ) and ν B d = ∅ , since this implies ν B ( x , x ) ≤ d ( x , x ) = 0 by our proof rule.There are two issues to discuss: ﬁrst, how can we characterise a pre-ﬁxpointof B (which is quite unusual, since bisimulations are post-ﬁxpoints)? In fact, thecondition B ( d ) ≤ d can be rewritten to: for all ( x , x ) ∈ [ X × X ] d there exists y ∈ η ( x ) such that for all y ∈ η ( x ) we have ( y , y ) ∈ [ X × X ] d ( or viceversa). Second, at ﬁrst sight it does not seem as if we gained anything since westill have to do a ﬁxpoint computation on relations. However, the carrier set is[ X × X ] d , i.e., a set of non-bisimilarity witnesses and this set can be small eventhough X might be large. Example 6.10.

We consider the transition system depicted below.Our aim is to construct a witness showing that x, u are not bisimilar. This witness is a function d : X × X → { , } with d ( x, u ) = 0 = d ( y, u ) and for all other pairs the value is . x y u Hence [ X × X ] d = B ( d ) = [ X × X ] d = { ( x, u ) , ( y, u ) } and it is easy to checkthat d is a pre-ﬁxpoint of B and that ν B ∗ d = ∅ : we iterate over { ( x, u ) , ( y, u ) } and ﬁrst remove ( y, u ) (since y has no successors) and then ( x, u ) . This impliesthat ν B ≤ d and hence ν B ( x, u ) = 0 , which means that x, u are not bisimilar. Example 6.11.

We modify Ex. 6.10 and consider a function d where d ( x, u ) =0 and all other values are . Again d is a pre-ﬁxpoint of B and ν B ≤ d (sinceonly reﬂexive pairs are in the bisimilarity). However ν B ∗ d = ∅ , since { ( x, u ) } is apost-ﬁxpoint. This is a counterexample to completeness discussed after Thm. 4.3.Intuively speaking, the states y, u over-approximate and claim that they arebisimilar, although they are not. (This is permissible for a pre-ﬁxpoint.) Thistricks x, u into thinking that there is some wiggle room and that one can increasethe value of ( x, u ) . This is true, but only because of the limited, local view, sincethe “true” value of ( y, u ) is . Introduction to simple stochastic games.

In this section we show how our tech-niques can be applied to simple stochastic games [10,9]. A simple stochastic gameis a state-based two-player game where the two players, Min and Max, each owna subset of states they control, for which they can choose the successor. The sys-tem also contains sink states with an assigned payoﬀ and averaging states whichrandomly choose their successor based on a given probability distribution. Thegoal of Min is to minimise and the goal of Max to maximise the payoﬀ.Simple stochastic games are an important type of games that subsume paritygames and the computation of behavioural distances for probabilistic automata(cf. § ?) is known to lie in NP ∩ coNP , but it is an open question whether it is contained in P . There areknown randomised subexponential algorithms [6].14t has been shown that it is suﬃcient to consider positional strategies, i.e.,strategies where the choice of the player is only dependent on the current state.The expected payoﬀs for each state form a so-called value vector and can beobtained as the least solution of a ﬁxpoint equation (see below).A simple stochastic game is given by a ﬁnite set V of nodes, partitioned into MIN , MAX , AV (average) and SINK , and the following data: η min : MIN → V , η max : MAX → V (successor functions for Min and Max nodes), η av : AV → D (probability distributions, where D ⊆ D ( V ) ﬁnite) and w : SINK → [0 , V : [0 , V → [0 , V is deﬁned below for a : V → [0 , v ∈ V : V ( a )( v ) =  min v ′ ∈ η min ( v ) a ( v ′ ) v ∈ MIN max v ′ ∈ η max ( v ) a ( v ′ ) v ∈ MAX P v ′ ∈ V η av ( v )( v ′ ) · a ( v ′ ) v ∈ AV w ( v ) v ∈ SINK

The least ﬁxpoint of V speciﬁes the average payoﬀ for all nodes when Min andMax play optimally. In an inﬁnite game the payoﬀ is 0. In order to avoid inﬁnitegames and guarantee uniqueness of the ﬁxpoint, many authors [17,9,28] restrictto stopping games, which are guaranteed to terminate for every pair of Min/Max-strategies. Here we deal with general games where more than one ﬁxpoint mayexist. Such a scenario has been studied in [18], which considers value iterationto under- and over-approximate the value vector. The over-approximation faceschallenges with cyclic dependencies, similar to the vicious cycles described ear-lier. Here we focus on strategy iteration, which is usually less eﬃcient than valueiteration, but yields a precise result instead of approximating it. Example 7.1.

We consider the game depicted below. Here min is a Min nodewith η min (min) = { , av } , max is a Max node with η max (max) = { ε , av } , is asink node with payoﬀ 1, ε is a sink node with some small payoﬀ ε ∈ (0 , and av is an average node which transitions to both min and max with probability .Min should choose av as successor since a payoﬀ of is bad for Min. Giventhis choice of Min, Max should not declare av as successor since this would createan inﬁnite play and hence the payoﬀ is . Therefore Max has to choose ε and becontent with a payoﬀ of ε , which is achieved from all nodes diﬀerent from . min av ε max In order to be able to determine the approximation of V and to apply ourtechniques, we consider the following equivalent deﬁnition. Lemma 7.2. V = ( η ∗ min ◦ min ∈ ) ⊎ ( η ∗ max ◦ max ∈ ) ⊎ ( η ∗ av ◦ av D ) ⊎ c w , where ∈ ⊆ V × V is the “is-element-of ”-relation on V .

15s a composition of non-expansive functions, V is non-expansive as well. Sincewe are interested in the least ﬁxpoint we work in the dual sense and obtain thefollowing approximation, which intuitively says: we can decrease a value at node v by a constant only if, in the case of a Min node, we decrease the value of onesuccessor where the minimum is reached, in the case of a Max node, we decreasethe values of all successors where the maximum is reached, and in the case of anaverage node, we decrease the values of all successors. Lemma 7.3.

Let a : V → [0 , . The approximation for the value iteration func-tion V in the dual sense is V a : [ V ] a → [ V ] V ( a ) with V a ( V ′ ) = { v ∈ [ V ] V ( a ) | (cid:0) v ∈ MIN ∧ Min a | η min( v ) ∩ V ′ = ∅ (cid:1) ∨ (cid:0) v ∈ MAX ∧ Max a | η max( v ) ⊆ V ′ (cid:1) ∨ (cid:0) v ∈ AV ∧ supp ( η av ( v )) ⊆ V ′ (cid:1) } Strategy iteration from above and below.

We describe two algorithms based onstrategy iteration, ﬁrst introduced by Hoﬀman and Karp in [17], that are novel,as far as we know. The ﬁrst iterates to the least ﬁxpoint from above and usesthe techniques described in § τ : MIN → V such that τ ( v ) ∈ η min ( v ) for every v ∈ MIN . With such a strategy, Mindecides to always leave a node v via τ ( v ). Analogously σ : MAX → V ﬁxesa Max-strategy. Fixing a strategy for either player induces a modiﬁed valuefunction. If τ is a Min-strategy, we obtain V τ which is deﬁned exactly as V butfor v ∈ MIN where we set V τ ( a )( v ) = a ( τ ( v )). Analogously, for σ a Max-strategy, V σ is obtained by setting V σ ( a )( v ) = a ( σ ( v )) when v ∈ MAX . If both playersﬁx their strategies, the game reduces to a Markov chain.In order to describe our algorithms we also need the notion of a switch .Assume that τ is a Min-strategy and let a be a (pre-)ﬁxpoint of V τ . Min can nowpotentially improve her strategy for nodes v ∈ MIN where min v ′ ∈ η min ( v ) a ( v ′ )

1. Guess a Min-strategy τ (0) , i := 02. a ( i ) := µ V τ ( i ) τ ( i +1) := sw min ( τ ( i ) , a ( i ) )4. If τ ( i +1) = τ ( i ) i := i + 1 then goto 2.5. Compute V ′ = ν V a , where a = a ( i ) .6. If V ′ = ∅ then stop and return a ( i ) .Otherwise set a ( i +1) := a − ( ι a V ) V ′ , τ ( i +2) := sw min ( τ ( i ) , a ( i +1) ), i := i +2,goto 2.(a) Strategy iteration from above Determine µ V (from below)

1. Guess a Max-strategy σ (0) , i := 02. a ( i ) := µ V σ ( i ) σ ( i +1) := sw max ( σ ( i ) , a ( i ) )4. If σ ( i +1) = σ ( i ) set i := i +1and goto 2. Otherwise stopand return a ( i ) .(b) Strategy iteration from below Fig. 2: Strategy iteration from above and belowcan compute a ( i ) = µ V τ ( i ) eﬃciently by solving a linear program (cf. Lem. 7.4)by adapting [10]. Second, the chain of the a ( i ) decreases, which means that thealgorithm will eventually terminate (cf. Thm. 7.5).Strategy iteration from below is given in Figure 2b. At ﬁrst sight, the algo-rithm looks simpler than strategy iteration from above, since we do not haveto check whether we have already reached ν V , reduce and continue from there.However, in this case the computation of µ V σ ( i ) via a linear program is moreinvolved (cf. Lem. 7.4), since we have to pre-compute (via greatest ﬁxpoint it-eration over V ) the nodes where Min can force a cycle based on the currentstrategy of Max, thus obtaining payoﬀ 0.This algorithm does not directly use our technique but we can use our proofrules to prove the correctness of the algorithm (Thm. 7.5). In particular, theproof that the sequence a ( i ) increases is quite involved: we have to show that a ( i ) = µ V σ ( i ) ≤ µ V σ ( i +1) = a ( i +1) . This could be done by showing that µ V σ ( i +1) is a pre-ﬁxpoint of V σ ( i ) , but there is no straightforward way to do this. On theother hand, we prove this, using our proof rules, by showing that a ( i ) is belowthe least ﬁxpoint of V σ ( i +1) .The algorithm generalises strategy iteration by Hoﬀman and Karp [17]. Notethat we cannot simply adapt their proof, since we do not assume that the gameis stopping, which is a crucial ingredient. In the case where the game is stop-ping, the two algorithms coincide, meaning that we also provide an alternativecorrectness proof in this situation, while other correctness proof [10] are basedon linear algebra and inverse matrices. Lemma 7.4.

The least ﬁxpoints of V τ and V σ can be determined by solvinglinear programs. Theorem 7.5.

Strategy iteration from above and below both terminate and com-pute the least ﬁxpoint of V . xample 7.6. Ex. 7.1 is well suited to explain our two algorithms.Starting with strategy iteration from above, we may guess τ (0) (min) = . Inthis case, Max would choose av as successor and we would reach a ﬁxpoint, whereeach node except for ε is associated with a payoﬀ of . Next, our algorithm woulddetect the vicious cycle formed by min , av and max . We can reduce the valuesin this vicious cycle and reach the correct payoﬀ values for each node.For strategy iteration from below assume that σ (0) (max) = av . Given thisstrategy of Max, Min can force the play to stay in a cycle formed by min , av and max . Thus, the payoﬀ achieved by the Max strategy σ (0) and an optimal play byMin would be for each of these nodes. In the next iteration Max switches andchooses ε as successor, i.e. σ (1) (max) = ε , which results in the correct values. We implemented strategy iteration from above and below and classical Kleeneiteration in MATLAB. In Kleene iteration we terminate with a tolerance of10 − , i.e., we stop if the change from one iteration to the next is below thisbound. We tested the algorithms on random stochastic games and found thatKleene iteration is always the fastest, but only converges and it is known thatthe rate of convergence can be exponentially slow [9]. Strategy iteration frombelow is usually slightly faster than strategy iteration from above, but there aresituations – for instance when the weights of sinks are restricted to 0 , It is well-known that several computations in the context of system veriﬁcationcan be performed by various forms of ﬁxpoint iteration and it is worthwhile tostudy such methods at a high level of abstraction, typically in the setting ofcomplete lattices and monotone functions. Going beyond the classical resultsby Tarski [27], combination of ﬁxpoint iteration with approximations [13,5] andwith up-to techniques [24] has proven to be successful. Here we treated a morespeciﬁc setting, where the carrier set consists of functions from a ﬁnite set into anMV-chain and the ﬁxpoint functions are non-expansive (and hence monotone),and introduced a novel technique to obtain upper bounds for greatest and lowerbounds for least ﬁxpoints, including associated algorithms. Such techniques arewidely applicable to a wide range of examples and so far they have been studiedonly in quite speciﬁc scenarios, such as in [2,15,18].In the future we plan to lift some of the restrictions of our approach. First, anextension to an inﬁnite domain Y would of course be desirable, but since severalof our results currently depend on ﬁniteness, such a generalisation does not seemto be easy. Another restriction, to total orders, seems easier to lift: in particular,if the partially ordered MV-algebra ¯ M is of the form M I where I is a ﬁniteindex set and M an MV-chain. (E.g., ﬁnite Boolean algebras are of this type.)Then our function space is ¯ M Y = (cid:0) M I (cid:1) Y ∼ = M Y × I and we have reduced to thesetting presented in this paper. This will allow us to handle featured transition18ystems [11] where transitions are equipped with boolean formulas. We also planto determine the largest possible increase that can be added to a ﬁxpoint thatis not yet the greatest ﬁxpoint in order to maximally speed up ﬁxpoint iterationfrom below (this might be larger than ι fa ).There are several other application examples that did not ﬁt into this paper,but that can also be handled by our approach: for instance behavioural distancesfor metric transition systems [14] and other types of systems [4]. We also planto investigate other types of games, such as energy games [7]. While here we in-troduced strategy iteration techniques for simple stochastic games, we also wantto check whether we can provide an improvement to value iteration techniques,combining our approach with [18].We also plan to study whether some examples can be handled with othertypes of Galois connections: here we used an additive variant, but looking atmultiplicative variants (multiplication by a constant factor) might also be fruit-ful. Acknowledgements:

We are grateful to Ichiro Hasuo for making us aware ofstochastic games as application domain. Furthermore we would like to thankMatthias Kuntz and Timo Matt for their help with experiments.

References

1. Giorgio Bacci, Giovanni Bacci, Kim G. Larsen, and Radu Mardare. On-the-ﬂyexact computation of bisimilarity distances.

Logical Methods in Computer Science ,13(2:13):1–25, 2017.2. Giorgio Bacci, Giovanni Bacci, Kim G. Larsen, Radu Mardare, Qiyi Tang, andFranck van Breugel. Computing probabilistic bisimilarity distances for probabilisticautomata. In

Proc. of CONCUR ’19 , volume 140 of

LIPIcs , pages 9:1–9:17. SchlossDagstuhl – Leibniz Center for Informatics, 2019.3. Christel Baier and Joost-Pieter Katoen.

Principles of Model Checking . MIT Press,2008.4. Paolo Baldan, Filippo Bonchi, Henning Kerstan, and Barbara K¨onig. Coalgebraicbehavioral metrics.

Logical Methods in Computer Science , 14(3), 2018. SelectedPapers of the 6th Conference on Algebra and Coalgebra in Computer Science(CALCO 2015).5. Paolo Baldan, Barbara K¨onig, and Tommaso Padoan. Abstraction, up-to tech-niques and games for systems of ﬁxpoint equations. In

Proc. of CONCUR ’20 ,volume 171 of

LIPIcs , pages 25:1–25:20. Schloss Dagstuhl – Leibniz Center forInformatics, 2020.6. Henrik Bj¨orklund and Sergei Vorobyov. Combinatorial structure and random-ized subexponential algorithms for inﬁnite games.

Theoretical Computer Science ,349(3):347–360, 2005.7. Lubos Brim, Jakub Chaloupka, Laurent Doyen, Raﬀaella Gentilini, and Jean-Fran cois Raskin. Faster algorithms for mean-payoﬀ games.

Formal Methods inSystem Design , 38(2):97–118, 2011.8. Rance Cleaveland. On automatically explaining bisimulation inequivalence. In

Proc. of CAV ’90 , pages 364–372. Springer, 1990. LNCS 531. . Anne Condon. On algorithms for simple stochastic games. In Advances In Compu-tational Complexity Theory , volume 13 of

DIMACS Series in Discrete Mathematicsand Theoretical Computer Science , pages 51–71, 1990.10. Anne Condon. The complexity of stochastic games.

Information and Computation ,96(2):203–224, 1992.11. Maxime Cordy, Andreas Classen, Gilles Perrouin, Pierre-Yves Schobbens, PatrickHeymans, and Axel Legay. Simulation-based abstractions for software product-line model checking. In

Proc. of ICSE ’12 (International Conference on SoftwareEngineering) , pages 672–682. IEEE, 2012.12. Patrick Cousot and Radhia Cousot. Abstract interpretation: A uniﬁed latticemodel for static analysis of programs by construction or approximation of ﬁxpoints.In

Proc. of POPL ’77 (Los Angeles, California) , pages 238–252. ACM, 1977.13. Patrick Cousot and Radhia Cousot. Temporal abstract interpretation. In Mark N.Wegman and Thomas W. Reps, editors,

Proc. of POPL ’00 , pages 12–25. ACM,2000.14. Luca de Alfaro, Marco Faella, and Mari¨elle Stoelinga. Linear and branching systemmetrics.

IEEE Transactions on Software Engineering , 35(2):258–273, 2009.15. Hongfei Fu. Computing game metrics on Markov decision processes. In

Proc. ofICALP ’12, Part II , pages 227–238. Springer, 2012. LNCS 7392.16. Matthew Hennessy and Robin Milner. Algebraic laws for nondeterminism andconcurrency.

Journal of the ACM , 32:137–161, 1985.17. Richard M. Karp and Alan J. Hoﬀman. On nonterminating stochastic games.

Management Science , 12(5):359–370, 1966.18. Edon Kelmendi, Julia Kr¨amer, Jan Kˇret´ınsk´y, and Maximilian Weininger. Valueiteration for simple stochastic games: Stopping criterion and learning algorithm.In

Proc. of CAV ’18 , pages 623–642. Springer, 2018. LNCS 10981.19. Facundo M´emoli. Gromov-Wasserstein distances and the metric approach to objectmatching.

Foundations of Computational Mathematics , 11(4):417–487, 2011.20. Daniele Mundici. MV-algebras. A short tutorial. Available at .21. Daniele Mundici.

Advanced Lukasiewicz calculus and MV-algebras , volume 35 of

Trends in Logic . Springer, 2011.22. Flemming Nielson, Hanne R. Nielson, and Chris Hankin.

Principles of ProgramAnalysis . Springer, 2010.23. Gabriel Peyr´e and Marco Cuturi. Computational optimal transport, 2020.arXiv:1803.00567.24. Damien Pous. Complete lattices and up-to techniques. In

Proc. of APLAS ’07 ,pages 351–366. Springer, 2007. LNCS 4807.25. Davide Sangiorgi.

Introduction to Bisimulation and Coinduction . Cambridge Uni-versity Press, 2011.26. Colin Stirling. Bisimulation, model checking and other games. Notes for Mathﬁtinstructional meeting on games and computation, Edinburgh, June 1997.27. Alfred Tarski. A lattice-theoretical theorem and its applications.

Paciﬁc Journalof Mathematics , 5:285–309, 1955.28. Rahul Tripathi, Elena Valkanova, and V.S. Anil Kumar. On strategy improvementalgorithms for simple stochastic games.

Journal of Discrete Algorithms , 9:263–278,2011.29. C´edric Villani.

Optimal Transport – Old and New , volume 338 of

A Series ofComprehensive Studies in Mathematics . Springer, 2009. Proofs and Additional Material for § Observe that by using the derived operations, axioms (2) and (3) in Def. 2.1 canbe written as2. x ⊕ y ⊖ x ) ⊕ x = ( x ⊖ y ) ⊕ y We next review some properties of MV-algebras. They are taken from or easyconsequences of properties in [20].

Lemma A.1 (properties of MV-algebras).

Let M = ( M, ⊕ , , ( · )) be anMV-algebra. For all x, y, z ∈ M x ⊕ x = 1 x ⊑ y iﬀ x ⊕ y = 1 iﬀ x ⊗ y = 0 iﬀ y = x ⊕ ( y ⊖ x ) x ⊑ y iﬀ y ⊑ x ⊕ , ⊗ are monotone in both arguments, ⊖ monotone in the ﬁrst and antitonein the second argument.5. if x ⊏ y then ⊏ y ⊖ x ;6. ( x ⊕ y ) ⊖ y ⊑ x z ⊑ x ⊕ y if and only if z ⊖ x ⊑ y .8. if x ⊏ y and z ⊑ y then x ⊕ z ⊏ y ⊕ z ;9. y ⊑ x if and only if ( x ⊕ y ) ⊖ y = x ;10. x ⊖ ( x ⊖ y ) ⊑ y and if y ⊑ x then x ⊖ ( x ⊖ y ) = y .11. Whenever M is an MV-chain, x ⊏ y and ⊏ z imply ( x ⊕ z ) ⊖ y ⊏ z Proof.

The proof of properties (1), (2), (3), (4) can be found directly in [20]. Forthe rest:5. Immediate consequence of (2). In fact, given x, y ∈ M , if we had y ⊖ x = 0then by (2), y = x ⊕ ( y ⊖ x ) = x ⊕ x .6. Observe that ( x ⊕ y ) ⊖ y = ( x ⊕ y ) ⊕ y = ( x ⊖ y ) ⊕ y = ( y ⊖ x ) ⊕ x ⊑ x = x ,where the last inequality is motivated by the fact that x ⊑ ( y ⊖ x ) ⊕ x andpoint (3).7. The direction from left to right is an immediate consequence of (6). In fact,if z ⊑ x ⊕ y then z ⊖ x ⊑ ( x ⊕ y ) ⊖ x ⊑ y .The other direction goes as follows: if z ⊖ x ⊑ y , then – by monotonicity(4) – ( z ⊖ x ) ⊕ x ⊑ y ⊕ x = x ⊕ y . The left hand side can be rewritten to( x ⊖ z ) ⊕ z ⊒ z .8. Assume that x ⊏ y and z ⊑ y . We know, by property (4) that x ⊕ z ⊑ y ⊕ z .Assume by contradiction that x ⊕ z = y ⊕ z . Then we have x ⊑ [by properties (3) and (6)] ⊑ ( x ⊕ z ) ⊖ z [since x ⊕ z = y ⊕ z ] ⊑ ( y ⊕ z ) ⊖ z [deﬁnition of ⊖ ]21 ( y ⊖ z ) ⊕ z [since z ⊑ y and property (2)]= y And with point (3) this is a contradiction.9. Assume y ⊑ x . We know ( x ⊕ y ) ⊖ y ⊑ x . If it were ( x ⊕ y ) ⊖ y ⊏ x , then(( x ⊕ y ) ⊖ y ) ⊕ y ⊏ x ⊕ y , with (8). Since the left-hand side is equal to( y ⊖ ( x ⊕ y )) ⊕ ( x ⊕ y ) ⊒ x ⊕ y , this is a contradiction.For the other direction assume that ( x ⊕ y ) ⊖ y = x . Hence we have x =( x ⊕ y ) ⊖ y = ( x ⊕ y ) ⊕ y . By complementing on both sides we obtain x =( x ⊕ y ) ⊕ y which implies that y ⊑ x .10. Observe that, by (7), we have y ⊑ x ⊕ ( y ⊖ x ) = x ⊕ ( x ⊖ y ) = x ⊖ ( x ⊖ y ).Therefore, by (3), x ⊖ ( x ⊖ y ) ⊑ y , as desired.For the second part, assume if y ⊑ x and thus, by (3), x ⊑ y . Using (2), weobtain y = x ⊕ ( y ⊖ x ) = x ⊕ y ⊕ x = x ⊕ ( x ⊖ y ). Hence y = x ⊕ ( x ⊖ y ) = x ⊖ ( x ⊖ y ).11. We ﬁrst observe that x ⊑ y ⊕ ( x ⊖ y ). This is a direct consequence of axiom (3)of MV-algebras and the deﬁnition of natural order.Second, in an MV-chain if x, y ⊐

0, then x ⊖ y ⊏ x . In fact, if x ⊑ y andthus x ⊖ y = 0 ⊏ x . If instead, y ⊏ x we have 0 ⊏ y and x ⊖ y ⊑ ⊖ y = y ,hence by Lem. A.1(8) it holds that 0 ⊕ ( x ⊖ y ) ⊏ y ⊕ ( x ⊖ y ). Recalling that y ⊏ x and thus by Lem. A.1(2), ( x ⊖ y ) ⊕ y = x , we conclude x ⊖ y ⊏ x .Now( x ⊕ z ) ⊖ y ⊑ ( x ⊕ ( z ⊖ ( y ⊖ x )) ⊕ ( y ⊖ x )) ⊖ y [by ﬁrst obs. above]= ( y ⊕ ( z ⊖ ( y ⊖ x )) ⊖ y [since x ⊑ y , by Lem. A.1(2)] ⊑ z ⊖ ( y ⊖ x ) [by Lem. A.1(6)] ⊏ z [by second obs. above, since z ⊐ y ⊖ x ⊐ a ⊕ b ⊖ c should beread as ( a ⊕ b ) ⊖ c and not a ⊕ ( b ⊖ c ), which is in general diﬀerent. B Proofs and Additional Material for § Lemma B.1 (properties of the norm).

Let M be an MV-chain and let Y bea ﬁnite set. Then ||·|| : M Y → M satisﬁes, for all a, b ∈ M Y , δ ∈ M || a ⊕ b || ⊑ || a || ⊕ || b || ,2. || δ ⊗ a || = δ ⊗ || a || and3. || a || = 0 implies that a is the constant . roof. Concerning (1), let || a ⊕ b || be realised on some element y ∈ Y , i.e., || a ⊕ b || = a ( y ) ⊕ b ( y ). Since a ( y ) ⊑ || a || and b ( y ) ⊑ || b || , by monotonicity of ⊕ we deduce that || a ⊕ b || ⊑ || a || ⊕ || b || .Concerning (2), note that || δ ⊗ a || = max { δ ⊕ a ( y ) | y ∈ Y } = min { δ ⊕ a ( y ) | y ∈ Y } = δ ⊕ min { a ( y ) | y ∈ Y } = δ ⊕ max { a ( y ) | y ∈ Y } = δ ⊕ || a || = δ ⊗ || a || Finally, point (3) is straightforward, since 0 is the bottom of M . Lemma B.2 (non-expansiveness implies monotonicity).

Let M is an MV-chain and let Y, Z be ﬁnite sets. Every non-expansive function f : M Y → M Z ismonotone.Proof. Let a, b ∈ M Y be such that a ⊑ b . Therefore, by Lem. A.1(2), a ( y ) ⊖ b ( y ) =0 for all y ∈ Y , hence a ⊖ b = 0. Thus || f ( a ) ⊖ f ( b ) || ⊑ || a ⊖ b || = 0. In turn thisimplies that for all z ∈ Z , f ( a )( z ) ⊖ f ( b )( z ) = 0. Hence Lem. A.1(2), allows usto conclude f ( a )( z ) ⊑ f ( b )( z ) for all z ∈ Z , i.e., f ( a ) ⊑ f ( b ), as desired.The next lemma provides an equivalent characterisation of non-expansivenesswhich will be useful in the sequel. Lemma B.3 (characterisation of non-expansiveness).

Let f : M Y → M Z be a monotone function, where M is an MV-chain and Y, Z are ﬁnite sets. Then f is non-expansive iﬀ for all a ∈ M Y , θ ∈ M and z ∈ Z it holds f ( a ⊕ θ )( z ) ⊖ f ( a )( z ) ⊑ θ .Proof. Let f be non-expansive and let a ∈ M Y and θ ∈ M . We have that for all z ∈ Zf ( a ⊕ θ )( z ) ⊖ f ( a )( z ) ⊑⊑ || f ( a ⊕ θ ) ⊖ f ( a ) || [by deﬁnition of norm] ⊑ || ( a ⊕ θ ) ⊖ a || [by hypothesis] ⊑ || λy.θ || [by Lem. A.1(6) and monotonicity of norm]= θ [by deﬁnition of norm]Conversely, assume that for all a ∈ M Y , θ ∈ M and z ∈ Z it holds f ( a ⊕ θ )( z ) ⊖ f ( a )( z ) ⊑ θ . For a, b ∈ M Y , ﬁrst observe that for all y ∈ Y it holds b ( y ) ⊖ a ( y ) ⊑ || b ⊖ a || , hence, if we let θ = || b ⊖ a || , we have b ⊑ a ⊕ θ and thus,by monotonicity, f ( b ) ⊖ f ( a ) ⊑ f ( a ⊕ θ ) ⊖ f ( a ). Thus23 | f ( b ) ⊖ f ( a ) || ⊑⊑ || f ( a + θ ) ⊖ f ( a ) || =[by the observation above and monotonicity of norm]= max { f ( a + θ )( z ) ⊖ f ( a )( z ) | z ∈ Z } [by deﬁnition of norm] ⊑ θ [by hypothesis]= || b ⊖ a || [by the choice of θ ]It is immediate to see that non-expansive functions compose. Lemma B.4 (composing non-expansive functions).

Let M be an MV-chain and let Y, W, Z be ﬁnite sets. If g : M Y → M W and h : M W → M Z arenon-expansive then h ◦ g : M Y → M Z is non-expansive.Proof. Straightforward. We have for any a, b ∈ M Y that || h ( g ( b )) ⊖ h ( g ( a )) || ⊑⊑ || g ( b ) ⊖ g ( a ) || [by non-expansiveness of h ] ⊑ || b ⊖ a || [by non-expansiveness of g ] Lemma B.5 (well-deﬁnedness).

The functions α a,δ , γ a,δ from Def. 3.3 arewell-deﬁned and monotone.Proof. The involved functions α a,δ and γ a,δ are well-deﬁned. In fact, for Y ′ ⊆ [ Y ] a , clearly α a,δ = a ⊕ δ Y ′ ∈ [ a, a ⊕ δ ]. Moreover, for b ∈ [ a, a ⊕ δ ] we have γ a,δ ( b ) ⊆ [ Y ] a . In fact, if y [ Y ] a then a ( y ) = 1, hence b ( y ) = 1 and thus b ( y ) ⊖ a ( y ) = 0 δ , and thus y γ a,δ ( b ). Moreover, they are clearly monotone. Lemma 3.4 (Galois connection).

Let M be an MV-algebra and Y be a ﬁ-nite set. For = δ ⊑ δ a , the pair h α a,δ , γ a,δ i : [ Y ] a → [ a, a ⊕ δ ] is a Galoisconnection.Proof. For all Y ′ ∈ [ Y ] a it holds γ a,δ ( α a,δ ( Y ′ )) = γ a,δ ( a ⊕ δ Y ′ ) = Y ′ .In fact, for all y ∈ Y ′ , ( a ⊕ δ Y ′ )( y ) = a ( y ) ⊕ δ . Moreover, and by the choiceof δ and deﬁnition of [ Y ] a , we have δ ⊑ δ a ⊑ a ( y ), by Lem. A.1(9), we have( a ⊕ δ Y ′ )( y ) ⊖ a ( y ) = δ hence y ∈ γ a,δ ( α a,δ ( Y ′ )). Conversely, if y Y ′ , then( a ⊕ δ Y ′ )( y ) = a ( y ), and thus ( a ⊕ δ Y ′ )( y ) ⊖ a ( y ) = 0 δ .Moreover, for all b ∈ [ a, a ⊕ δ ] we have α a,δ ( γ a,δ ( b )) = a ⊕ δ γ a,δ ( b ) ⊑ b

24n fact, for all y ∈ Y , if y ∈ γ a,δ ( b ), i.e., δ ⊑ b ( y ) ⊖ a ( y ) then ( a ⊕ δ γ a,δ ( b ) )( y ) = a ( y ) ⊕ δ ⊑ a ( y ) ⊕ ( b ( y ) ⊖ a ( y )) = b ( y ), by Lem. A.1(2). If instead, y γ a,δ ( b ),then ( a ⊕ δ γ a,δ ( b ) ( b ))( y ) = a ( y ) ⊑ b ( y ). Lemma B.6 (restricting non-expansive functions to intervals).

Let M be an MV-chain, let Y, Z be ﬁnite sets f : M Y → M Z be a non-expansive func-tion. Then f restricts to a function f a,δ : [ a, a ⊕ δ ] → [ f ( a ) , f ( a ) ⊕ δ ] , deﬁned by f a,δ ( b ) = f ( b ) .Proof. Given b ∈ [ a, a ⊕ δ ], by monotonicity of f we have that f ( a ) ⊑ f ( b ).Moreover, f ( b ) ⊑ f ( a ⊕ δ ) ⊑ f ( a ) ⊕ δ , where the last passage is motivated byLem. B.3.In the following we will simply write f instead of f a,δ .Given an MV-chain M and a ﬁnite set Y , we ﬁrst observe that each function b ∈ M Y can be expressed as a suitable sum of functions of the shape δ Y ′ . Lemma B.7 (standard form).

Let M be an MV-chain and let Y be a ﬁniteset. Then for any b ∈ M Y there are Y , . . . , Y n ⊆ Y with Y i +1 ⊆ Y i for i ∈{ , . . . , n − } and δ i ∈ M , = δ i ⊑ L i − j =1 δ j for i ∈ { , . . . , n } such that b = L ni =1 δ iY i and || b || = L ni =1 δ i .where we assume that an empty sum evaluates to .Proof. Given b ∈ M Y , consider V = { b ( y ) | y ∈ Y } . If V is empty, then Y isempty and thus b = 1 Y , i.e., we can take n = 1, δ = 1 and Y = Y . Otherwise,if Y = ∅ , then V is a ﬁnite non-empty set. Let V = { v , . . . , v n } , with v i ⊑ v i +1 for i ∈ { , . . . , n − } . For i ∈ { , . . . , n } deﬁne Y i = { y ∈ Y | v i ⊑ b ( y ) } .Clearly, Y ⊇ Y ⊇ . . . ⊇ Y n . Moreover let δ = v and δ i +1 = v i +1 ⊖ v i for i ∈ { , . . . , n − } .Observe that for each i , we have v i = L ij =1 δ i , as it can easily shown byinduction. Hence δ i +1 = v i +1 ⊖ v i = v i +1 ⊖ L ij =1 δ i ⊑ ⊖ L ij =1 δ i = L ij =1 δ i .We now show that b = L ni =1 δ iY i by induction on n . – If n = 1 then V = { v } and thus b is a constant function b ( y ) = v for all y ∈ Y . Hence Y = Y and thus b = δ Y = δ Y , as desired. – If n >

1, let b ′ ∈ M Y deﬁned by b ′ ( y ) = b ( y ) for y ∈ Y \ Y n and b ′ ( y ) = v n − for y ∈ Y n . Note that { b ′ ( y ) | y ∈ Y } = { v , . . . , v n − } . Hence, by inductivehypothesis, b ′ = L n − i =1 δ iY i . Moreover, b ′ ( y ) = b ⊕ δ nY n , and thus we conclude.Finally observe that the statement requires δ i = 0 for all i . We can enjoythis property by just omitting the ﬁrst summand when v = 0. Lemma 3.6 (anti-monotonicity).

Let M be a MV-chain, let Y , Z be ﬁnitesets, let f : M Y → M Z be a non-expansive function and let a ∈ M Y . For θ, δ ∈ M , if θ ⊑ δ then f a,δ ⊆ f a,θ . roof. Let Y ′ ⊆ [ Y ] a and let us prove that f a,δ ( Y ′ ) ⊆ f a,θ ( Y ′ ). Take z ∈ f a,δ ( Y ′ ). This means that δ ⊑ f ( a ⊕ δ Y ′ )( z ) ⊖ f ( a )( z ).We have δ ⊑ f ( a ⊕ δ Y ′ )( z ) ⊖ f ( a )( z )[by hypothesis]= f ( a ⊕ θ Y ′ ⊕ ( δ ⊖ θ ) Y ′ )( z ) ⊖ f ( a )( z )= f ( a ⊕ θ Y ′ ⊕ ( δ ⊖ θ ) Y ′ )( z ) ⊖ f ( a ⊕ θ Y ′ )( z ) ⊕ f ( a ⊕ θ Y ′ )( z ) ⊖ f ( a )( z ) ⊑ || f ( a ⊕ θ Y ′ ⊕ ( δ ⊖ θ ) Y ′ ) ⊖ f ( a ⊕ θ Y ′ ) || ⊕ f ( a ⊕ θ Y ′ )( z ) ⊖ f ( a )( z )[by deﬁnition of norm and monotonicity of ⊕ ] ⊑ || a ⊕ θ Y ′ ⊕ ( δ ⊖ θ ) Y ′ ⊖ ( a ⊕ θ Y ′ ) || ⊕ f ( a ⊕ θ Y ′ )( z ) ⊖ f ( a )( z )[by non-expansiveness of f and monotonicity of ⊕ ] ⊑ || ( δ ⊖ θ ) Y ′ || ⊕ f ( a ⊕ θ Y ′ )( z ) ⊖ f ( a )( z ) ⊑ ( δ ⊖ θ ) ⊕ f ( a ⊕ θ Y ′ )( z ) ⊖ f ( a )( z )[by deﬁnition of norm]If we subtract δ ⊖ θ on both sides, we get δ ⊖ ( δ ⊖ θ ) ⊑ f ( a ⊕ θ Y ′ )( z ) ⊖ f ( a )( z ),and, as above, since, by Lem. A.1(10), δ ⊖ ( δ ⊖ θ ) = θ we conclude θ ⊑ f ( a ⊕ θ Y ′ )( z ) ⊖ f ( a )( z )which means z ∈ f a,θ ( Y ′ ). Lemma B.8 (largest increase for a point).

Let M be a complete MV-chain,let Y , Z be ﬁnite sets, let f : M Y → M Z be a non-expansive function and ﬁx a ∈ M Y . For all z ∈ [ Z ] f ( a ) and Y ′ ⊆ [ Y ] a the set { θ ⊑ δ a | z ∈ f a,θ ( Y ′ ) } has amaximum, that we denote by ι fa ( Y ′ , z ) .Proof. Let V = { θ ⊑ δ a | z ∈ f a,θ ( Y ′ ) } . Expanding the deﬁnition we have that V = { θ ⊑ δ a | θ ⊑ f ( a ⊕ θ Y ′ )( z ) ⊖ f ( a )( z ) } .If we let η = sup V , for all θ ∈ V , since θ Y ′ ⊑ η Y ′ , clearly, by monotonicity θ ⊑ f ( a ⊕ η Y ′ )( z ) ⊖ f ( a )( z )and therefore, by deﬁnition of supremum, η ⊑ f ( a ⊕ η Y ′ )( z ) ⊖ f ( a )( z ), i.e., η ∈ V is a maximum, as desired. Lemma B.9.

Let M be an MV-chain, let Y , Z be ﬁnite sets and let f : M Y → M Z be a non-expansive function. Let a ∈ M Y . For b ∈ [ a, a ⊕ δ ] , let b ⊖ a = L ni =1 δ iY i be a standard form for b ⊖ a . If γ f ( a ) ,δ ( f ( b )) = ∅ then Y n = γ a,δ ( b ) and γ f ( a ) ,δ ( f ( b )) ⊆ f a,δ n ( Y n ) .Proof. By hypothesis γ f ( a ) ,δ ( f ( b )) = ∅ . Let z ∈ γ f ( a ) ,δ ( f ( b )). This means that δ ⊑ f ( b )( z ) ⊖ f ( a )( z ). First observe that δ ⊑ f ( b )( z ) ⊖ f ( a )( z ) [by hypothesis]26 || f ( b ) ⊖ f ( a ) || [by deﬁnition of norm] ⊑ || b ⊖ a || [by non-expansiveness of f ] ⊑ δ [since b ∈ [ a, a ⊕ δ ]]Hence || f ( b ) ⊖ f ( a ) || = δ = || b ⊖ a || = L ni =1 δ i .Also observe that, since δ n = 0, we have ( b ⊖ a )( z ) = δ iﬀ z ∈ Y n . In fact, if z ∈ Y n then z ∈ Y i for all i ∈ { , . . . , n } and thus ( b ⊖ a )( z ) = L ni =1 δ iY i ( z ) = L ni =1 δ i = δ . Conversely, if z Y n , then ( b ⊖ a )( z ) ⊑ L n − i =1 δ i ⊏ δ . In fact,0 ⊏ δ n and L n − i =1 δ i ⊑ δ n . Thus by Lemma A.1(8), L n − i =1 δ i ⊏ δ n ⊕ L n − i =1 δ i = L ni =1 δ i = δ . Hence Y n = γ a,δ ( b ).Let us now show that γ f ( a ) ,δ ( f ( b )) ⊆ f a,δ n ( Y n ). Given z ∈ γ f ( a ) ,δ ( f ( b )), weshow that z ∈ f a,δ n ( Y n ). Observe that δ ⊑ f ( b )( z ) ⊖ f ( a )( z ) =[by hypothesis]= f ( a ⊕ ( b ⊖ a ))( z ) ⊖ f ( a )( z ) =[by Lem. A.1(2), since a ⊑ b ]= f ( a ⊕ n M i =1 δ iY i )( z ) ⊖ f ( a )( z ) =[by construction]= f ( a ⊕ n M i =1 δ iY i ))( z ) ⊖ f ( a ⊕ δ nY n )( z ) ⊕ f ( a ⊕ δ nY n )( z ) ⊖ f ( a )( z )[by Lem. A.1(2), since f ( a ⊕ δ nY n )( z ) ⊑ f ( a ⊕ L ni =1 δ iY i )( z )] ⊑ || f ( a ⊕ n M i =1 δ iY i ) ⊖ f ( a ⊕ δ nY n ) || ⊕ f ( a ⊕ δ nY n )( z ) ⊖ f ( a )( z )[by deﬁnition of norm and monotonicity of ⊕ ] ⊑ || a ⊕ n M i =1 δ iY i ⊖ ( a ⊕ δ nY n ) || ⊕ f ( a ⊕ δ nY n )( z ) ⊖ f ( a )( z )[by non-expansiveness of f and monotonicity of ⊕ ]= || a ⊕ δ nY n ⊕ n − M i =1 δ iY i ⊖ ( a ⊕ δ nY n ) || ⊕ f ( a ⊕ δ nY n )( z ) ⊖ f ( a )( z )[by algebraic manipulation] ⊑ || n − M i =1 δ iY i || ⊕ f ( a ⊕ δ nY n )( z ) ⊖ f ( a )( z )[by Lem. A.1(6) and monotonicity of norm]27 n − M i =1 δ i ⊕ f ( a ⊕ δ nY n )( z ) ⊖ f ( a )( z )[by Lem. B.1(1) and the fact that || δ iY i || = δ i ]= ( δ ⊖ δ n ) ⊕ f ( a ⊕ δ nY n )( z ) ⊖ f ( a )( z )[by construction, since δ n = L n − i =1 δ i ]If we subtract δ ⊖ δ n on both sides, we get δ ⊖ ( δ ⊖ δ n ) ⊑ f ( a ⊕ δ nY n )( z ) ⊖ f ( a )( z ),i.e., since, by Lem. A.1(10), δ ⊖ ( δ ⊖ δ n ) = δ n we conclude δ n ⊑ f ( a ⊕ δ nY n )( z ) ⊖ f ( a )( z ).Hence z ∈ γ f ( a ) ,δ n ( f ( α a,δ n ( Y n )) = f a,δ n ( Y n ), which is the desired result. Theorem 3.7 (approximation of non-expansive functions).

Let ι fa = min { ι fa ( Y ′ , z ) | Y ′ ⊆ [ Y ] a ∧ z ∈ [ Z ] f ( a ) ∧ ι fa ( Y ′ , z ) = 0 } ∪ { δ a } .Observe that for δ ⊑ ι fa we have f a,δ = f a,ι fa . (1)This immediately implies that for all δ, δ ′ ⊑ ι fa we have f a,δ = f a,δ ′ , as desired.In order to show (1), note that, by Lem. 3.6 we have f a,δ ⊇ f a,ι fa . For theother inclusion let Y ′ ⊆ [ Y ] a . We have f a,δ ( Y ′ ) = { z ∈ [ Z ] f ( a ) | f ( a ⊕ δ Y ′ )( z ) ⊖ f ( a )( z ) ⊒ δ } by deﬁnition. Assume that there exists z ∈ f a,δ ( Y ′ ) where f ( a ⊕ ( ι fa ) Y ′ )( z ) ⊖ f ( a )( z ) ι fa . But this is a contradiction, since ι fa is the minimum of all suchnon-zero values.We now show proceed and show properties (a-b).a. Let b ∈ [ a, a ⊕ δ ]. First note that whenever γ f ( a ) ,δ ( f ( b )) = ∅ , the desiredinclusion obviously holds. 28f instead γ f ( a ) ,δ ( f ( b )) = ∅ , let b ⊖ a = L ni =1 δ iY i be a standard form with δ n = 0. First observe that, by Lem. B.9, we have Y n = γ a,δ n ( b ) and γ f ( a ) ,δ ( f ( b )) ⊆ f a,δ n ( Y n ) . (2)For all z ∈ f a,δ n ( Y n ), by deﬁnition of ι fa ( Y n , z ) we have that 0 ⊏ δ n ⊑ ι fa ( Y n , z ), therefore ι fa ⊑ ι fa ( Y n , z ). Moreover, z ∈ f a,ι fa ( Y n ,z ) ( Y n ) ⊆ f a,ι fa ( Y n ) = f a ( Y n ), where the last inequality is motivated by Lem. 3.6 since ι fa ⊑ ι fa ( Y n , z ).Therefore, f a,δ n ( Y n ) ⊆ f a ( γ a,δ ( b )), which combined with (2) gives the desiredresult.b. For (b), we ﬁrst show the direction from left to right. Assume that δ ⊑ ι fa . By(a) clearly, γ f ( a ) ,δ ◦ f ( b ) ⊆ f a ◦ γ a,δ ( b ). For the converse inclusion, note that: f a ( γ a,δ ( b )) [by deﬁnition of f a ]= f a,ι fa ( γ a,δ ( b )) ⊆ [by Lem. 3.6, since δ ⊑ ι fa ] ⊆ f a,δ ( γ a,δ ( b )) [by deﬁnition of f a,δ ]= γ f ( a ) ,δ ( f ( α a,δ ( γ a,δ ( b )))) [since α a,δ ◦ γ a,δ ( b ) ⊑ b ] ⊆ γ f ( a ) ,δ ( f ( b ))as desired.For the other direction, assume γ f ( a ) ,δ ◦ f ( b ) = f a ◦ γ a,δ ( b ) holds for all b ∈ [ a, a ⊕ δ ]. Now, for every Y ′ ⊆ [ Y ] a we have f a,δ ( Y ′ ) = γ f ( a ) ,δ ◦ f ◦ α a,δ ( Y ′ ) = f a ◦ γ a,δ ◦ α a,δ ( Y ′ ). We also have γ a,δ ◦ α a,δ ( Y ′ ) = Y ′ (see proofof Lem. 3.4), thus f a,δ ( Y ′ ) = f a ( Y ′ ). For any δ with ι fa ⊏ δ ⊑ δ a there exists Y ′ ⊆ [ Y ] a and z ∈ [ Z ] f ( a ) with z ∈ f a ( Y ′ ) but z / ∈ f a,δ ( Y ′ ), by deﬁnition of ι fa . Therefore δ ⊑ ι fa has to hold.From the proof of Thm. 3.7 we can extract an explicit deﬁnition of ι fa and ofthe approximation of a function. Deﬁnition B.10 ( a -approximation for a function). Let M be a completeMV-chain and let Y, Z be ﬁnite sets and let f : M Y → M Z be a non-expansivefunction. Let ι fa = min { ι fa ( Y ′ , z ) | Y ′ ⊆ [ Y ] a ∧ z ∈ [ Z ] f ( a ) ∧ ι fa ( Y ′ , z ) =0 } ∪ { δ a } . The a -approximation of f is the function f a = f a,ι fa . C Proofs and Additional Material for § We ﬁrst prove a technical lemma.

Lemma C.1.

Let M be a complete MV-chain, Y a ﬁnite set and f : M Y → M Y be a non-expansive function. Let a ∈ M Y be a pre-ﬁxpoint of f , let f a : [ Y ] a → [ Y ] f ( a ) be the a -approximation of f (Def. B.10). Assume νf a and let Y ′ = { y ∈ [ Y ] a | νf ( y ) ⊖ a ( y ) = || νf ⊖ a ||} . Then for all y ∈ Y ′ it holds a ( y ) = f ( a )( y ) and Y ′ ⊆ f a ( Y ′ ) . roof. Let δ = || νf ⊖ a || . Assume νf a , i.e., there exists y ∈ Y such that νf ( y ) a ( y ). Since the order is total, this means that a ( y ) ⊏ νf ( y ). Hence, byLem. A.1(5), νf ( y ) ⊖ a ( y ) ⊐

0. Then δ = || νf ⊖ a || ⊐

0. Moreover, for all y ∈ Y ′ , a ( y ) = 1 ⊖ a ( y ) ⊒ νf ( y ) ⊖ a ( y ) = δ .First, observe that νf ⊑ a ⊕ δ, (3)since for all y ∈ Y νf ( y ) ⊖ a ( y ) ⊑ δ by deﬁnition of δ and then (3) follows fromLem. A.1(7).Concerning the ﬁrst part, let y ∈ Y ′ . Since a is a pre-ﬁxpoint, f ( a )( y ) ⊑ a ( y ).Assume by contradiction that f ( a )( y ) ⊏ a ( y ). Then we have f ( a ⊕ δ )( y ) =[by Lem. A.1(2), since f is monotone and thus f ( a ) ⊑ f ( a ⊕ δ )]= f ( a )( y ) ⊕ ( f ( a ⊕ δ )( y ) ⊖ f ( a )( y ))[since f is non-expansive, by Lem. B.3, hence f ( a ⊕ δ )( y ) ⊖ f ( a )( y ) ⊑ δ ] ⊑ f ( a )( y ) ⊕ δ [by f ( a )( y ) ⊏ a ( y ), δ ⊑ a ( y ) and Lem. A.1(6)] ⊏ a ( y ) ⊕ δ [by Lem. A.1(2) since a ( y ) ⊑ νf ( y ) and δ = νf ( y ) ⊖ a ( y )]= νf ( y )= f ( νf )( y )[since νf ⊑ a ⊕ δ (3) and f monotone] ⊑ f ( a ⊕ δ )( y )i.e., a contradiction. Hence it must be a ( y ) = f ( a )( y ).For the second part, in order to show Y ′ ⊆ f a ( Y ′ ), we let b = νf ⊔ a . Byusing (3) we immediately have that b ∈ [ a, a ⊕ δ ].We next prove that Y ′ = γ a,δ ( b ).We show separately the two inclusions. If y ∈ Y ′ then a ( y ) ⊏ νf ( y ) and thus b ( y ) = a ( y ) ⊔ νf ( y ) = νf ( y ) and thus b ( y ) ⊖ a ( y ) = νf ( y ) ⊖ a ( y ) = δ . Hence y ∈ γ a,δ ( b ). Conversely, if y ∈ γ a,δ ( b ), then a ( y ) ⊏ νf ( y ). In fact, if it were a ( y ) ⊒ νf ( y ), then, by deﬁnition of b we would have b ( y ) = a ( y ) and b ( y ) ⊖ a ( y ) = 0 δ .Therefore, b ( y ) = νf ( y ) and thus νf ( y ) ⊖ a ( y ) = b ( y ) ⊖ a ( y ) ⊒ δ , whence y ∈ Y ′ .We can now conclude. In fact, since f is non-expansive, by Thm. 3.7(a), wehave γ f ( a ) ,δ ( f ( b )) ⊆ f a ( Y ′ ) . Moreover Y ′ ⊆ γ f ( a ) ,δ ( f ( b )). In fact, let y ∈ Y ′ , i.e., y ∈ [ Y ] a and δ ⊑ b ( y ) ⊖ a ( y ). Since a ( y ) = f ( a )( y ), we have that y ∈ [ Y ] f ( a ) . In order to concludethat y ∈ γ f ( a ) ,δ ( f ( b )) it is left to show that δ ⊑ f ( b )( y ) ⊖ f ( a )( y ). We have f ( b )( y ) ⊖ f ( a )( y ) = f ( b )( y ) ⊖ a ( y ) [since y ∈ Y ′ ]30 f ( νf ⊔ a )( y ) ⊖ a ( y ) [deﬁnition of b ] ⊒ ( f ( νf )( y ) ⊔ f ( a )( y )) ⊖ a ( y ) [properties of ⊔ ]= ( νf ( y ) ⊔ a ( y )) ⊖ a ( y ) [since νf ﬁxpoint and y ∈ Y ′ ]= b ( y ) ⊖ a ( y ) [deﬁnition of b ] ⊒ δ [since y ∈ Y ′ ]Combining the two inclusions, we have Y ′ ⊆ f a ( Y ′ ), as desired. Theorem 4.1 (soundness and completeness for ﬁxpoints).

Let M be acomplete MV-chain, Y a ﬁnite set and f : M Y → M Y be a non-expansive func-tion. Let a ∈ M Y be a ﬁxpoint of f . Then νf a = ∅ if and only if a = νf .Proof. Let a be a ﬁxpoint of f and assume that a = νf . For δ = ι fa ⊑ δ a ,according to Lem. 3.4, we have a Galois connection: [ Y ] a [ a, a + δ ] α a,δ γ a,δ f a f a,δ Since a is a ﬁxpoint, then [ Y ] f ( a ) = [ Y ] a and, by Thm. 3.7(b), γ a,δ ◦ f = γ f ( a ) ,δ ◦ f = f a ◦ γ a,δ .Therefore by [13, Prop. 14], νf a = γ a,δ ( νf ). Recall that γ a,δ ( νf ) = { y ∈ Y | δ ⊑ νf ( y ) ⊖ a ( y ) } . Since a = νf and δ ⊐

0, we know that γ a,δ ( νf ) = ∅ and weconclude νf a = ∅ , as desired.Conversely, in order to prove that if νf a = ∅ then a = νf , we prove thecontrapositive. Assume that a = νf . Since a is a ﬁxpoint and νf is the largest,this means that a ⊏ νf and thus || νf ⊖ a || 6 = 0. Consider Y ′ = { y ∈ [ Y ] a | νf ( y ) ⊖ a ( y ) = || νf ⊖ a ||} 6 = ∅ . By Lem. C.1, Y ′ is a post-ﬁxpont of f a , i.e., Y ′ ⊆ f a ( Y ′ ), and thus νf a ⊇ Y ′ which implies νf a = ∅ , as desired. Lemma 4.2.

Let M be a complete MV-chain, f : M Y → M Y a non-expansivefunction, a ∈ M a ﬁxpoint of f , and let f a be the corresponding a -approximationand ι fa as in Thm. 3.7. Then α a,ι fa ( νf a ) = a ⊕ ( ι fa ) νf a is a post-ﬁxpoint of f .Proof. By Thm. 3.7(b) we know that f a ◦ γ a,ι fa = γ f ( a ) ,ι fa ◦ f = γ a,ι fa ◦ f , inparticular f a ◦ γ a,ι fa ⊆ γ a,ι fa ◦ f . Since ι fa ⊑ δ a , by Lem. 3.4, we have a Galoisconnection and thus we can rewrite this to α a,ι fa ◦ f a ⊑ f ◦ α a,ι fa . Thus we have α a,ι fa ( νf a ) = α a,ι fa ( f a ( νf a )) ⊑ f ( α a,ι fa ( νf a )) . Theorem 4.3 (soundness for pre-ﬁxpoints).

0. Then δ = || νf ⊖ a || ⊐ Y ′ = { y ∈ Y a | νf ( y ) ⊖ a ( y ) = || νf ⊖ a ||} 6 = ∅ . By Lem. C.1, Y ′ is a post-ﬁxpont of f a , i.e., Y ′ ⊆ f a ( Y ′ ), and thus Y ′ ⊆ νf a . Moreover,for all y ∈ Y ′ , a ( y ) = f ( a )( y ), i.e., Y ′ ⊆ [ Y ] a = f ( a ) . Therefore we conclude Y ′ ⊆ f a ( Y ′ ) ∩ [ Y ] a = f ( a ) = f ∗ a ( Y ′ ), i.e., Y ′ is a post-ﬁxpoint also for f ∗ a , andthus νf ∗ a ⊇ Y ′ = ∅ , as desired. C.1 The dual view for least ﬁxpoints

The theory developed so far can be easily dualised to check under-approximationsof least ﬁxpoints. Given a complete MV-algebra M = ( M, ⊕ , , ( · )) and a non-expansive function f : M Y → M Y , in order to show that post-ﬁxpoint a ∈ M Y is such that a ⊑ µf we can in fact simply work in the dual MV-algebra, M op =( M, ⊒ , ⊗ , ( · ) , , ⊕ could be the “standard” operation on M , it is convenient to formulatethe conditions using ⊕ and ⊖ and the original order.The pair of functions h α a,θ , γ a,θ i is as follows. Let a : Y → M and 0 ⊏ θ ∈ M .The set [ Y ] a = { y ∈ Y | a ( y ) = 0 } and δ a = min { a ( y ) | y ∈ [ Y ] a } The target of the approximation is [ a, a ⊗ θ ] in the reverse order, hence[ a ⊗ θ, a ] in the original order. Recall that a ⊗ θ = a ⊕ θ = a ⊖ θ . Hence wehave [ Y ] a [ a ⊖ θ, a ] α a,θ γ a,θ For Y ′ ∈ [ Y ] a we deﬁne α a,θ ( Y ′ ) = a ⊗ θ Y ′ = a ⊖ θ Y ′ For deﬁning γ a,θ we will use subtraction in the dual MV-algebra, which is x ⊖÷ y = x ⊗ y = x ⊕ y = y ⊖ x . Hence θ ⊒ b ( y ) ⊖÷ a ( y ) iﬀ a ( y ) ⊖ b ( y ) ⊒ θ and thusfor b ∈ [ a ⊖ θ, a ]. γ a,θ ( b ) = { y ∈ Y | θ ⊒ b ( y ) ⊖÷ a ( y ) } = { y ∈ Y | a ( y ) ⊖ b ( y ) ⊒ θ } .Let f : M Y → M Z be a monotone function. The norm becomes || a || =min { a ( y ) | y ∈ Y } . Non-expansiveness (Def. 3.2) in the dual MV-algebra be-comes: for all a, b ∈ M Y , || f ( b ) ⊖÷ f ( a ) || ⊒ || b ⊖÷ a || , which in turn ismin { f ( a ) ⊖ f ( b ) | y ∈ Y } ⊒ min { a ( y ) ⊖ b ( y ) | y ∈ Y } || f ( a ) ⊖ f ( b ) || ⊑ || a ⊖ b || , which coincides with non-expansiveness in the orig-inal MV-algebra.Observe that, instead of taking a generic θ ⊏ θ , wecan directly take 0 ⊏ θ and replace everywhere ¯ θ with θ .While the approximation of a function in the primal sense are called f a , theapproximations in the dual sense will be denoted by f a . D Proofs and Additional Material for § We start by introducing some basic functions, which will be used as the buildingblocks for the functions needed in applications. Note that we consider distri-butions on MV-chains that are more general than the probability distributionsstudied in the main body of the paper.

Deﬁnition D.1 (basic functions).

Let M be an MV-chain and let Y , Z beﬁnite sets.1. Constant:

For a ﬁxed k ∈ M Z , we deﬁne c k : M Y → M Z by c k ( a ) = k Reindexing:

For u : Z → Y , we deﬁne u ∗ : M Y → M Z by u ∗ ( a ) = a ◦ u. Min/Max:

For

R ⊆ Y × Z , we deﬁne min R , max R : M Y → M Z by min R ( a )( z ) = min y R z a ( y ) max R ( a )( z ) = max y R z a ( y ) Average:

Call a function p : Y → M a distribution when for all y ∈ Y ,it holds p ( y ) = L y ′ ∈ Y \{ y } p ( y ′ ) and let D ( Y ) be the set of distributions.Assume that M is endowed with an additional operation ⊙ such that ( M , ⊙ , is a commutative monoid, for x, y ∈ M x ⊙ y ⊑ x , and x ⊙ y = 0 iﬀ x = 0 or y = 0 , and ⊙ weakly distributes over ⊕ , i.e., for all x, y, z ∈ M with y ⊑ z , x ⊙ ( y ⊕ z ) = x ⊙ y ⊕ x ⊙ z . For D ⊆ D ( Y ) a ﬁnite set we deﬁne av D : M Y → M D by av D ( a )( p ) = M y ∈ Y p ( y ) ⊙ a ( y )A particularly interesting subcase of (3) is when we take as relation the belongs to relation ∈ ⊆ Y × Y . In this way we obtain functions for selectingthe minimum and the maximum, respectively, of an input function over a set Y ′ ⊆ Y , that is: min ∈ , max ∈ : M Y → M Y min ∈ ( a )( Y ′ ) = min y ∈ Y ′ a ( y ) max ∈ ( a )( Y ′ ) = max y ∈ Y ′ a ( y )33lso note that in the deﬁnition of av D , the operation ⊙ is necessarily mono-tone. In fact, if y ⊑ y ′ then, by Lemma A.1(2), we have y ′ = y ⊕ ( y ′ ⊖ y ).Therefore x ⊙ y ⊑ x ⊙ y ⊕ x ⊙ ( y ′ ⊖ y ) = x ⊙ ( y ⊕ ( y ′ ⊖ y )) = x ⊙ y ′ , where thethird central last passage is by weak distributivity.The basic functions can be shown to be non-expansive. Proposition D.2.

The basic functions from Def. D.1 are all non-expansive.Proof. – Constant functions: immediate. – Reindexing:

Let u : Z → Y . For all a, b ∈ M Y , we have || u ∗ ( b ) ⊖ u ∗ ( a ) || = max z ∈ Z ( b ( u ( z )) ⊖ a ( u ( z ))) ⊑ max y ∈ Y ( b ( y ) ⊖ a ( y )) [since u ( Z ) ⊆ Y ]= || b ⊖ a || [by def. of norm] – Minimum:

Let

R ⊆ Y × Z be a relation. For all a, b ∈ M Y , we have || min R ( b ) ⊖ min R ( a ) || = max z ∈ Z (min y R z b ( y ) ⊖ min y R z a ( y ))Observe thatmax z ∈ Z (min y R z b ( y ) ⊖ min y R z a ( y )) = max z ∈ Z ′ (min y R z b ( y ) ⊖ min y R z a ( y ))where Z ′ = { z ∈ Z | ∃ y ∈ Y. y R z } , since on every other z ∈ Z \ Z ′ thediﬀerence would be 0. Now, for every z ∈ Z ′ , take y z ∈ Y such that y z R z and a ( y z ) = min y R z a ( y ). Such a y z is guaranteed to exist whenever Y is ﬁnite.Then, we havemax z ∈ Z ′ (min y R z b ( y ) ⊖ min y R z a ( y )) ⊑ max z ∈ Z ′ ( b ( y z ) ⊖ a ( y z )) [ ⊖ monotone in ﬁrst arg.] ⊑ max z ∈ Z ′ || b ⊖ a || [by def. of norm]= || b ⊖ a || [ || b ⊖ a || is independent from z ] – Maximum:

Let

R ⊆ Y × Z be a relation. For all a, b ∈ M Y we have || max R ( b ) ⊖ max R ( a ) || = max z ∈ Z (max y R z b ( y ) ⊖ max y R z a ( y )) ⊑ max z ∈ Z (max y R z (( b ( y ) ⊖ a ( y )) ⊕ a ( y )) ⊖ max y R z a ( y ))34since ( b ( y ) ⊖ a ( y )) ⊕ a ( y ) = a ( y ) ⊔ b ( y ) and ⊖ monotone in ﬁrst arg.] ⊑ max z ∈ Z ((max y R z ( b ( y ) ⊖ a ( y )) ⊕ max y R z a ( y )) ⊖ max y R z a ( y ))[by def. of max and monotonicity of ⊕ ] ⊑ max z ∈ Z max y R z ( b ( y ) ⊖ a ( y )) [by Lem. A.1(6)] ⊑ max z ∈ Z max y R z || b ⊖ a || [by def. of norm]= || b ⊖ a || [since || b ⊖ a || is independent] – Average:

We ﬁrst note that, when p : Y → M , with Y ﬁnite, is a distribution,then an inductive argument based on weak distributivity, allows one to showthat for all x ∈ M , Y ′ ⊆ Y , x ⊙ L y ∈ Y ′ p ( y ) = L y ∈ Y ′ x ⊙ p ( y ).For all a, b ∈ M Y we have || av D ( b ) ⊖ av D ( a ) || = max p ∈ D ( M y ∈ Y p ( y ) ⊙ b ( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y )) ⊑ max p ∈ D ( M y ∈ Y p ( y ) ⊙ (( b ( y ) ⊖ a ( y )) ⊕ a ( y )) ⊖ M y ∈ Y p ( y ) ⊙ a ( y ))[by monotonicity of ⊙ , ⊕ , ⊖ and ( b ( y ) ⊖ a ( y )) ⊕ a ( y ) = a ( y ) ⊔ b ( y )]= max p ∈ D ( M y ∈ Y ( p ( y ) ⊙ ( b ( y ) ⊖ a ( y ))) ⊕ ( p ( y ) ⊙ a ( y )) ⊖ M y ∈ Y p ( y ) ⊙ a ( y ))[since b ( y ) ⊖ a ( y ) ⊑ ⊖ a ( y ) = a ( y ), and ⊙ weakly distributes over ⊕ ]= max p ∈ D (( M y ∈ Y p ( y ) ⊙ ( b ( y ) ⊖ a ( y )) ⊕ M y ∈ Y p ( y ) ⊙ a ( y )) ⊖ M y ∈ Y p ( y ) ⊙ a ( y )) ⊑ max p ∈ D M y ∈ Y p ( y ) ⊙ ( b ( y ) ⊖ a ( y )) [by Lem. A.1(6)] ⊑ max p ∈ D M y ∈ Y p ( y ) ⊙ || b ⊖ a || [by def. of norm and monotonicity of ⊙ ]= max p ∈ D || b ⊖ a || ⊙ M y ∈ Y p ( y ) [since p is a distributionand ⊙ weakly distributes over ⊕ ]= max p ∈ D ( || b ⊖ a || ⊙

1) [since p is a distribution over Y ]= || b ⊖ a || [since || b ⊖ a || is independent from p ]The next result determines the approximations associated with the basicfunctions. Proposition D.3 (approximations of basic functions).

Let M be an MV-chain, Y, Z be ﬁnite sets and let a ∈ M Y . We deﬁneMin a = { y ∈ Y | a ( y ) minimal } Max a = { y ∈ Y | a ( y ) maximal } Constant: for k : M Z , the approximations ( c k ) a : [ Y ] a → [ Z ] ck ( a ) , ( c k ) a : [ Y ] a → [ Z ] ck ( a ) are ( c k ) a ( Y ′ ) = ∅ = ( c k ) a ( Y ′ ) – Reindexing: for u : Z → Y , the approximations ( u ∗ ) a : [ Y ] a → [ Z ] u ∗ ( a ) , ( u ∗ ) a : [ Y ] a → [ Z ] u ∗ ( a ) are ( u ∗ ) a ( Y ′ ) = ( u ∗ ) a ( Y ′ ) = u − ( Y ′ ) = { z ∈ [ Z ] u ∗ ( a ) | u ( z ) ∈ Y ′ } – Min: for

R ⊆ Y × Z , the approximations (min R ) a : [ Y ] a → [ Z ] min R ( a ) , (min R ) a : [ Y ] a → [ Z ] min R ( a ) are given below, where R − ( z ) = { y ∈ Y | y R z } : (min R ) a ( Y ′ ) = { z ∈ [ Z ] min R ( a ) | Min a | R− z ) ⊆ Y ′ } (min R ) a ( Y ′ ) = { z ∈ [ Z ] min R ( a ) | Min a | R− z ) ∩ Y ′ = ∅} – Max: for

R ⊆ Y × Z , the approximations (max R ) a : [ Y ] a → [ Z ] max R ( a ) , (max R ) a : [ Y ] a → [ Z ] max R ( a ) are (max R ) a ( Y ′ ) = { z ∈ [ Z ] max R ( a ) | Max a | R− z ) ∩ Y ′ = ∅} (max R ) a ( Y ′ ) = { z ∈ [ Z ] max R ( a ) | Max a | R− z ) ⊆ Y ′ } – Average: for a ﬁnite D ⊆ D ( Y ) , the approximations (av D ) a : [ Y ] a → [ Z ] D av D ( a ) , (av D ) a : [ Y ] a → [ Z ] D av D ( a ) are (av D ) a ( Y ′ ) = { p ∈ [ D ] av D ( a ) | supp ( p ) ⊆ Y ′ } (av D ) a ( Y ′ ) = { p ∈ [ D ] av D ( a ) | supp ( p ) ⊆ Y ′ } , where supp ( p ) = { y ∈ Y | p ( y ) > } for p ∈ D ( Y ) .Proof. We only consider the primal cases, the dual ones are analogous.Let a ∈ M Y . – Constant: for all 0 ⊏ θ ⊑ δ a and Y ′ ⊆ [ Y ] a we have( c k ) a,θ ( Y ′ ) = γ c k ( a ) ,θ ◦ c k ◦ α a,θ ( Y ′ )= { z ∈ [ Z ] c k ( a ) | θ ⊑ c k ( a ⊕ θ Y ′ )( z ) ⊖ c k ( a )( z ) } = { z ∈ [ Z ] c k ( a ) | θ ⊑ k ⊖ k } = { z ∈ Z | θ ⊑ } = ∅ Hence all values ι fa ( Y ′ , z ) are equal to 0 and we have ι fa = δ a . Replacing θ by ι fa we obtain ( c k ) a ( Y ′ ) = ∅ . 36 Reindexing: for all 0 ⊏ θ ⊑ δ a and Y ′ ⊆ [ Y ] a we have( u ∗ ) a,θ ( Y ′ ) = γ u ∗ ( a ) ,θ ◦ u ∗ ◦ α a,θ ( Y ′ )= { z ∈ [ Z ] u ∗ ( a ) | θ ⊑ ( a ⊕ θ Y ′ )( u ( z )) ⊖ a ( u ( z )) } . We show that this corresponds to u − ( Y ′ ) = { z ∈ Z | u ( z ) ∈ Y ′ } . It is easyto see that for all z ∈ u − ( Y ′ ), we have( a ⊕ θ Y ′ )( u ( z )) ⊖ a ( u ( z )) = θ = a ( u ( z )) ⊖ ( a ⊖ θ Y ′ )( u ( z ))since u ( z ) ∈ Y ′ and θ ⊑ δ a . Since u ( z ) ∈ Y ′ ⊆ [ Y ] a , we have u ∗ ( a )( z ) = a ( u ( z )) = 1 and hence z ∈ [ Z ] u ∗ ( a ) . On the other hand, for all z u − ( Y ′ ),we have ( a ⊕ θ Y ′ )( u ( z )) = a ( u ( z )) = ( a ⊖ θ Y ′ )( u ( z ))since u ( z ) / ∈ Y ′ , and so( a ⊕ θ Y ′ )( u ( z )) ⊖ a ( u ( z )) = a ( u ( z )) ⊖ ( a ⊖ θ Y ′ )( u ( z )) = 0 ⊏ θ. Therefore ( u ∗ ) a,θ ( Y ′ ) = u − ( Y ′ ).We observe that for Y ′ ⊆ [ Y ] a , z ∈ [ Z ] u ∗ ( a ) either u ∗ ( a ⊕ θ Y ′ )( z ) ⊖ u ∗ ( a )( z ) ⊏ θ for all 0 ⊏ θ ⊑ δ a – and in this case ι u ∗ a ( Y ′ , z ) = 0 – or u ∗ ( a ⊕ θ Y ′ )( z ) ⊖ u ∗ ( a )( z ) = θ for all 0 ⊏ θ ⊑ δ a – and in this case ι u ∗ a ( Y ′ , z ) = δ a . By takingthe minimum over all non-zero values, we get ι u ∗ a = δ a .And ﬁnally we observe that ( u ∗ ) a ( Y ′ ) = ( u ∗ ) a,ι u ∗ a ( Y ′ ) = u − ( Y ′ ). – Minimum: let 0 ⊏ θ ⊑ δ a . For all Y ′ ⊆ [ Y ] a we have(min R ) a,θ ( Y ′ ) = γ min R ( a ) ,θ ◦ min R ◦ α a,θ ( Y ′ )= { z ∈ [ Z ] min R ( a ) | θ ⊑ min y R z ( a ⊕ θ Y ′ )( y ) ⊖ min y R z a ( y ) } We compute the value V = min y R z ( a ⊕ θ Y ′ )( y ) ⊖ min y R z a ( y ) and considerthe following cases: • Assume that there exists ˆ y ∈ Min a | R− z ) where ˆ y Y ′ .Then ( a ⊕ θ Y ′ )(ˆ y ) = a (ˆ y ) ⊑ a ( y ) ⊑ ( a ⊕ θ Y ′ )( y ) for all y ∈ R − ( z ), whichimplies that min y R z ( a ⊕ θ Y ′ )( y ) = a (ˆ y ). We also have min y R z a ( y ) = a (ˆ y )and hence V = 0. • Assume that

Min a | R− z ) ⊆ Y ′ and θ ⊑ a ( y ) ⊖ a (ˆ y ) whenever ˆ y ∈ Min a | R− z ) , y Y ′ and y R z .Since Min a | R− z ) ⊆ Y ′ we observe thatmin y R z ( a ⊕ θ Y ′ )( y ) = min { min y ∈ Min a |R− z ) ( a ( y ) ⊕ θ ) , min y R z,y Y ′ a ( y ) } We can omit the values of all y with y R z , y Min a | R− z ) , y ∈ Y ′ , sincewe will never attain the minimum there.37ow let ˆ y ∈ Min a | R− z ) and y with y R z and y Y ′ . Then θ ⊑ a ( y ) ⊖ a (ˆ y )by assumption, which implies a (ˆ y ) ⊕ θ ⊑ a ( y ), since a (ˆ y ) ⊑ a ( y ) andLem. A.1(2) holds.From this we can deduce min y R z ( a ⊕ θ Y ′ )( y ) = a (ˆ y ) ⊕ θ . We also havemin y R z a ( y ) = a (ˆ y ) and hence – since a (ˆ y ) ⊑ θ (due to θ ⊑ δ a ⊑ a (ˆ y ))and Lem. A.1(9) holds – V = ( a (ˆ y ) ⊕ θ ) ⊖ a (ˆ y ) = θ . • In the remaining case

Min a | R− z ) ⊆ Y ′ and there exist ˆ y ∈ Min a | R− z ) , y Y ′ , y R z such that a ( y ) ⊖ a (ˆ y ) ⊏ θ .This implies a ( y ) ⊑ ( a ( y ) ⊖ a (ˆ y )) ⊕ a (ˆ y ) ⊏ θ ⊕ a (ˆ y ) since again a (ˆ y ) ⊑ θ and Lem. A.1(8) holds. Hence min y R z ( a ⊕ θ Y ′ )( y ) ⊑ a ( y ), which meansthat V ⊑ a ( y ) ⊖ a (ˆ y ) ⊏ θ .Summarizing, for θ ⊑ δ a we observe that V = θ if and only if Min a | R− z ) ⊆ Y ′ and θ ⊑ a ( y ) ⊖ a (ˆ y ) whenever ˆ y ∈ Min a | R− z ) , y Y ′ and y R z .Hence if Min a | R− z ) ⊆ Y ′ we have ι min R a ( Y ′ , z ) = min { a ( y ) ⊖ a (ˆ y ) | ˆ y ∈ Min a | R− z ) , y Y ′ , y R z } ∪ { δ a } otherwise ι min R a ( Y ′ , z ) = 0.The values above are minimal whenever Y ′ = Min a | R− z ) and thus we have: ι min R a = min z ∈ [ Z ] min R ( a ) { a ( y ) ⊖ a (ˆ y ) | y R z, ˆ y ∈ Min a | R− z ) , y Min a | R− z ) }∪{ δ a } . Finally we deduce that(min R ) a ( Y ′ ) = (min R ) a,ι min R a ( Y ′ ) = { z ∈ [ Z ] min R ( a ) | Min a | R− z ) ⊆ Y ′ } . – Maximum: let 0 ⊏ θ ⊑ δ a . For all Y ′ ⊆ [ Y ] a we have(max R ) a,θ ( Y ′ ) = γ max R ( a ) ,θ ◦ max R ◦ α a,θ ( Y ′ )= { z ∈ [ Z ] max R ( a ) | θ ⊑ max y R z ( a ⊕ θ Y ′ )( y ) ⊖ max y R z a ( y ) } We observe thatmax y R z ( a ⊕ θ Y ′ )( y ) = max { max y ∈ Max a |R− z ) ( a ⊕ θ Y ′ )( y ) , max y R z,y ∈ Y ′ y Max a |R− z ) ( a ( y ) ⊕ θ ) } We can omit the values of all y with y R z , y Max a | R− z ) , y Y ′ , since wewill never attain the maximum there.We now compute the value V = max y R z ( a ⊕ θ Y ′ )( y ) ⊖ max y R z a ( y ) andconsider the following cases: • Assume that there exists ˆ y ∈ Max a | R− z ) where ˆ y ∈ Y ′ .Then ( a ⊕ θ Y ′ )(ˆ y ) = a (ˆ y ) ⊕ θ ⊒ ( a ⊕ θ Y ′ )( y ) ⊒ a ( y ) for all y ∈R − ( z ), which implies that max y R z ( a ⊕ θ Y ′ )( y ) = a (ˆ y ) ⊕ θ . We alsohave max y R z a ( y ) = a (ˆ y ) and hence – since a (ˆ y ) ⊑ θ and Lem. A.1(9)holds – V = ( a (ˆ y ) ⊕ θ ) ⊖ a (ˆ y ) = θ .38 Assume that

Max a | R− z ) ∩ Y ′ = ∅ . Now let ˆ y ∈ Max a | R− z ) and y Max a | R− z ) with y R z and y ∈ Y ′ . Thenmax y ∈ Max a |R− z ) ( a ⊕ θ Y ′ )( y ) = a (ˆ y )max y R z,y ∈ Y ′ y Max a |R− z ) ( a ( y ) ⊕ θ ) = a ( y ′ ) ⊕ θ for some value y ′ with y ′ R z , y ′ ∈ Y ′ , y ′ Max a | R− z ) , that is a ( y ′ ) ⊏ a (ˆ y ).So then either max y R z ( a ⊕ θ Y ′ )( y ) = a (ˆ y ) and V = a (ˆ y ) ⊖ a (ˆ y ) = 0. Ormax y R z ( a ⊕ θ Y ′ )( y ) = a ( y ′ ) ⊕ θ and by Lem. A.1(11) V = ( a ( y ′ ) ⊕ θ ) ⊖ a (ˆ y ) ⊏ θ .Summarizing, for θ ⊑ δ a we observe that V = θ if and only if Max a | R− z ) ∩ Y ′ = ∅ , where the latter condition is independent of θ .Hence, as in the case of reindexing, we have ι max R a = δ a . Finally we have(max R ) a ( Y ′ ) = (max R ) a,ι max R a ( Y ′ ) = { z ∈ [ Z ] max R ( a ) | Max a | R− z ) ∩ Y ′ = ∅} . – Average: for all 0 ⊏ θ ⊑ δ a and Y ′ ⊆ [ Y ] a by deﬁnition(av D ) a,θ ( Y ′ ) = γ av D ( a ) ,θ ◦ av D ◦ α a,θ ( Y ′ )= { p ∈ [ D ] av D ( a ) | θ ⊑ M y ∈ Y p ( y ) ⊙ ( a ⊕ θ Y ′ )( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y ) } We show that this set corresponds to { p ∈ [ D ] av D ( a ) | supp ( p ) ⊆ Y ′ } .Consider p ∈ [ D ] av D ( a ) such that supp ( p ) ⊆ Y ′ . Note that clearly L y ∈ Y ′ p ( y ) =1. Now we have M y ∈ Y p ( y ) ⊙ ( a ⊕ θ Y ′ )( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y )= M y ∈ Y ′ p ( y ) ⊙ ( a ( y ) ⊕ θ ) ⊕ M y ∈ Y \ Y ′ p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y )= M y ∈ Y ′ ( p ( y ) ⊙ a ( y ) ⊕ p ( y ) ⊙ θ ) ⊕ M y ∈ Y \ Y ′ p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y )[by weak distributivity, since for y ∈ Y ′ ⊆ [ Y ] a , a ( y ) ⊑ δ a ]= M y ∈ Y ′ p ( y ) ⊙ θ ⊕ M y ∈ Y ′ p ( y ) ⊙ a ( y ) ⊕ M y ∈ Y \ Y ′ p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y )= M y ∈ Y ′ p ( y ) ⊙ θ ⊕ M y ∈ Y ′ p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y ′ p ( y ) ⊙ a ( y )[since, for y Y ′ ⊇ supp ( p ), p ( y ) = 0 and thus p ( y ) ⊙ a ( y ) = 0]= ( M y ∈ Y ′ p ( y )) ⊙ θ ⊕ M y ∈ Y ′ p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y ′ p ( y ) ⊙ a ( y )39by weak distributivity, since p is a distribution]= 1 ⊙ θ ⊕ M y ∈ Y ′ p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y ′ p ( y ) ⊙ a ( y )[since p is a distribution]= θ ⊕ M y ∈ Y ′ p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y ′ p ( y ) ⊙ a ( y )= θ In order to motivate the last passage, observe that for all y ∈ Y ′ ⊆ [ Y ] a ,we have a ( y ) ⊑ δ a , and thus L y ∈ Y ′ p ( y ) ⊙ a ( y ) ⊑ L y ∈ Y ′ p ( y ) ⊙ δ a =( L y ∈ Y ′ p ( y )) ⊙ δ a = 1 ⊙ δ a = δ a , where the third last passage is motivated byweak distributivity. Since θ ⊑ δ a , by Lem. A.1(3), we have δ a ⊑ θ and thus L y ∈ Y ′ p ( y ) ⊙ a ( y ) ⊑ θ . In turn, using this fact, Lem. A.1(9) motivates the lastequality in the chain above, i.e., θ ⊕ L y ∈ Y ′ p ( y ) ⊙ a ( y ) ⊖ L y ∈ Y ′ p ( y ) ⊙ a ( y ) = θ .On the other hand, for all p ∈ [ D ] av D ( a ) such that supp ( p ) Y ′ , there exists y ′ ∈ Y \ Y ′ such that p ( y ′ ) = 0. Then, we have M y ∈ Y p ( y ) ⊙ ( a ⊕ θ Y ′ )( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y )= M y ∈ Y ′ p ( y ) ⊙ ( a ( y ) ⊕ θ ) ⊕ M y ∈ Y \ Y ′ p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y )= M y ∈ Y ′ p ( y ) ⊙ θ ⊕ M y ∈ Y ′ p ( y ) ⊙ a ( y ) ⊕ M y ∈ Y \ Y ′ p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y )[by weak distributivity, since for y ∈ Y ′ ⊆ [ Y ] a , a ( y ) ⊑ δ a ]= M y ∈ Y ′ p ( y ) ⊙ θ ⊕ M y ∈ Y p ( y ) ⊙ a ( y ) ⊖ M y ∈ Y p ( y ) ⊙ a ( y ) ⊑ M y ∈ Y ′ p ( y ) ⊙ θ [by Lem. A.1(6)]= θ ⊙ M y ∈ Y ′ p ( y )[by weak distributivity, since p is a distribution] ⊏ θ In order to motivate the last inequality, we proceed as follows. We have that supp ( p ) Y ′ . Let y ∈ supp ( p ) \ Y ′ . We know that p ( y ) ⊑ L y ∈ Y \{ y } p ( y ) ⊑ L y ∈ Y ′ p ( y ). Therefore L y ∈ Y ′ p ( y ) ⊑ p ( y ) = 0. Hence L y ∈ Y ′ p ( y ) ⊏ x ∈ M , x = 1 then θ ⊙ x ⊏ θ . Note that x = 0. Therefore θ = θ ⊙ θ ⊙ ( x ⊕ x ) = θ ⊙ x ⊕ θ ⊙ x , where the last equality follows by weak40istributivity. Now θ ⊙ x ⊑ x ⊑ θ ⊙ x , and thus, by Lem. A.1(9), we obtain θ ⊙ x = θ ⊙ x ⊕ θ ⊙ x ⊖ θ ⊙ x = θ ⊖ θ ⊙ x ⊏ θ , as desired. The last passagefollows by the fact that θ, x = 0 and thus θ ⊙ x = 0.Since these results hold for all θ ⊑ δ a , we have ι av D a = δ a .And ﬁnally (av D ) a,θ ( Y ′ ) = (av D ) a,θ ( Y ′ ) = { p ∈ [ D ] av D ( a ) | supp ( p ) ⊆ Y ′ } .When a non-expansive function arises as the composition of simpler ones (seeLem. B.4) we can obtain the corresponding approximation by just composingthe approximations of the simpler functions. Proposition D.4 (composing approximations).

Let g : M Y → M W and h : M W → M Z be non-expansive functions. For all a ∈ M Y we have that ( h ◦ g ) a = h g ( a ) ◦ g a . Analogously ( h ◦ g ) a = h g ( a ) ◦ g a for the dual case.Proof. Here we only consider the primal case, the dual case for ( h ◦ g ) a is anal-ogous.Let 0 ⊏ θ ⊑ min { ι ga , ι hg ( a ) } . Then, by Thm. 3.7(b) we know that g a = g a,θ = γ g ( a ) ,θ ◦ g ◦ α a,θ h g ( a ) = h g ( a ) ,θ = γ h ( g ( a )) ,θ ◦ h ◦ α g ( a ) ,θ Now we will prove that ( h ◦ g ) a,θ = h g ( a ) ,θ ◦ g a,θ First observe that g ( α a,θ ( Y ′ )) ∈ [ g ( a ) , g ( a ⊕ θ )] ⊆ [ g ( a ) , g ( a ) ⊕ θ ] for all Y ′ ⊆ [ Y ] a by Lem. B.3. Applying Thm. 3.7(b) on h we obtain( h ◦ g ) a,θ = γ h ( g ( a )) ,θ ◦ h ◦ g ◦ α a,θ ( Y ′ ) = h g ( a ) ,θ ◦ γ g ( a ) ,θ ◦ g ◦ α a,θ ( Y ′ )= h g ( a ) ,θ ◦ g a,θ ( Y ′ ) = h g ( a ) ◦ g a ( Y ′ )Hence all functions ( h ◦ g ) a,θ are equal and independent of θ and so it must holdthat ( h ◦ g ) a,θ = ( h ◦ g ) a . Then from Thm. 3.7 we can conclude min { ι ga , ι hg ( a ) } ⊑ ι h ◦ ga .Furthermore functions can be combined via disjoint union, preserving non-expansiveness, as follows. Proposition D.5 (disjoint union of non-expansive functions).

Let f i : M Y i → M Z i , for i ∈ I , be non-expansive and such that the sets Z i are pairwisedisjoint. The function U i ∈ I f i : M S i ∈ I Y i → M U i ∈ I Z i deﬁned by ] i ∈ I f i ( a )( z ) = f i ( a | Y i )( z ) if z ∈ Z i is non-expansive. roof. For all a, b ∈ M S i ∈ I Y i we have || ] i ∈ I f i ( b ) ⊖ ] i ∈ I f i ( a ) || = max z ∈ U i ∈ I Z i ( ] i ∈ I f i ( b )( z ) ⊖ ] i ∈ I f i ( a )( z ))= max i ∈ I max z ∈ Z i ( f i ( b | Y i )( z ) ⊖ f i ( a | Y i )( z )) [since all Z i are disjoint]= max i ∈ I || f i ( b | Y i ) ⊖ f i ( a | Y i ) || [by def. of norm] ⊑ max i ∈ I || b | Y i ⊖ a | Y i || [since all f i are non-expansive]= max i ∈ I max y ∈ Y i ( b ( y ) ⊖ a ( y ))= max y ∈ S i ∈ I Y i ( b ( y ) ⊖ a ( y ))= || b ⊖ a || [by def. of norm]Also, the corresponding approximation of a disjoint union can be convenientlyassembled from the approximations of its components. Proposition D.6 (disjoint union and approximations).

The approxima-tions for U i ∈ I f i , where f i : M Y i → M Z i are non-expansive and Z i are pairwisedisjoint, have the following form. For all a : S i ∈ I Y i → M and Y ′ ⊆ S i ∈ I Y i : (cid:0) ] i ∈ I f i (cid:1) a ( Y ′ ) = ] i ∈ I ( f i ) a | Yi ( Y ′ ∩ Y i ) (cid:0) ] i ∈ I f i (cid:1) a ( Y ′ ) = ] i ∈ I ( f i ) a | Yi ( Y ′ ∩ Y i ) Proof.

We just show the statement for the primal case, the dual case is analogous.We abbreviate Y = S i ∈ I Y i .Let 0 ⊏ θ ⊑ δ a . According to Def. B.10 we have for Y ′ ⊆ [ Y ] a : (cid:0) ] i ∈ I f i (cid:1) a,θ ( Y ′ ) = γ U i ∈ I f i ( a ) ,θ ◦ ] i ∈ I f i ◦ α a,θ ( f i ) a | Yi ,θ = γ f i ( a | Yi ) ,θ ◦ f i ◦ α a | Yi ,θ for all i ∈ I . Our ﬁrst step is prove that γ U i ∈ I f i ( a ) ,θ ◦ ] i ∈ I f i ◦ α a,θ ( Y ′ ) = ] i ∈ I γ f i ( a | Yi ) ,θ ◦ f i ◦ α a | Yi ,θ ( Y ′ ∩ Y i )By simply expanding the functions we obtain γ U i ∈ I f i ( a ) ,θ ◦ ] i ∈ I f i ◦ α a,θ ( Y ′ ) = { z ∈ Z i | i ∈ I ∧ θ ⊑ f i (( a ⊕ θ Y ′ ) | Y i )( z ) ⊖ f i ( a | Y i )( z ) } i ∈ I γ f i ( a | Yi ) ,θ ◦ f i ◦ α a | Yi ,θ ( Y ′ ∩ Y i ) = ] i ∈ I { z ∈ Z i | θ ⊑ f i ( a | Y i ⊕ θ Y ′ ∩ Y i )( z ) ⊖ f i ( a | Y i )( z ) } which are the same set, since for all i ∈ I clearly ( a ⊕ θ Y ′ ) | Y i = a | Y i ⊕ θ Y ′ ∩ Y i .This implies (cid:0) ] i ∈ I f i (cid:1) a,θ ( Y ′ ) = ] i ∈ I ( f i ) a | Yi ,θ ( Y ′ ∩ Y i ) . Whenever θ ⊑ min i ∈ I ι f i a , this can be rewritten to (cid:0) ] i ∈ I f i (cid:1) a,θ ( Y ′ ) = ] i ∈ I ( f i ) a | Yi ( Y ′ ∩ Y i ) . All functions (cid:0) U i ∈ I f i (cid:1) a,θ are equal and independent of θ and so it must hold that (cid:0) U i ∈ I f i (cid:1) a,θ = (cid:0) U i ∈ I f i (cid:1) a . Then with Thm. 3.7 we can also conclude min i ∈ I ι f i a ⊑ ι U i ∈ I f i a . Theorem 5.1.

Follows directly from Propositions D.2, D.3, D.4, D.5, D.6 and Lem. B.4.We can also specify the maximal decrease respectively increase that is prop-agated (here we are using the notation of Def. B.10).

Corollary D.7.

Let f : M Y → M Z , a ∈ M Y and ι fa be deﬁned as in Lem. B.10.In the dual view we have ι af = min { ι af ( Y ′ , z ) | Y ′ ⊆ Y ∧ z ∈ Z ∧ ι af ( Y ′ , z ) =0 } ∪ { δ a } , where the set { θ ⊑ δ a | z ∈ f a,θ ( Y ′ ) } has a maximum for each z ∈ [ Z ] f ( a ) and Y ′ ⊆ [ Y ] a , that we denote by ι af ( Y ′ , z ) .We consider the basic functions from Def. D.1, function composition as inLem. B.4 and disjoint union as in Prop. D.5 and give the corresponding valuesfor ι fa and ι af .For greatest ﬁxpoints (primal case) we obtain: – ι c k a = ι u ∗ a = ι max R a = ι av D a = δ a – ι min R a = min z ∈ [ Z ] min R ( a ) { a ( y ) ⊖ a (ˆ y ) | y R z, y / ∈ Min a | R− z ) , ˆ y ∈ Min a | R− z ) } ∪ { δ a } – ι g ◦ fa ⊒ min { ι fa , ι gf ( a ) } – ι U i ∈ I f i a = min i ∈ I ι f i a | Yi For least ﬁxpoints (dual case) we obtain: – ι ac k = ι au ∗ = ι a min R = ι a av D = δ a ι a max R = min z ∈ [ Z ] min R ( a ) { a (ˆ y ) ⊖ a ( y ) | y R z, ˆ y ∈ Max a | R− z ) , y / ∈ Max a | R− z ) } ∪ { δ a } – ι ag ◦ f ⊒ min { ι af , ι f ( a ) g } – ι a U i ∈ I f i = min i ∈ I ι a | Yi f i Proof.

The values ι fa can be obtained by inspecting the proofs of Propositions D.3,D.4 and D.5.It only remains to show that ι := ι U i ∈ I f i a ⊑ min i ∈ I ι f i a | Yi (cf. Prop. D.5),which means showing ι ⊑ ι f i a | Yi for every i ∈ I . We abbreviate ι i := ι f i a | Yi .If ι ⊐ ι i for some i ∈ I , we will ﬁnd a z ∈ [ Z i ] f i ( a ) and Y ′ ⊆ [ Y ] a , such that z ∈ ( f i ) a | Yi ,ι i ( Y ′ ∩ Y i ) = ( f i ) a | Yi ( Y ′ ∩ Y i ) but z / ∈ ( f i ) a | Yi ,ι ( Y ′ ∩ Y i ) by deﬁnition(cf. Lem. B.8). This is a contradiction since z ∈ ] i ∈ I ( f i ) a | Yi ( Y ′ ∩ Y i ) = (cid:0) ] i ∈ I f i (cid:1) a ( Y ′ ) = (cid:0) ] i ∈ I f i (cid:1) a,ι ( Y ′ ) = ] i ∈ I ( f i ) a | Yi ,ι ( Y ′ ∩ Y i )and since z ∈ Z i , z ( f i ) a | Yi ,ι ( Y ′ ∩ Y i ) and cannot be contained in the union.The arguments for the values ι af in the dual case are analogous. E Proofs and Additional Material for § Lemma 6.1.

The function T can be written as T = ( η ∗ ◦ av D ) ⊎ c k where k : T → [0 , is the constant function deﬁned only on terminal states.Proof. Let t : S → [0 , s ∈ T we have(( η ∗ ◦ av D ) ⊎ c k )( t )( s )= c k ( t )( s ) [since s ∈ T ]= k ( s ) = 1 [by deﬁnition of c k and k ]= T ( t )( s ) [since s ∈ T ]For s / ∈ T we have(( η ∗ ◦ av D ) ⊎ c k )( t )( s )= η ∗ ◦ av D ( t )( s ) [since s / ∈ T ]= av D ( t )( η ( s )) [by deﬁnition of reindexing]= X s ′ ∈ S η ( s )( s ′ ) · t ( s ′ ) [by deﬁnition of av D ]= T ( t )( s ) [since s / ∈ T ]44 emma 6.2. Let t : S → [0 , . The approximation for T in the dual sense is T t : [ S ] t → [ S ] T ( t ) with T t ( S ′ ) = { s ∈ [ S ] T ( t ) | s / ∈ T ∧ supp ( η ( s )) ⊆ S ′ } . Proof.

In the following let t : S → [0 ,

1] and S ′ ⊆ [ S ] t . By Lem. 6.1 we knowthat T = ( η ∗ ◦ av D ) ⊎ c k , then by Propositions D.6, D.4, and D.3 we have T t ( S ′ ) = (( η ∗ ◦ av D ) ⊎ c k ) t ( S ′ )= ( η ∗ ◦ av D ) t ( S ′ ) ∪ ( c k ) t ( S ′ )= ( η ∗ ) av D ( t ) ◦ (av D ) t ( S ′ ) ∪ ( c k ) t ( S ′ )= { s ∈ [ S \ T ] η ∗ (av D ( t )) | η ( s ) ∈ { q ∈ [ D ] av D ( t ) | supp ( q ) ⊆ S ′ }} ∪ ∅ = { s ∈ [ S \ T ] η ∗ (av D ( t )) | η ( s ) ∈ [ D ] av D ( t ) ∧ supp ( η ( s )) ⊆ S ′ } Observe that actually for all s ∈ [ S \ T ] η ∗ (av D ( t )) it always holds that η ( s ) ∈ [ D ] av D ( t ) . In fact, since s ∈ [ S \ T ] η ∗ (av D ( t )) we must have that η ∗ (av D ( t ))( s ) =av D ( t )( η ( s )) = 0, and thus η ( s ) ∈ { q ∈ D | av D ( t )( q ) = 0 } = [ D ] av D ( t ) . There-fore, we have that { s ∈ [ S \ T ] η ∗ (av D ( t )) | η ( s ) ∈ [ D ] av D ( t ) ∧ supp ( η ( s )) ⊆ S ′ } = { s ∈ [ S \ T ] η ∗ (av D ( t )) | supp ( η ( s )) ⊆ S ′ } Finally, the set above is the same as { s ∈ [ S ] T ( t ) | s / ∈ T ∧ supp ( η ( s )) ⊆ S ′ } = { s ∈ [ S \ T ] T ( t ) | supp ( η ( s )) ⊆ S ′ } because, for all s ∈ S \ T , hence s / ∈ T , we have that T ( t )( s ) = P s ′ ∈ S η ( s )( s ′ ) · t ( s ′ ) = η ∗ (av D ( t ))( s ), and so [ S \ T ] T ( t ) = [ S \ T ] η ∗ (av D ( t )) . Lemma 6.3. H = min u ◦ max ∈ where max ∈ : M X × X → M X × X ( ∈ ⊆ ( X × X ) × X × X is the “is-element-of ”-relation on X × X ), min u : M X × X → M X × X .Proof. Let for d : X × X → M , X , X ⊆ X . Then we havemin u (max ∈ ( d ))( X , X )= min u ( C )=( X ,X ) (max ∈ ( d ))( C ) = min u ( C )=( X ,X ) max ( x ,x ) ∈ C a ( x , x )which is exactly the deﬁnition of the Hausdorﬀ lifting H ( d )( X , X ) via cou-plings, due to M´emoli [19].We will now consider the approximation of the Hausdorﬀ lifting in the dualsense. Intuitively, given a distance function d and a relation R on X it char-acterises those pairs ( X , X ) ( X , X ⊆ X ) whose distance in the Hausdorﬀmetric decreases by a constant when we decrease the distance d for all pairs in R by the same constant. 45 emma E.1. The approximation for the Hausdorﬀ lifting H in the dual senseis as follows. Let d : X × X → M , then H d : [ X × X ] d → [ X × X ] H ( d ) with H d ( R ) = { ( X , X ) ∈ [ X × X ] H ( d ) |∀ x ∈ X (cid:0) min x ′ ∈ X d ( x , x ′ ) = H ( d )( X , X ) ⇒ ∃ x ∈ X :( x , x ) ∈ R ∧ d ( x , x ) = H ( d )( X , X ) (cid:1) ∧∀ x ∈ X (cid:0) min x ′ ∈ X d ( x ′ , x ) = H ( d )( X , X ) ⇒ ∃ x ∈ X :( x , x ) ∈ R ∧ d ( x , x ) = H ( d )( X , X ) (cid:1) } Proof.

Let d : X × X → M and R ⊆ [ X × X ] d . Then we have: H d ( R ) = (min u ) max ∈ ( d ) ((max ∈ ) d ( R ))where (max ∈ ) d : [ X × X ] d → [ X × X ] max ∈ ( d ) (min u ) max ∈ ( d ) : [ X × X ] max ∈ ( d ) → [ X × X ] H ( d ) We are using the approximations associated to non-expansive functions, givenin Prop. D.3, and obtain: H d ( R )= { ( X , X ) ∈ [ X × X ] H ( d ) | Min max ∈ ( d ) | u − X ,X ∩ (max ∈ ) d ( R ) = ∅} = { ( X , X ) ∈ [ X × X ] H ( d ) | ∃ C ⊆ X × X, u ( C ) = ( X , X ) ,C ∈ (max ∈ ) d ( R ) , max ∈ ( d )( C ) = min u ( C ′ )=( X ,X ) max ∈ ( d )( C ′ ) } = { ( X , X ) ∈ [ X × X ] H ( d ) | ∃ C ⊆ X × X, u ( C ) = ( X , X ) ,C ∈ (max ∈ ) d ( R ) , max d [ C ] = min u ( C ′ )=( X ,X ) max d [ C ′ ] } = { ( X , X ) ∈ [ X × X ] H ( d ) | ∃ C ⊆ X × X, u ( C ) = ( X , X ) , Max d | C ⊆ R, max d [ C ] = H ( d )( X , X ) } We show that this is equivalent to the characterisation in the statement of thelemma. – Assume that for all x ∈ X such that min x ′ ∈ X d ( x , x ′ ) = H ( d )( X , X ),there exists x ∈ X such that ( x , x ) ∈ R and d ( x , x ) = H ( d )( X , X )(and vice versa).We deﬁne a set C m that contains all such pairs ( x , x ), obtained from thisguarantee. Now let x π [ C m ]. Then necessarily min x ′ ∈ X d ( x , x ′ ) < H ( d )( X , X ) (because the minimal distance to an element of X cannotexceed the Hausdorﬀ distance of the two sets). Construct another set C ′ that contains all such ( x , x ) where x is an argument where the minimum46s obtained. Also add elements x π [ C m ] and their corresponding partnersto C ′ .The C = C m ∪ C ′ is a coupling for X , X , i.e., u ( C ) = ( X , X ). Furthermore Max d | C = C m ⊆ R and max d [ C ] = max d [ C m ] = H ( d )( X , X ). – Assume that there exists C ⊆ X × X , u ( C ) = ( X , X ), Max d | C ⊆ R ,max d [ C ] = H ( d )( X , X ).Now let x ∈ X such that min x ′ ∈ X d ( x , x ′ ) = H ( d )( X , X ). Since C isa coupling of X , X , there exists x ∈ X such that ( x , x ) ∈ C ⊆ R . It isleft to show that d ( x , x ) = H ( d )( X , X ), which can be done as follows: H ( d )( X , X ) = min x ′ ∈ X d ( x , x ′ ) ≤ d ( x , x ) ≤ max d [ C ] = H ( d )( X , X ) . For an x ∈ X such that min x ′ ∈ X d ( x ′ , x ) = H ( d )( X , X ) the proof isanalogous. Lemma 6.4.

Let u : VP D → D × D , u ( c ) = ( m Lc , m Rc ) . Then K = min u ◦ av VP D where av VP D : [0 , X × X → [0 , VP D , min u : [0 , VP D → [0 , D × D .Proof. It holds that u − ( p, q ) = Ω ( p, q ) ∩ VP D for p, q ∈ D . Furthermore noteit is suﬃcient to consider as couplings the vertices, i.e., the elements of VP D ,since the minimum is always attained there [23].Hence we obtain for d : X × X → [0 , p, q ∈ D :min u (av VP D ( d ))( p, q ) = min c ∈ Ω ( p,q ) ∩ VP D av VP D ( d )( c )= min c ∈ Ω ( p,q ) ∩ VP D X x ,x ∈ X × X c ( x , x ) · d ( x , x )= min c ∈ Ω ( p,q ) X x ,x ∈ X × X c ( x , x ) · d ( x , x )= K ( d )( p, q )We now present the approximation of the Kantorovich lifting in the dualsense. Intuitively, given a distance function d and a relation M on X it char-acterises those pairs ( p, q ) of distributions whose distance in the Kantorovichmetric decreases by a constant when we decrease the distance d for all pairs in M by the same constant. Lemma E.2.

Let d : X × X → [0 , . The approximation for the Kantorovichlifting K in the dual sense is K d : [ X × X ] d → [ D × D ] K ( d ) with K d ( M ) = { ( p, q ) ∈ [ D × D ] K ( d ) | ∃ c ∈ Ω ( p, q ) , supp ( c ) ⊆ M, X u,v ∈ S d ( u, v ) · c ( u, v ) = K ( d )( p, q ) } . roof. Let d : X × X → [0 ,

1] and M ⊆ [ X × X ] d . Then we have: K d ( M ) = (min u ) av VP D ( d ) ((av VP D ) d ( M ))where (av VP D ) d : [ X × X ] d → [ VP D ] av VP D ( d ) (min u ) av X × X VP D ( d ) : [ VP D ] av VP D ( d ) → [ D × D ] K ( d ) We are using the approximations associated to non-expansive functions, givenin Prop. D.3, and obtain: K d ( M ) = { ( p, q ) ∈ [ D × D ] K ( d ) | Min av X × X VP D ( d ) | u − p,q ) ∩ (av X × X VP D ) d ( M ) = ∅} = { ( p, q ) ∈ [ D × D ] K ( d ) | ∃ c ∈ Ω ( p, q ) , c ∈ (av VP D ) d ( M ) , av VP D ( d )( c ) = min c ′ ∈ Ω ( p,q ) av X × X VP D ( d )( c ′ ) } = { ( p, q ) ∈ [ D × D ] K ( d ) | ∃ c ∈ Ω ( p, q ) , c ∈ (av VP D ) d ( M ) , av VP D ( d )( c ) = K ( d )( p, q ) } = { ( p, q ) ∈ [ D × D ] K ( d ) | ∃ c ∈ Ω ( p, q ) , supp ( c ) ⊆ M, X u,v ∈ S d ( u, v ) · c ( u, v ) = K ( d )( p, q ) } Lemma 6.6.

The ﬁxpoint function M characterizing probabilistic bisimilaritypseudometrics can be written as: M = max ρ ◦ ((( η × η ) ∗ ◦ H ◦ K ) ⊎ c l ) where ρ : ( S × S ) ⊎ ( S × S ) → ( S × S ) with ρ (( s, t ) , i ) = ( s, t ) . Furthermore l : S × S → [0 , is deﬁned as l ( s, t ) = 0 if ℓ ( s ) = ℓ ( t ) and l ( s, t ) = 1 if ℓ ( s ) = ℓ ( t ) .Proof. In fact, given d : S × S → [0 , ρ (((( η × η ) ∗ ◦ H ◦ K ) ⊎ c l )( d ))( s, t )= max { ( η × η ) ∗ ◦ H ◦ K )( d )( s, t ) , c l ( s, t ) } = max {H ( K ( d )( η ( s ) , η ( t )) , c l ( s, t ) } = M ( d )( s, t ) Proposition 6.7.

Let d : S × S → [0 , where d = M ( d ) . Then M d : [ S × S ] d → [ S × S ] d , where [ S × S ] d = { ( s, t ) ∈ S × S | d ( s, t ) > } .Then M is a self-closed relation wrt. d if and only if M ⊆ [ S × S ] d and M is a post-ﬁxpoint of M d . Here we use i ∈ { , } as indices to distinguish the elements in the disjoint union. roof. First note that whenever M is self-closed, it holds that d ( s, t ) > s, t ) ∈ M and hence M ⊆ [ S × S ] d .We abbreviate g = ( η × η ) ∗ ◦ H ◦ K : [0 , S × S → [0 , S × S and hence M =max ρ ◦ ( g ⊎ c l ).The approximation g d (yet to be determined) is of type g d : [ S × S ] d → [ S × S ] g ( d ) . In the following, we are using the approximations associated to non-expansive functions, given in Prop. D.3.Since c l : [0 , S × S → [0 , S × S is a constant function, we have( c l ) d : [ S × S ] d → [ S × S ] l , ( c l ) d ( M ) = ∅ . Hence ( g ⊎ c l ) d : [ S × S ] d → [ S × S ] g ( d ) ⊎ [ S × S ] l ( g ⊎ c l ) d ( M ) = g d ( M ) × { } ∪ ∅ × { } = g d ( M ) × { } for M ⊆ [ S × S ] d .Furthermore we obtain M d ( M ) = (max ρ ) g ( d ) ⊎ l (( g ⊎ c l ) d ( M ))= (max ρ ) g ( d ) ⊎ l ( g d ( M ) × { } )= { ( s, t ) ∈ [ S × S ] M ( d ) | Max ( g ( d ) ⊎ c l ) | ρ − { ( s,t ) } ) ⊆ g d ( M ) × { }} In order to proceed, we examine ρ − ( { ( s, t ) } = { (( s, t ) , , (( s, t ) , } . Whenever ℓ ( s ) = ℓ ( t ), we have c l ( s, t ) = 1 ≥ g ( d )( s, t ), hence Max ( g ( d ) ⊎ l ) | ρ − { ( s,t ) } ) containsat least (( s, t ) , g d ( M ) × { } , which means that thecondition is not satisﬁed.Whenever ℓ ( s ) = ℓ ( t ), we have c l ( s, t ) = 0 < g ( d )( s, t ) (note that ( s, t ) ∈ [ S × S ] M ( d ) , hence g ( d )( s, t ) > Max ( g ( d ) ⊎ l ) | ρ − { ( s,t ) } ) contains only (( s, t ) , g d ( M ) × { } iﬀ ( s, t ) ∈ g d ( M ).Summarizing, we obtain M d ( M ) = { ( s, t ) ∈ [ S × S ] M ( d ) | ℓ ( s ) = ℓ ( t ) , ( s, t ) ∈ g d ( M ) } = { ( s, t ) ∈ S × S | d ( s, t ) > , ℓ ( s ) = ℓ ( t ) , ( s, t ) ∈ g d ( M ) } For the last step observe that d = M ( d ).It is left to characterise g d , where g = ( η × η ) ∗ ◦ H ◦ K . We have g d = (( η × η ) ∗ ) H ( K ( d )) ◦ H K ( d ) ◦ K d where K d : [ S × S ] d → [ D × D ] K ( d ) H K ( d ) : [ D × D ] K ( d ) → [ D × D ] H ( K ( d )) η × η ) ∗ ) H ( K ( d )) : [ D × D ] H ( K ( d )) → [ S × S ] g ( d ) . It holds that (( η × η ) ∗ ) H ( K ( d )) = ( η × η ) − and hence( s, t ) ∈ g d ( M ) ⇐⇒ ( η ( s ) , η ( t )) ∈ H K ( d ) ( K d ( M )) . Using the characterisation of the associated approximation of the Hausdorﬀ lift-ing in Lem. E.1, we obtain that this is equivalent tofor all p ∈ η ( s ), whenever min q ′ ∈ η ( t ) K ( d )( p, q ′ ) = H ( K ( d ))( η ( s ) , η ( t )), thenthere exists q ∈ η ( t ) such that ( p, q ) ∈ K d ( M ) and K ( d )( p, q ) = H ( K ( d ))( η ( s ) , η ( t )) (and vice versa),assuming that ℓ ( s ) = ℓ ( t ) (this is a requirement in the deﬁnition of M d ( M )),since then we have H ( K ( d ))( η ( s ) , η ( t )) = d ( s, t ) > η ( s ) , η ( t )) ∈ [ D × D ] H ( K ( d )) .Since also d = M ( d ), the condition above can be rewritten tofor all p ∈ η ( s ), whenever min q ′ ∈ η ( t ) K ( d )( p, q ′ ) = d ( s, t ), then there exists q ∈ η ( t ) such that ( p, q ) ∈ K d ( M ) and K ( d )( p, q ) = d ( s, t ) (and vice versa).From Lem. E.2 we know that ( p, q ) ∈ K d ( M ) iﬀ K ( d )( p, q ) > c ∈ Ω ( p, q ) such that supp ( c ) ⊆ M and P u,v ∈ S c ( u, v ) · d ( u, v ) = K ( d )( p, q ). Weinstantiate the condition above accordingly and obtainfor all p ∈ η ( s ), whenever d ( s, t ) = min q ′ ∈ η ( t ) K ( d )( p, q ′ ), then there exists q ∈ η ( t ) such that there exists c ∈ Ω ( p, q ) with supp ( c ) ⊆ M , K ( d )( p, q ) = P u,v ∈ S c ( u, v ) · d ( u, v ) and K ( d )( p, q ) = d ( s, t ) (and vice versa).The two last equalities can be simpliﬁed to d ( s, t ) = P u,v ∈ S c ( u, v ) · d ( u, v ), since K ( d )( p, q ) ≤ X u,v ∈ S c ( u, v ) · d ( u, v ) = d ( s, t ) = min q ′ ∈ η ( t ) K ( d )( p, q ′ ) ≤ K ( d )( p, q )and hence K ( d )( p, q ) = d ( s, t ) can be inferred from the remaining conditions.We ﬁnally obtain the following equivalent characterisation:for all p ∈ η ( s ), whenever d ( s, t ) = min q ′ ∈ η ( t ) K ( d )( p, q ′ ), then there exists q ∈ η ( t ) such that there exists c ∈ Ω ( p, q ) with supp ( c ) ⊆ M , d ( s, t ) = P u,v ∈ S c ( u, v ) · d ( u, v ) (and vice versa).Hence we obtain that ( s, t ) ∈ g d ( M ) is equivalent to the the second and thirditem of Def. 6.5 (under the assumption that ℓ ( s ) = ℓ ( t )), while the ﬁrst item iscovered by the other conditions ( d ( s, t ) > ℓ ( s ) = ℓ ( t )) in the characterisa-tion of M d ( M ). Lemma E.3.

The approximation for the adapted Hausdorﬀ lifting G in the pri-mal sense is as follows. Let d : X × X → { , } , then G d : [ X × X ] d → [ X × X ] d with G d ( R ) = { ( X , X ) ∈ [ X × X ] H ( d ) | x ∈ X ∃ x ∈ X : (cid:0) ( x , x ) [ X × X ] d ∨ ( x , x ) ∈ R (cid:1) ∧ ∀ x ∈ X ∃ x ∈ X : (cid:0) ( x , x ) [ X × X ] d ∨ ( x , x ) ∈ R (cid:1) } Proof.

We rely on the characterisation of H d (dual case) of Lem. E.1 and weexamine the case where M = { , } . In this case, whenever we have ( X , X ) ∈ [ X × X ] H ( d ) it must necessarily hold that H ( d )( X , X ) = 1. Hence, the ﬁrstpart of the conjunction simpliﬁes to: ∀ x ∈ X (cid:0) min x ′ ∈ X d ( x , x ′ ) = 1 ⇒ ∃ x ∈ X : ( x , x ) ∈ R ∧ d ( x , x ) = 1 (cid:1) , from which we can omit d ( x , x ) = 1 from the conclusion, since this holdsautomatically. Furthermore min x ′ ∈ X d ( x , x ′ ) = 1 can be rewritten to ∀ x ∈ X : d ( x , x ) = 1. This gives us: ∀ x ∈ X (cid:0) ¬∀ x ∈ X : d ( x , x ) = 1 ∨ ∃ x ∈ X : ( x , x ) ∈ R (cid:1) ≡ ∀ x ∈ X (cid:0) ∃ x ∈ X : d ( x , x ) = 0 ∨ ∃ x ∈ X : ( x , x ) ∈ R (cid:1) ≡ ∀ x ∈ X ∃ x ∈ X (cid:0) ( x , x ) [ X × X ] d ∨ ( x , x ) ∈ R (cid:1) . Since this characterisation is independent of the order, we can replace [ X × X ] d by [ X × X ] d and obtain a characterizing condition for G d (primal case). Lemma 6.8.

Bisimilarity on η is the greatest ﬁxpoint of B = ( η × η ) ∗ ◦ G .Proof. Let for d : X × X → { , } , x, y ∈ X . Then we have( η × η ) ∗ ◦ G ( d )( x, y ) = G ( d )( η ( x ) , η ( y ))= max u (min ∈ ( d ))( η ( x ) , η ( y ))= max u ( C )=( η ( x ) ,η ( y )) (min X × X ∈ ( d ))( C )= max u ( C )=( η ( x ) ,η ( y )) min ( x ′ ,y ′ ) ∈ C d ( x ′ , y ′ )Now we prove that this, indeed, corresponds with the standard bisimulationfunction, i.e. max u ( C )=( η ( x ) ,η ( y )) min ( x ′ ,y ′ ) ∈ C d ( x ′ , y ′ ) = 1 if and only if for all x ′ ∈ η ( x ) there exists y ′ ∈ η ( y ) such that d ( x ′ , y ′ ) = 1 and vice versa. Forthe ﬁrst implication, assume that max u ( C )=( η ( x ) ,η ( y )) min ( x ′ ,y ′ ) ∈ C d ( x ′ , y ′ ) = 1.This means that there exists C ⊆ X × X such that u ( C ) = ( π ( C ) , π ( C )) =( η ( x ) , η ( y )) and min ( x ′ ,y ′ ) ∈ C d ( x ′ , y ′ ) = 1. Then we have two cases. Either C = ∅ ,which means that η ( x ) = η ( y ) = ∅ , that is, x and y have no successors, and sothe bisimulation property vacuously holds. Otherwise, C = ∅ , and we must have d ( x ′ , y ′ ) = 1 for all ( x ′ , y ′ ) ∈ C . Then, since ( π ( C ) , π ( C )) = ( η ( x ) , η ( y )),for all x ′ ∈ η ( x ) there must exists y ′ ∈ η ( y ) such that ( x ′ , y ′ ) ∈ C , and thus d ( x ′ , y ′ ) = 1. Vice versa, for all y ′ ∈ η ( y ) there must exists x ′ ∈ η ( x ) such that( x ′ , y ′ ) ∈ C , and thus d ( x ′ , y ′ ) = 1. So the bisimulation property holds.For the other implication, assume that for all x ′ ∈ η ( x ) there exists y ′ ∈ η ( y ) such that d ( x ′ , y ′ ) = 1 and call c ( x ′ ) such a y ′ . Vice versa, assume also51hat for all y ′ ∈ η ( y ) there exists x ′ ∈ η ( x ) such that d ( x ′ , y ′ ) = 1 and call c ( y ′ ) such a x ′ . This means that for all x ′ ∈ η ( x ) and y ′ ∈ η ( y ), we have d ( x ′ , c ( x ′ )) = d ( c ( y ′ ) , y ′ ) = 1. Now let C ′ = { ( x ′ , y ′ ) ∈ η ( x ) × η ( y ) | c ( x ′ ) = y ′ ∨ x ′ = c ( y ′ ) } . Since we assumed that for all x ′ ∈ η ( x ) there exists y ′ ∈ η ( y )such that c ( x ′ ) = y ′ , we must have that π ( C ′ ) = η ( x ). The same holds for all y ′ ∈ η ( y ), thus π ( C ′ ) = η ( y ). Therefore, we know that u ( C ′ ) = ( η ( x ) , η ( y )),and we can conclude by showing that d ( x ′ , y ′ ) = 1 for all ( x ′ , y ′ ) ∈ C ′ , in whichcase also max u ( C )=( η ( x ) ,η ( y )) min ( x ′ ,y ′ ) ∈ C d ( x ′ , y ′ ) = 1. By deﬁnition of C ′ either c ( x ′ ) = y ′ or x ′ = c ( y ′ ), or both, must hold. Assume the ﬁrst one holds, theother case is similar. Then, we can immediately conclude since by hypothesis weknow that d ( x ′ , c ( x ′ )) = 1.Since we proved that the function B is the same of the standard bisimulationfunction, then its greatest ﬁxpoint ν B is the bisimilarity on η . Lemma 6.9.

From Lem. E.1 we know that G d : [ X × X ] d → [ X × X ] G ( d ) G d ( R ) = { ( X , X ) ∈ [ X × X ] G ( d ) |∀ x ∈ X ∃ x ∈ X : (cid:0) ( x , x ) [ X × X ] d ∨ ( x , x ) ∈ R (cid:1) ∧ ∀ x ∈ X ∃ x ∈ X : (cid:0) ( x , x ) [ X × X ] d ∨ ( x , x ) ∈ R (cid:1) } . Furthermore (( η × η ) ∗ ) G ( d ) : [ X × X ] G ( d ) → [ X × X ] B ( d ) (( η × η ) ∗ ) G ( d ) ( R ) = ( η × η ) − ( R )Composing these functions we obtain: B d : [ X × X ] d → [ X × X ] B ( d ) B d ( R ) = ( η × η ) − ( { ( Y , Y ) ∈ [ X × X ] G ( d ) |∀ y ∈ Y ∃ y ∈ Y : (cid:0) ( y , y ) [ X × X ] d ∨ ( y , y ) ∈ R (cid:1) ∧ ∀ y ∈ Y ∃ y ∈ Y : (cid:0) ( y , y ) [ X × X ] d ∨ ( y , y ) ∈ R (cid:1) } )= { ( x , x ) ∈ [ X × X ] B ( d ) |∀ y ∈ η ( x ) ∃ y ∈ η ( x ) : (cid:0) ( y , y ) [ X × X ] d ∨ ( y , y ) ∈ R (cid:1) ∧ ∀ y ∈ η ( x ) ∃ y ∈ η ( x ) : (cid:0) ( y , y ) [ X × X ] d ∨ ( y , y ) ∈ R (cid:1) } . Proofs and Additional Material for § Lemma 7.2. V = ( η ∗ min ◦ min ∈ ) ⊎ ( η ∗ max ◦ max ∈ ) ⊎ ( η ∗ av ◦ av D ) ⊎ c w , where ∈ ⊆ V × V is the “is-element-of ”-relation on V .Proof. Let a : V → [0 , v ∈ MAX we have V ( a )( v ) = ( η ∗ max ◦ max ∈ )( a )( v ) = max ∈ ( a )( η max ( v )) = max v ′ ∈ η max ( v ) a ( v ′ ) . For v ∈ MIN we have V ( a )( v ) = ( η ∗ min ◦ min ∈ )( a )( v ) = min ∈ ( a )( η min ( v )) = min v ′ ∈ η min ( v ) a ( v ′ ) . For v ∈ AV we have V ( a )( v ) = ( η ∗ av ◦ av D )( a )( v ) = av D ( a )( η av ( v )) = X v ′ ∈ V η av ( v )( v ′ ) · a ( v ′ ) . For v ∈ SINK we have V ( a )( v ) = c w ( a )( v ) = w ( v ). Lemma 7.3.

Let a : V → [0 ,

1] and V ′ ⊆ [ V ] a . By Proposition D.6 we have: V a ( V ′ ) = (cid:0) MIN ∩ ( η ∗ min ◦ min ∈ ) a ( V ′ ) (cid:1) ∪ (cid:0) MAX ∩ ( η ∗ max ◦ max ∈ ) a ( V ′ ) (cid:1) ∪ (cid:0) AV ∩ ( η ∗ av ◦ av D ) a ( V ′ ) (cid:1) ∪ (cid:0) SINK ∩ ( c w ) a ( V ′ ) (cid:1) It holds that ( η ∗ min ) min ∈ ( v ) = η − , ( η ∗ max ) max ∈ ( v ) = η − and ( η ∗ av ) av D ( v ) = η − .Using previous results (Prop. D.3) we deduce v ∈ ( η ∗ min ◦ min ∈ ) a ( V ′ ) ⇔ η min ( v ) ∈ (min ∈ ) a ( V ′ ) ⇔ Min a | η min( v ) ∩ V ′ = ∅ v ∈ ( η ∗ max ◦ max ∈ ) a ( V ′ ) ⇔ η max ( v ) ∈ (max ∈ ) a ( V ′ ) ⇔ Max a | η max( v ) ⊆ V ′ v ∈ ( η ∗ av ◦ av VD ) a ( V ′ ) ⇔ η av ( v ) ∈ (av VD ) a ( V ′ ) ⇔ supp ( η av ( v )) ⊆ V ′ Lastly ( c w ) a ( V ′ ) = ∅ for any V ′ ⊆ V since c w is a constant function whichconcludes the proof. Lemma F.1.

For any pair of strategies σ, τ we have V σ ≤ V ≤ V τ . roof. Given any a : V → [0 ,

1] and v ∈ V , we have V σ ( a )( v ) =  min v ′ ∈ η min ( v ) a ( v ′ ) v ∈ MIN a ( σ ( v ′ )) v ∈ MAX P v ′ ∈ V a ( v ′ ) · η av ( v )( v ′ ) v ∈ AV c w ( v ) v ∈ SINK ≤  min v ′ ∈ η min ( v ) a ( v ′ ) v ∈ MIN max v ′ ∈ η max ( v ) a ( v ′ ) v ∈ MAX P v ′ ∈ V a ( v ′ ) · η av ( v )( v ′ ) v ∈ AV c w ( v ) v ∈ SINK = V ( a )( v )The same proof idea can be applied to show V ≤ V τ . Lemma 7.4.

The least ﬁxpoints of V τ and V σ can be determined by solvinglinear programs.Proof. We adapt the linear programs found in the literature on simple stochasticgames (see e.g. [10]).The least ﬁxpoint a = µ V τ can be determined by solving the following linearprogram:min X v ∈ V a ( v ) a ( v ) = a ( τ ( v )) v ∈ MIN a ( v ) ≥ a ( v ′ ) ∀ v ′ ∈ η max ( v ) , v ∈ MAX a ( v ) = X v ′ ∈ V a ( v ′ ) · η av ( v )( v ′ ) v ∈ AV a ( v ) = w ( v ) v ∈ SINK

By having a ( v ) ≥ a ( v ′ ) for all v ′ ∈ η max ( v ) and v ∈ MAX we guarantee a ( v ) = max v ′ ∈ η max ( v ) a ( i ) ( v ′ ) since we minimise. The minimisation also guar-antees computation of the least ﬁxpoint (in particular, nodes that lie on a cyclewill get a value of 0). Hence, the linear program correctly characterises µ V τ .Given a strategy σ for Max, we can determine a = µ V σ by solving thefollowing linear program:max X v ∈ V a ( v ) a ( v ) = 0 v ∈ C σ a ( v ) ≤ a ( v ′ ) ∀ v ′ ∈ η min ( v ) , v ∈ MIN , v C σ a ( v ) = a ( σ ( v )) v ∈ MAX , v C σ ( v ) = X v ′ ∈ V a ( v ′ ) · η av ( v )( v ′ ) v ∈ AV , v C σ a ( v ) = w ( v ) v ∈ SINK

The set C σ contains those nodes which will guarantee a non-terminating play ifMin plays optimally, given the ﬁxed Max-strategy σ .The set C σ can again be computed via ﬁxpoint-iteration by computing thegreatest ﬁxpoint of c σ via Kleene iteration on V from above: c σ : V → V c σ ( V ′ ) = { v ∈ V | ( v ∈ MIN ∧ η min ( v ) ∩ V ′ = ∅ ) ∨ ( v ∈ MAX ∧ σ ( v ) ∈ V ′ ) ∨ ( v ∈ AV ∧ supp ( η av ( v )) ⊆ V ′ ) } It is easy to see that C σ = νc σ contains all those nodes from which Min canforce a non-terminating play and hence achieve payoﬀ 0. (Note that there arefurther nodes that guarantee payoﬀ 0 – namely sinks with that payoﬀ and nodeswhich can reach such sinks – but those will obtain value 0 in any case.)We now show that this linear program computes µ V σ : ﬁrst, by requiring a ( v ) ≤ a ( v ′ ) for all v ∈ MIN , v ′ ∈ η min ( v ), we guarantee a ( v ) = min v ′ ∈ η min a ( v ′ )since we maximise. Hence we obtain the greatest ﬁxpoint of the following func-tion V ′ σ : [0 , V → [0 , V : V ′ σ ( a )( v ) =  v ∈ C σ P v ′ ∈ V a ( v ′ ) · η av ( v )( v ′ ) v ∈ AV , v C σ a ( σ ( v )) v ∈ MAX , v C σ min v ′ ∈ η min ( v ) a ( v ′ ) v ∈ MIN , v C σ w ( v ) v ∈ SINK

It is easy to show that the least ﬁxpoints of V ′ σ and V σ agree, i.e., µ V ′ σ and µ V σ : – µ V ′ σ ≤ µ V σ can be shown by observing that V ′ σ ≤ V σ . – µ V σ ≤ µ V ′ σ can be shown by proving that µ V ′ σ is a pre-ﬁxpoint of V σ , whichcan be done via a straightforward case analysis.We have to show V σ ( µ V ′ σ )( v ) ≤ µ V ′ σ ( v ) for all v ∈ V . We only spell out thecase where v ∈ AV , the other cases are similar. In this case either v C σ ,which means that V σ ( µ V ′ σ )( v ) = V ′ σ ( µ V ′ σ )( v ) = µ V ′ σ ( v ) . If instead v ∈ C σ , we have that supp ( η av ( v )) ⊆ C σ and so µ V ′ σ ( v ′ ) = 0 forall v ′ ∈ supp ( η av ( v )). Hence V σ ( µ V ′ σ )( v ) = X v ′ ∈ V η av ( v )( v ′ ) · µ V ′ σ ( v ′ ) = 0 = µ V ′ σ ( v )55f we can now show that V ′ σ has a unique ﬁxpoint, we are done. The argumentfor this goes as follows: assume that this function has another ﬁxpoint a ′ diﬀerentfrom µ V ′ σ . Clearly [ V ] a ′ ∩ C σ = ∅ , where [ V ] a ′ = { v ∈ V | a ′ ( v ) = 0 } . Hence, ifwe compare ( V ′ σ ) a : [ V ] a → [ V ] V′ σ ( a ) (deﬁned analogously to Lem. 7.3) and c σ above, we observe that ( V ′ σ ) a ′ ⊆ c σ | [ V ] a ′ . (Both functions coincide, apart fromtheir treatment of nodes v ∈ MIN , where c σ ( V ′ ) contains v whenever one of itssuccessors is contained in V ′ , whereas ( V ′ σ ) a ′ ( V ′ ) additionally requires that thevalue of this successor is minimal.) Since a ′ is not the least ﬁxpoint we have byThm. 4.1 that ∅ 6 = ν ( V ′ σ ) a ′ ⊆ ν ( c σ | [ V ] a ′ ) ⊆ νc σ = C σ . This is a contradiction, since [ V ] a ′ ∩ C σ = ∅ as observed above.This shows that V ′ σ has a unique ﬁxpoint and completes the proof. Note thatif we do not explicitly require that the values of all nodes in C σ are 0, V ′ σ willpotentially have several ﬁxpoints and the linear program would not characterisethe least ﬁxpoint. Theorem 7.5.

Strategy iteration from above and below both terminate and com-pute the least ﬁxpoint of V .Proof.Strategy iteration from above: We start by showing the following: Given any a ( i ) and a new switched Min-strategy τ ( i +1) , i.e., τ ( i +1) = sw min ( τ ( i ) , a ( i ) ), then a ( i ) is a pre-ﬁxpoint of V τ ( i +1) .By choice of τ ( i +1) we have V τ ( i +1) ( a ( i ) )( v ) =  a ( i ) ( τ ( i +1) ( v )) v ∈ MIN max v ′ ∈ η max ( v ) a ( i ) ( v ′ ) v ∈ MAX P v ′ ∈ V a ( i ) ( v ′ ) · η av ( v )( v ′ ) v ∈ AV w ( v ) v ∈ SINK =  min v ′ ∈ η min ( v ) a ( i ) ( v ′ ) v ∈ MIN max v ′ ∈ η max ( v ) a ( i ) ( v ′ ) v ∈ MAX P v ′ ∈ V a ( i ) ( v ′ ) · η av ( v )( v ′ ) v ∈ AV w ( v ) v ∈ SINK = V ( a ( i ) )( v )By Thm. F.1 V ≤ V τ ( i ) holds and since a ( i ) is a ﬁxpoint of V τ ( i ) we conclude V τ ( i +1) ( a ( i ) )( v ) = V ( a ( i ) )( v ) ≤ V τ ( i ) ( a ( i ) )( v ) = a ( i ) ( v )Thus we have a ( i +1) ≤ a ( i ) (by Knaster-Tarski, since a ( i ) is a pre-ﬁxpoint of V τ ( i +1) and a ( i +1) is its least ﬁxpoint). Furthermore we know that a ( i ) is not aﬁxpoint of V τ ( i +1) (otherwise we could not have performed a switch) and hence a ( i +1) is strictly smaller than a ( i ) for at least one input. Since there are only56nitely many strategies we will eventually stop switching and reach a ﬁxpoint a = a ( j ) for an index j .Then, if V ′ = ν V a = ∅ then a is the least ﬁxpoint and we conclude.Otherwise, we determine a ( j +1) = a − ( ι V a ) V ′ . By Lem. 4.2 (dual version), a ( j +1) is a pre-ﬁxpoint of V . Now Min will choose her best strategy τ = τ ( j +2) = sw min ( τ ( i ) , a ( i +1) ) and we continue computing a ( j +2) = µ V τ ( j +2) . First, observethat since a ( j +1) is a pre-ﬁxpoint of V , it is also a pre-ﬁxpoint of V τ ( j +2) . In fact, V and V τ ( j +1) coincide on all nodes v MIN . If v ∈ MIN , we have V τ ( j +2) ( a ( j +1) )( v ) = a ( j +1) ( τ ( j +2) ( v ))= min v ′ ∈ η min ( v ) a ( j +1) ( v ′ ) = V ( a ( j +1) )( v ) ≤ a ( j +1) ( v ) . Hence it follows by Knaster-Tarski that a ( j +2) = µ V τ ( j +2) ≤ a ( j +1) . In turn, a ( j +1) < a ( j ) since V ′ is non-empty and hence also a ( j +2) < a ( j ) (where < ontuples means means ≤ in all components and < in at least one component.)This means that the chain a ( i ) is strictly descending. Hence, at each iterationwe obtain a new strategy and, since the number of strategies is ﬁnite, the iterationwill eventually stop.Hence the algorithm terminates and stops at the least ﬁxpoint of V . Strategy iteration from below:

We start as follows: Assume a is the least ﬁxpoint of V σ , i.e. a = µ V σ and σ ′ the new best strategy for Max obtained by switching with respect to a , i.e., σ ′ = sw max ( σ, a ). We have to show that a ′ = µ V σ ′ lies above a ( a ′ ≥ a ). Herewe use our proof rules (see Thm. 4.3) and show the following: – First, observe that a is a post-ﬁxpoint of V σ ′ . For any v ∈ V we have a ( v ) = V σ ( a )( v ) =  min v ′ ∈ η min ( v ) a ( v ′ ) v ∈ MIN a ( σ ( v )) v ∈ MAX P v ′ ∈ V a ( v ′ ) · η av ( v )( v ′ ) v ∈ AV w ( v ) v ∈ SINK ≤  min v ′ ∈ η min ( v ) a ( v ′ ) v ∈ MIN max v ′ ∈ η max ( v ) a ( v ′ ) v ∈ MAX P v ′ ∈ V a ( v ′ ) · η av ( v )( v ′ ) v ∈ AV w ( v ) v ∈ SINK =  min v ′ ∈ η min ( v ) a ( v ′ ) v ∈ MIN a ( σ ′ ( v )) v ∈ MAX P v ′ ∈ V a ( v ′ ) · η av ( v )( v ′ ) v ∈ AV w ( v ) v ∈ SINK = V σ ′ max ( a )( v ) – Next we show that ν ( V σ ′ ) a ∗ = ∅ , thus proving that a ≤ µ V σ ′ = a ′ byThm. 4.3. Note that ( V σ ′ ) a ∗ : [ V ] a = V σ ′ ( a ) → [ V ] a = V σ ′ ( a ) , i.e., it restricts tothose elements of a where a and V σ ′ ( a ) coincide.57henever v ∈ MAX is a node where the strategy has been “switched” withrespect to a , we have V σ ′ ( a )( v ) = a ( σ ′ ( v )) > a ( σ ( v )) = a ( v ) . The ﬁrst equality above is true by the deﬁnition of V σ ′ and the last equalityholds since a is a ﬁxpoint of V σ . So if v is a switch node, it holds that v [ V ] a = V σ ( a ) . By contraposition if v ∈ [ V ] a = V σ ( a ) , v cannot be a switchnode.We next show that ( V σ ) a ∗ , ( V σ ′ ) a ∗ agree on [ V ] a = V σ ′ ( a ) ⊆ [ V ] a = [ V ] a = V σ ( a ) (remember that a is a ﬁxpoint of V σ ). It holds that( V σ ) a ∗ ( V ′ ) = γ V σ ( a ) ,ι ( V σ ( α a,ι ( V ′ )))( V σ ′ ) a ∗ ( V ′ ) = γ V σ ′ ( a ) ,ι ( V σ ′ ( α a,ι ( V ′ ))) ∩ [ V ] a = V σ ′ ( a ) for a suitable constant ι and if we choose ι small enough we can use thesame constant in both cases. Now let v ∈ [ V ] a = V σ ′ ( a ) : by deﬁnition it holdsthat v ∈ ( V σ ) a ∗ ( V ′ ) = γ V σ ( a ) ,ι ( V σ ( α a,ι ( V ′ ))) if and only if V σ ( α a,ι ( V ′ ))( v ) ⊖V σ ( a )( v ) ≥ ι . Since, by the considerations above, v is not a switch node, V σ ( b )( v ) = V σ ′ ( b )( v ) for all b and we can replace V σ by V σ ′ , resulting inthe equivalent statement v ∈ γ V σ ′ ( a ) ,ι ( V σ ′ ( α a,ι ( V ′ ))), also equivalent to v ∈ ( V σ ′ ) a ∗ ( V ′ ).Thus ν ( V σ ′ ) a ∗ ⊆ ν ( V σ ) a ∗ = ∅ .Hence we obtain an ascending sequence a ( i ) . Furthermore, whenever we per-form a switch, we know that a ( i ) is not a ﬁxpoint of V σ ( i +1) (otherwise we couldnot have performed a switch) and hence a ( i +1) is strictly larger than a ( i ) for atleast one input. Since there are only ﬁnitely many strategies we will eventuallystop switching and reach the least ﬁxpoint. Runtime results