Beating Two-Thirds For Random-Order Streaming Matching

Abstract

We study the maximum matching problem in the random-order semi-streaming setting. In this problem, the edges of an arbitrary n-vertex graph G=(V, E) arrive in a stream one by one and in a random order. The goal is to have a single pass over the stream, use n \cdot poly(\log n) space, and output a large matching of G. We prove that for an absolute constant \epsilon_0 > 0, one can find a (2/3 + \epsilon_0)-approximate maximum matching of G using O(n \log n) space with high probability. This breaks the natural boundary of 2/3 for this problem prevalent in the prior work and resolves an open problem of Bernstein [ICALP'20] on whether a (2/3 + \Omega(1))-approximation is achievable.


Sepehr Assadi ∗ Soheil Behnezhad †

∗ Department of Computer Science, Rutgers University. Research supported in part by the NSF CAREER award CCF-2047061.

† Department of Computer Science, University of Maryland. Research supported by Google Ph.D. Fellowship.

arXiv [cs.DS], Feb.

Introduction

A matching in a graph $G = (V, E)$ is any collection of vertex-disjoint edges and in the maximum matching problem, we are interested in finding a matching of largest size in $G$. This problem has been a cornerstone of algorithmic research and its study has led to numerous breakthrough results in theoretical computer science. In this paper, we study the maximum matching problem in the semi-streaming model of computation [FKM+05] defined as follows.

Definition 1.1. Given a graph $G = (V, E)$ with $n$ vertices $V = \{1, \ldots, n\}$ and $m$ edges in $E$ presented in a stream $S = \langle e_1, \ldots, e_m \rangle$, a semi-streaming algorithm makes a single pass over the stream of edges $S$ and uses $O(n \cdot \mathrm{polylog}(n))$ space, measured in words of size $\Theta(\log n)$ bits, and at the end outputs an approximate maximum matching of $G$.

The greedy algorithm for maximal matching gives a simple $1/2$-approximation algorithm to this problem in $O(n)$ space. When the stream of edges is adversarially ordered, this is simply the best result known for this problem, while it is also known that a better than $\sim$[…] […] random order streams. This line of work was pioneered in [KMM12] who showed that the $1/2$-approximation of greedy can be broken in this case and obtained an algorithm with approximation ratio $(1/2 + 0.[\ldots])$. […, FHM+20] followed up on the approach of [KMM12] and improved the approximation ratio all the way to $6/11$ [FHM+20]. […] [ABB+19] built on the sparsification approach of [BS15, BS16] in dynamic graphs to achieve an (almost) $2/3$-approximation but at the cost of $\tilde{O}(n^{1.5})$ space, which is no longer semi-streaming. A beautiful work of [Ber20] then obtained a semi-streaming (almost) $2/3$-approximation by showing how a generalization of the sparsification approach in [ABB+19] can be found in $\tilde{O}(n)$ space.

The $2/3$-approximation ratio of the algorithm of [Ber20] is the best possible among all prior techniques for this problem: the first line of attack in [KMM12, Kon18, GKMS19, FHM+20] is based on finding length-3 augmenting paths and even finding all these paths does not lead to a better-than-$2/3$-approximation (see the footnote below). The second line in [ABB+19, Ber20] is based on finding an edge-degree constrained subgraph (EDCS) which hits the same exact barrier as there are graphs whose EDCS does not provide a better than $2/3$-approximation (see [BS15]). Finally, even for an algorithmically easier variant of this problem, the one-way communication problem, which roughly corresponds to only measuring the space of the algorithm when crossing the midpoint of the stream, the best known approximation ratio is still $2/3$ which is known to be tight for adversarial orders/partitions [GKK12].

(Footnote: The work of [FHM+20] also considers length-5 augmenting paths. However, these paths are used instead of length-3 paths "missed" by the algorithm, not in addition to length-3 paths, and thus the same shortcoming persists.)

Given this state of affairs, the $2/3$-approximation ratio for random-order streaming matching has emerged as a natural barrier [Kon18, Ber20]. In particular, [Ber20] posed obtaining a $(2/3 + \Omega(1))$-approximation to this problem as an important open question. We resolve this question in the affirmative in our work.

Our main result is a semi-streaming algorithm for maximum matching in random-order streams with approximation ratio strictly better than $2/3$.

Theorem 1 (Main Result). Let $G$ be an $n$-vertex graph whose edges arrive in a random-order stream. For an absolute constant $\varepsilon_0 > 0$, there is a single-pass streaming algorithm that obtains a $(2/3 + \varepsilon_0)$-approximate maximum matching of $G$ using $O(n \log n)$ space with high probability.

Theorem 1 breaks the $2/3$-barrier of all prior work in [KMM12, Kon18, GKMS19, ABB+19, FHM+20, …]. Although the improvement over $2/3$ is minuscule in this theorem (while we did not optimize for constants, the bound on $\varepsilon_0$ is only $\sim [\ldots]$ at this point), it still proves that $(2/3)$-approximation is not the "right" answer to this problem. This is in contrast to some other problems of similar flavor such as one-way communication complexity of matching (on adversarial partitions) [GKK12, AB19] or the fault-tolerant matching problem [AB19] which are both solved using similar techniques (see the unifying framework of [AB19] based on EDCS) and for both $2/3$-approximation is provably best possible.

Beyond $(2/3)$-approximation. Breaking this $2/3$-barrier naturally raises the question of what the right bound on the approximation ratio of random-order streaming matching is. In particular, is a $(1 - \varepsilon)$-approximation possible? We make progress toward settling this question by showing that no "truly" space-efficient algorithm exists for this latter problem: there is provably no semi-streaming matching algorithm even on bipartite graphs that can achieve a $(1 - \varepsilon)$-approximation in $O(\exp((1/\varepsilon)^{[\ldots]}) \cdot n \cdot \mathrm{polylog}(n))$ space; in other words, if one hopes for achieving a $(1 - \varepsilon)$-approximation, an exponential dependence on $(1/\varepsilon)$ in the space is unavoidable (see Corollary 5.1). As the main focus of our work is on the algorithm in Theorem 1, we postpone the details and the ideas behind this result to Section 5.

Prior work.

As stated earlier, there have been two main lines of attack on the streaming matching problem in random-order streams. The first approach aims to find a large matching of the graph $G$ early on in the stream, and then spends the rest of the stream augmenting this matching. For instance, [KMM12] showed that in order for the greedy algorithm to fail to find a better-than-$1/2$-approximation, the algorithm should necessarily pick many "wrong" edges early on in the stream. As such, in instances where greedy is not beating the $1/2$-approximation itself, we already have an almost $1/2$-approximation by the middle of the stream, and we can thus focus on augmenting this matching in the remaining half to beat $1/2$-approximation. The work of [Kon18] then improved this result further by showing that a modified greedy algorithm, when unsuccessful in obtaining a large matching itself, finds an almost $1/2$-approximation when only an $o(1)$-fraction of the stream has passed (as opposed to the middle), which gives us more room for augmentation. Finally, [FHM+20] built on this approach and further improved the augmentation phase.

The second approach to this problem was based on obtaining an EDCS, a subgraph defined by [BS15, BS16] and studied further in [AB19], that acts as a "matching sparsifier". On a high level, an EDCS is a sparse subgraph satisfying the following two constraints: (i) edge-degree of edges in the EDCS cannot be "high", while (ii) edge-degree of missing edges cannot be "low". These constraints ensure that an EDCS always contains an almost $2/3$-approximate matching of the graph and has additional robustness properties [BS15, BS16, ABB+19, AB19, Ber20]. For instance, [ABB+19] proved that the union of several EDCS computed on different parts of a random stream is itself an EDCS for the entire stream. This allowed them to compute an EDCS of the input in $\tilde{O}(n^{1.5})$ space and directly obtain their almost $2/3$-approximation. Finally, [Ber20] gave an elegant proof that weakening the requirement of EDCS allows one to still preserve the almost $2/3$-approximation but now recover this subgraph in only $O(n \log n)$ space. More specifically, the algorithm of [Ber20] first finds a subgraph only satisfying property (i) of the EDCS in the first $o(1)$ fraction of the stream, and then picks all (potentially) necessary edges for satisfying property (ii) in the remainder; the proof then shows that this set of potentially necessary edges is of size only $O(n \log n)$.

Our work.

Our approach can be seen as a natural combination of these two mostly disjoint lines of work. The first part comes from a better understanding of EDCS. We present a rough characterization of when an EDCS cannot beat the $2/3$-approximation, which shows that in these instances, we can effectively ignore the second constraint of EDCS. As a result, we obtain that the only way for the algorithm of [Ber20] to fail to achieve a better-than-$2/3$-approximation is if it already picks an almost $2/3$-approximation in the first $o(1)$ fraction of the stream. Note that this is conceptually similar to the first line of work on random-order streaming matching, but the techniques are entirely disjoint. In particular, our proof is a deterministic property of EDCS, not a randomized property of a greedy algorithm on a particular ordering.

We are now in the familiar territory of having a large matching very early on in the stream, and we can spend the remainder of the stream augmenting it. The main difference however is that starting from an almost $2/3$-approximate matching, there are essentially no length-3 paths for us to augment and we instead need to handle length-5 augmenting paths. The key challenge is to find the middle edge of these length-5 augmenting paths. Indeed, we note that the $2/3$-approximation lower bound of [GKK12] for adversarial order streams gives away a $2/3$-approximate matching early on for free, yet it is provably impossible to augment it in the remainder of the stream using a semi-streaming algorithm. To get around this, we crucially use the random arrival assumption again. Particularly, we regard any length-5 augmenting path whose middle edge arrives after its two endpoint edges as a "discoverable" path and then find a constant fraction of such paths. Since the edges arrive in a random order, a constant fraction of length-5 augmenting paths will be discoverable and thus we are able to beat $2/3$-approximation in our setting.

General notation.

For a graph $G = (V, E)$ and $v \in V$, we use $\deg_G(v)$ to denote the degree of $v$ in $G$ and $N_G(v)$ to denote the neighbor-set of $v$ (when clear from the context, we may drop the subscript $G$). For any edge $e = (u, v) \in E$, we define the edge-degree of $e$ in $G$ as $\deg(u) + \deg(v)$. We use $\mu(G)$ to denote the size (i.e., the number of edges) of the maximum matching in $G$.

For integer $k \geq 1$ and $p \in [0, 1]$, we use $B(k, p)$ to denote the binomial distribution with parameters $k$ and $p$. That is, $B(k, p)$ is the discrete probability distribution of the number of successful experiments out of $k$ experiments each with an independent probability $p$ of success.

Random-order streams.

We consider the random-order streaming setting where the edges of $G$ arrive one by one in an order chosen uniformly at random from all possible orderings. Let $e_i$ be the $i$-th edge that arrives in the stream. For any two parameters $a, b$ satisfying $1 \leq a < b \leq m$ we use $G[a, b]$ to denote the subgraph of $G$ on vertex-set $V$ and edge-set $\{e_a, \ldots, e_b\}$. We may also use $G_{<b}$ and $G_{\geq a}$ to denote $G[1, b-1]$ and $G[a, m]$, respectively.

For the input graph $G$ defined by the stream, we can assume w.l.o.g. that $\mu(G) \geq c \log n$ for any desirably large constant $c$. The reason is that any graph can be easily shown to have at most $2n \cdot \mu(G)$ edges and if $\mu(G) = O(\log n)$ then we can store the whole input in the memory and report an optimal solution using $O(n \log n)$ space. We further assume throughout the paper that the number of edges $m$ is known by the algorithm in advance. This is a common assumption in the literature and can be removed via standard techniques by guessing $m$ in geometrically increasing values at the expense of multiplying the space by an $O(\log n)$ factor.

Preliminaries

Probabilistic tools.

We use the following standard forms of Chernoff bound.

Proposition 2.1 (Chernoff Bound; cf. [AS04]). Suppose $X_1, \ldots, X_t$ are $t$ independent random variables with values in $[0, 1]$. Let $X := \sum_{i=1}^{t} X_i$ and assume $E[X] \leq b$. For any $\delta > 0$ and $k \geq 1$,
$$\Pr\big(|X - E[X]| \geq \delta \cdot b\big) \leq [\ldots] \cdot \exp\big(-[\ldots]\,\delta^{[\ldots]} \cdot b\big) \quad \& \quad \Pr\big(|X - E[X]| \geq k\big) \leq [\ldots] \cdot \exp\big(-[\ldots]\,k^{[\ldots]} / t\big).$$

We also need Lovász Local Lemma (LLL) in our proofs.

Proposition 2.2 (Lovász Local Lemma; cf. [AS04]). Let $p \in (0, 1)$ and $d \geq 1$. Suppose $E_1, \ldots, E_t$ are $t$ events such that $\Pr(E_i) \leq p$ for all $i \in [t]$ and each $E_i$ is mutually independent of all but (at most) $d$ other events $E_j$. If $p \cdot (d + 1) < 1/e$ then $\Pr\big(\cap_{i=1}^{t} \overline{E_i}\big) > 0$.

Hall's theorem.

We use the following standard extension of Hall's marriage theorem for characterizing maximum matching size in bipartite graphs.

Fact 2.3 (Extended Hall's Theorem; cf. [Hal35]). Let $G = (L, R, E)$ be a bipartite graph and $|L| = |R| = n$. Then,
$$\max_{A} \big(|A| - |N(A)|\big) = n - \mu(G),$$
where $A$ ranges over subsets of $L$ or of $R$, separately. We refer to such a set $A$ as a witness set.

Fact 2.3 follows from the Tutte-Berge formula for matching size in general graphs [Tut47, Ber62] or a simple extension of the proof of Hall's marriage theorem itself.

Alternating and augmenting paths.

Given a matching $M$, an alternating path $P$ for $M$ is a path whose edges alternately belong to $M$ and do not belong to $M$. An augmenting path for $M$ is an alternating path that starts and ends with edges that do not belong to $M$. Given an augmenting path $P$ for $M$, we use notation $M \oplus P := (M \setminus P) \cup (P \setminus M)$ to denote the matching obtained by flipping the containment of edges of $P$ in $M$. Given two matchings $M$ and $M'$, their symmetric difference $M \Delta M'$ is a graph including only the edges that belong to exactly one of $M$ and $M'$.

(Footnote on the extension of Hall's theorem in Fact 2.3: Simply add $n - \mu(G)$ vertices to each side of the graph and connect them to all the original vertices; then apply the original Hall's theorem for perfect matching to this graph, as this graph now has one.)

We briefly review the parameters and guarantees of the algorithm of Bernstein [Ber20] that we use in our paper. In the following, we slightly increase the constants in the parameters, which is needed for our results.

Definition 2.4 (Parameters). For some small $\varepsilon \in (0, [\ldots])$ to be determined later, let
$$\lambda := \varepsilon, \qquad \beta^+ := 64 \cdot \lambda^{-[\ldots]} \log(1/\lambda), \qquad \beta^- = (1 - \lambda) \cdot \beta^+.$$

A high level overview of the algorithm of [Ber20] is as follows:

Algorithm 1. Bernstein's Algorithm [Ber20].
The algorithm of [Ber20] proceeds in two phases as follows:
• Phase I terminates within the first $\varepsilon m$ edges of the stream. At the end of Phase I, the algorithm constructs a subgraph $H \subseteq G_{<\varepsilon m}$ such that for all $(u, v) \in H$: $\deg_H(u) + \deg_H(v) \leq \beta^+$. Moreover, let $U$ be the set of all edges $(u, v)$ in $G_{\geq \varepsilon m}$ such that $\deg_H(u) + \deg_H(v) < \beta^-$.
• In Phase II, the algorithm simply stores $U$ in the memory and at the end of the stream returns a maximum matching of $H \cup U$.

The following lemma is all we need from [Ber20] in our paper.

Lemma 2.5 (Lemma 4.1 of [Ber20]). There is a way of constructing the subgraph $H$ of $G_{<\varepsilon m}$ such that with probability at least $1 - n^{-[\ldots]}$, $|H \cup U| = O(n \log(n) \cdot \mathrm{poly}(1/\varepsilon))$.

(2/3)-Approximation Early On

We start by characterizing the tight instances of the algorithm of [Ber20] (Algorithm 1). Roughly speaking, we show that the only way for Algorithm 1 to end up with a $(2/3)$-approximation […] $H$ that already has an almost $(2/3)$-approximate matching […]. In this section we prove the following structural result:

Theorem 2. Let $\lambda \in (0, 1/2)$ and $\beta^- \leq \beta^+$ be such that $\beta^+ \geq [\ldots]$ and $\beta^- \geq (1 - \lambda)\beta^+$. Suppose $G = (L, R, E)$ is any bipartite graph and:
(i) $H$ is a subgraph of $G$ where for all $(u, v) \in H$: $\deg_H(u) + \deg_H(v) \leq \beta^+$; and
(ii) $U$ is the set of all edges $(u, v)$ in $G \setminus H$ such that $\deg_H(u) + \deg_H(v) < \beta^-$.
Then, for any parameter $\delta \in (0, 1)$, either:
$$\mu(H) \geq (1 - 4\lambda) \cdot \Big(\frac{2}{3} - \delta\Big) \cdot \mu(G) \qquad \text{or} \qquad \mu(H \cup U) \geq (1 - \lambda) \cdot \Big(\frac{2}{3} + \frac{\delta^2}{18}\Big) \cdot \mu(G).$$

Let us define the following (see Figure 1 for an illustration):
• Let $M^*$ be a maximum matching of $G$ and define $M^*_U := M^* \cap U$ and $M^*_{\bar U} := M^* \setminus U$.
• $A$ is Hall's theorem witness set in $H \cup M^*_U$ (as in Fact 2.3) and $B := N_{H \cup M^*_U}(A)$. Without loss of generality we assume $A \subseteq L$ and define $\bar A := L \setminus A$ and $\bar B := R \setminus B$.

We start with the following simple claim that follows easily from Fact 2.3.

Claim 3.1. For the witness set $A$:
(i) $|\bar A| + |B| \leq \mu(H \cup U)$.
(ii) There is a matching $\bar M \subseteq M^*_{\bar U}$ between $A$ and $\bar B$ in $G$ with size $|\bar M| = \mu(G) - \mu(H \cup M^*_U)$.

Proof. For part (i), note that
$$|\bar A| + |B| = n - (|A| - |B|) = n - \big(n - \mu(H \cup M^*_U)\big) \leq \mu(H \cup U),$$
where the second to last equation is since $A$ is a witness set in $H \cup M^*_U$, and the last inequality is because $M^*_U$ is a subset of $U$.

For part (ii), consider the graph consisting of only $M^*$. Given that for the set $A$ in this new graph, we have $|A| - |N_{M^*}(A)| \leq n - \mu(G)$ by Fact 2.3, we get that $|N_{M^*}(A)| - |B| \geq \mu(G) - \mu(H \cup M^*_U)$. Moreover, since $M^*$ is a matching, these new neighbors of $A$ are only formed via a matching. Finally, as these edges are missing from $H \cup M^*_U$, this matching from $A$ to $\bar B$ should entirely belong to $M^*_{\bar U}$.
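As a sanity check of the deficiency formula in Fact 2.3 that drives Claim 3.1, the following minimal Python sketch compares a brute-force maximum of $|A| - |N(A)|$ over subsets $A \subseteq L$ with $n - \mu(G)$ on a toy bipartite graph. The function names and the toy instance are illustrative assumptions, not part of the paper, and the brute force is only meant for very small $n$.

```python
from itertools import combinations


def max_matching_size(n, adj):
    """Maximum matching size of a bipartite graph with n vertices per side.

    adj[u] lists the right-side neighbors of left vertex u.
    """
    match_right = [-1] * n  # match_right[v] = left vertex matched to v

    def try_augment(u, seen):
        # Standard augmenting-path search starting from left vertex u.
        for v in adj[u]:
            if v in seen:
                continue
            seen.add(v)
            if match_right[v] == -1 or try_augment(match_right[v], seen):
                match_right[v] = u
                return True
        return False

    return sum(1 for u in range(n) if try_augment(u, set()))


def max_deficiency(n, adj):
    """Brute-force max over subsets A of L of |A| - |N(A)| (empty A gives 0)."""
    best = 0
    for size in range(1, n + 1):
        for subset in combinations(range(n), size):
            neighbors = set()
            for u in subset:
                neighbors.update(adj[u])
            best = max(best, len(subset) - len(neighbors))
    return best


# Hypothetical toy instance: left vertices 0 and 1 compete for right vertex 0.
toy_adj = [[0], [0], [0, 1], [2, 3]]
toy_n = 4
```

Here the set $A = \{0, 1\}$ of left vertices is a witness set: it has two vertices but a single neighbor, matching the one vertex left unmatched by any maximum matching.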

Figure 1: An illustration of the Hall's witness set and our notation in the proof of Theorem 2. Note that in particular, there are no edges between $A$ and $\bar B$ in $H \cup M^*_U$, and the matching $\bar M$ belongs entirely to $M^*_{\bar U}$.

Consider any edge $(u, v) \in \bar M$ defined in Claim 3.1. As $\bar M \subseteq M^*_{\bar U}$, by property (ii) of Theorem 2 statement, we have $\deg_H(u) + \deg_H(v) \geq \beta^-$. We arbitrarily remove the edges on $u$ and $v$ until the above inequality becomes tight for every edge (since $\bar M$ is a matching, this is possible indeed). We let $F$ be the remaining edges. Note that any edge in $F$ is incident on exactly one vertex of $\bar M$ as there are no edges in $H \cup M^*_U$ between the endpoints of $\bar M$. We record these properties as follows:
$$\forall (u, v) \in \bar M: \ \deg_F(u) + \deg_F(v) = \beta^- \qquad \text{and} \qquad |F| = |\bar M| \cdot \beta^-. \tag{1}$$

In the following, we first give some illustrating examples that highlight the ideas for proving Theorem 2, and then proceed to the formal proof.

Illustrating Examples and The High Level Idea

By Claim 3.1, $\mu(H \cup U) \geq \mu(G) - |\bar M|$; thus, if $|\bar M|$ is sufficiently smaller than $\mu(G)/3$, we already satisfy the second condition of Theorem 2 and we would be done. As such, in this informal discussion, we are simply going to assume that $|\bar M| = \mu(G)/3$. Moreover, we define the endpoints of $\bar M$ as $S$, and the neighbor-set of $S$ in $H$ as the set $T$. See Figure 1 for an illustration. Let us now consider two extreme cases:

When degrees of edges in $\bar M$ are "highly balanced". That is, both endpoints of edges in $\bar M$, namely, vertices in $S$, have degree $\beta^-/2$ (recall that the edge-degree of every edge of $\bar M$ is $\beta^-$). We claim that in this case, there is a large matching in $H$ already that satisfies condition one of Theorem 2.

Firstly, note that the degrees of vertices in $T$ need to be at most $\beta^+ - \beta^-/2 \leq (1 + \lambda)\beta^+/2$ to satisfy property (i) of Theorem 2 for edges of $H$ between $S$ and $T$. As such, the subgraph between $S$ and $T$ has degree $\beta^-/2$ on the $S$-side and degree at most $(1 + \lambda)\beta^+/2$ on the $T$-side. By putting a mass of $\frac{2}{(1 + \lambda)\beta^+}$ on every edge of this subgraph, we can create a feasible fractional matching of value $|S| \cdot (\beta^-/2) \cdot \big(2 / ((1 + \lambda) \cdot \beta^+)\big) \geq (1 - \Theta(\lambda))|S|$ in this subgraph (and thus $H$). Considering the integrality gap of the matching polytope in bipartite graphs is one, this means there is a matching of size $(1 - \Theta(\lambda))|S| = (1 - \Theta(\lambda)) \cdot 2|\bar M| = (1 - \Theta(\lambda)) \cdot 2\mu(G)/3$ in $H$. Thus, in this case, $H$ already has a large matching that satisfies the first condition of Theorem 2.

It is worth mentioning that in the tight $2/3$-approximation instances […], $H \cup U$ may not have a matching of size larger than $2\mu(G)/3$, i.e., the second condition of Theorem 2 may indeed not hold here.
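The arithmetic of the balanced case can be checked exactly with rational numbers. The sketch below is only an illustration under the idealized assumptions of this informal discussion (every vertex of $S$ has degree exactly $\beta^-/2$ and $\beta^- = (1 - \lambda)\beta^+$); the function name and the sample parameters are hypothetical.

```python
from fractions import Fraction


def balanced_case_value(num_S, beta_plus, lam):
    """Value of the uniform fractional matching in the balanced case.

    Assumes every vertex of S has degree beta_minus / 2, where
    beta_minus = (1 - lam) * beta_plus (idealized setting of the discussion).
    """
    beta_minus = (1 - lam) * beta_plus
    mass = 2 / ((1 + lam) * beta_plus)      # mass placed on every S-T edge
    s_degree = beta_minus / 2               # degree of each vertex in S
    max_t_degree = beta_plus - s_degree     # forced by property (i)
    # Feasibility: the fractional degree of every vertex is at most 1.
    assert s_degree * mass <= 1
    assert max_t_degree * mass <= 1
    return num_S * s_degree * mass


# Hypothetical numbers: |S| = 60, beta_plus = 1000, lambda = 1/100.
example_value = balanced_case_value(60, Fraction(1000), Fraction(1, 100))
```

With these numbers the value equals $|S| \cdot (1 - \lambda)/(1 + \lambda)$, which is the $(1 - \Theta(\lambda))|S|$ bound stated above.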

When degrees of edges in $\bar M$ are "mostly unbalanced". Let us for our informal discussion assume that for every edge in $\bar M$ its endpoint in $L$ has degree $\beta^-/3$ while its endpoint in $R$ has degree $2\beta^-/3$ (so that the edge-degree is $\beta^-$ as in Eq (1)). We claim that in this case, $H \cup U$ has a large matching that satisfies condition two of Theorem 2.

In this case, to satisfy property (i) of Theorem 2 for edges of $H$ between $S$ and $T$, we need that vertices in $T \cap L$ should have degree at most $\beta^+ - 2\beta^-/3 \leq (1 + 2\lambda)\beta^+/3$. Given the bound of $2\beta^-/3$ on the degree of vertices in $S \cap R$, we have that
$$|T \cap L| \geq (1 - \Theta(\lambda)) \cdot 2 \cdot |S \cap R|.$$
A similar argument also proves that
$$|T \cap R| \geq (1 - \Theta(\lambda)) \cdot \frac{1}{2} \cdot |S \cap L|.$$
Now note that by Claim 3.1, $|S \cap R| = |S \cap L| = |\bar M| = \mu(G) - \mu(H \cup M^*_U) \geq \mu(G) - \mu(H \cup U)$, while $|T \cap L| + |T \cap R| = |T| \leq |\bar A| + |B| \leq \mu(H \cup U)$. Combining these with the above two bounds, we get that
$$\mu(H \cup U) \geq (1 - \Theta(\lambda)) \cdot \frac{5}{7} \cdot \mu(G).$$
Thus, in this case, $H \cup U$ has a matching which is a (much) better than $2/3$-approximation […] $H$ may not have a matching larger than $\frac{3}{2} \cdot |\bar M| = \mu(G)/2$, which means the first condition of Theorem 2 may indeed not hold here.

The above extreme examples suggest that when edge-degrees of $\bar M$ are more toward being balanced, the subgraph $H$ has a close to $2/3$-approximate matching […] $H \cup U$ is strictly better than $2/3$-approximate […].

The Formal Proof

In the following lemma, we prove a lower bound on $\mu(H)$. This lemma can then be used as follows: if the degrees of most edges in $\bar M$ are "balanced", i.e., both endpoints have degree $\approx \beta^-/2$, then $\mu(H)$ will already be of size almost $2 \cdot |\bar M|$, which will be sufficient for the first condition of Theorem 2.

Lemma 3.2 (matching of $H$ is large). We have
$$\mu(H) \geq \frac{\beta^-}{1 + 4\lambda} \cdot \sum_{(u, v) \in \bar M} \frac{1}{\max\{\deg_F(u), \deg_F(v)\}}.$$

Proof. For every edge $(u, v) \in \bar M$, define $F(u, v)$ as the set of edges in $F$ that are incident on $u$ or $v$. We define the following fractional matching $x \in \mathbb{R}^F$ on edges of $F$:
• for any edge $e \in F(u, v)$: set $x_e := \dfrac{1}{(1 + 4\lambda) \cdot \max\{\deg_F(u), \deg_F(v)\}}$.

Let us now prove that this is indeed a valid fractional matching. For any vertex $w$ matched by $\bar M$,
$$x_w := \sum_{e \ni w} x_e \leq \deg_F(w) \cdot \frac{1}{(1 + 4\lambda) \cdot \deg_F(w)} < 1,$$
thus satisfying the fractional matching constraint.

Now fix a vertex $w$ not matched by $\bar M$. Let $u_1, \ldots, u_{\deg_F(w)}$ denote the neighbors of $w$ in $F$. By definition, all these vertices are matched by $\bar M$. Let $v_1, \ldots, v_{\deg_F(w)}$ be the matched pairs of these vertices. We need the following simple claim.

Claim 3.3. For every $i \in [\deg_F(w)]$, $\deg_F(w) \leq (1 + 4\lambda) \cdot \max\{\deg_F(u_i), \deg_F(v_i)\}$.

Proof. We first have the following two equations:
$$\deg_F(w) + \deg_F(u_i) \leq \beta^+, \qquad \text{(by property (i) of Theorem 2 statement)}$$
$$\deg_F(u_i) + \deg_F(v_i) = \beta^-. \qquad \text{(by Eq (1))}$$
As such,
$$\deg_F(w) - \deg_F(v_i) \leq \beta^+ - \beta^- \leq 2\lambda\beta^-. \qquad \text{(as } \lambda \leq 1/2 \text{ and } \beta^- \geq (1 - \lambda)\beta^+ \text{)}$$
Noting that $\max\{\deg_F(u_i), \deg_F(v_i)\} \geq \beta^-/2$ concludes the proof. □ (Claim 3.3)

To finalize Lemma 3.2, for any vertex $w$ not matched by $\bar M$, we have
$$x_w := \sum_{e = (w, u_i)} x_e = \sum_{u_i} \frac{1}{(1 + 4\lambda) \cdot \max\{\deg_F(u_i), \deg_F(v_i)\}} \leq \sum_{u_i} \frac{1}{\deg_F(w)} = 1,$$
where the inequality is by Claim 3.3, thus satisfying the fractional matching constraint. This implies that $x$ is a valid fractional matching.

Finally, the value of this fractional matching is:
$$\sum_{e \in F} x_e = \sum_{(u, v) \in \bar M} \sum_{e \in F(u, v)} x_e = \sum_{(u, v) \in \bar M} \frac{\deg_F(u) + \deg_F(v)}{(1 + 4\lambda) \cdot \max\{\deg_F(u), \deg_F(v)\}} = \frac{\beta^-}{1 + 4\lambda} \cdot \sum_{(u, v) \in \bar M} \frac{1}{\max\{\deg_F(u), \deg_F(v)\}},$$
where the last equation is by Eq (1). As the integrality gap of the matching polytope on bipartite graphs is one, we obtain the desired lower bound on $\mu(H)$. □ (Lemma 3.2)
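To see how the bound of Lemma 3.2 behaves in the two extreme cases discussed earlier, the following small Python sketch evaluates its right-hand side exactly. It takes the degree pairs $(\deg_F(u), \deg_F(v))$ of the edges of $\bar M$ as input; the function name and the sample degree pairs are illustrative assumptions rather than anything prescribed by the paper.

```python
from fractions import Fraction


def lemma_3_2_lower_bound(degree_pairs, beta_minus, lam):
    """Right-hand side of Lemma 3.2 for given F-degrees of the edges of M-bar.

    degree_pairs: list of (deg_F(u), deg_F(v)), one pair per edge of M-bar.
    Each pair must sum to beta_minus, as required by Eq (1).
    """
    for deg_u, deg_v in degree_pairs:
        assert deg_u + deg_v == beta_minus
    total = sum(Fraction(1) / max(deg_u, deg_v) for deg_u, deg_v in degree_pairs)
    return Fraction(beta_minus) / (1 + 4 * lam) * total


# Hypothetical inputs with lambda = 0 to expose the leading constants.
balanced = lemma_3_2_lower_bound([(50, 50)] * 3, 100, Fraction(0))
unbalanced = lemma_3_2_lower_bound([(30, 60)] * 3, 90, Fraction(0))
```

With three edges in $\bar M$, the balanced input gives $2 \cdot |\bar M| = 6$, while the one-third/two-thirds split only gives $\frac{3}{2} \cdot |\bar M|$, in line with the informal discussion.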

We now prove that if on the other hand most edges of $\bar M$ are "unbalanced", then $\mu(H \cup U)$ should be sufficiently large. To continue, we need a quick definition. Let $S$ denote the endpoints of the matching $\bar M$ and $T$ be the neighbor-set of these vertices in $F$. Recall that by Eq (1), $S$ and $T$ are disjoint (see Figure 1).

Lemma 3.4 (matching of $H \cup U$ is large). We have
$$\mu(H \cup U) \geq \frac{|\bar M|^2 \cdot (\beta^-)^2}{|\bar M| \cdot \beta^- \cdot \beta^+ - \sum_{s \in S} (\deg_F(s))^2}.$$

Proof. Since $F \subseteq H$, by property (i) of Theorem 2, we have that
$$|F| \cdot \beta^+ \geq \sum_{(u, v) \in F} \big(\deg_F(u) + \deg_F(v)\big) = \sum_{s \in S} (\deg_F(s))^2 + \sum_{t \in T} (\deg_F(t))^2. \tag{2}$$
We can lower bound the second term of the RHS as follows. Recall that a sum of squares with a fixed total is minimized when all terms are equal. As $\sum_{t \in T} \deg_F(t) = |F|$, this implies that
$$\sum_{t \in T} (\deg_F(t))^2 \geq \sum_{t \in T} \Big(\frac{|F|}{|T|}\Big)^2 = |T| \cdot \Big(\frac{|F|}{|T|}\Big)^2 = \frac{|F|^2}{|T|}.$$
By plugging in this bound in Eq (2) and moving the terms around, we have that
$$|T| \geq \frac{|F|^2}{|F| \cdot \beta^+ - \sum_{s} (\deg_F(s))^2} = \frac{|\bar M|^2 \cdot (\beta^-)^2}{|\bar M| \cdot \beta^- \cdot \beta^+ - \sum_{s} (\deg_F(s))^2}. \qquad \text{(as } |F| = |\bar M| \cdot \beta^- \text{ by Eq (1))}$$
Finally, $T \subseteq \bar A \cup B$ (as there are no edges between $A$ and $\bar B$) and thus by Claim 3.1, $|T| \leq \mu(H \cup U)$, which finalizes the proof. □ (Lemma 3.4)
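The bound of Lemma 3.4 can be evaluated in the same way. The sketch below computes its right-hand side from the degree pairs of the edges of $\bar M$; as before, the function name and the inputs are hypothetical, and $\beta^+ = \beta^-$ is assumed in the examples purely to expose the leading constants.

```python
from fractions import Fraction


def lemma_3_4_lower_bound(degree_pairs, beta_minus, beta_plus):
    """Right-hand side of Lemma 3.4 for given F-degrees of the edges of M-bar.

    degree_pairs: list of integer pairs (deg_F(u), deg_F(v)), one per edge.
    """
    k = len(degree_pairs)  # k = |M-bar|
    sum_of_squares = sum(du * du + dv * dv for du, dv in degree_pairs)
    numerator = k * k * beta_minus * beta_minus
    denominator = k * beta_minus * beta_plus - sum_of_squares
    return Fraction(numerator, denominator)


# Hypothetical inputs, taking beta_plus = beta_minus for simplicity.
balanced_bound = lemma_3_4_lower_bound([(50, 50)] * 4, 100, 100)
unbalanced_bound = lemma_3_4_lower_bound([(30, 60)] * 4, 90, 90)
```

For four edges in $\bar M$, the balanced input yields exactly $2 \cdot |\bar M| = 8$, while the unbalanced input yields $\frac{9}{4} \cdot |\bar M| = 9$, so unbalanced degrees push the lower bound on $\mu(H \cup U)$ above $2 \cdot |\bar M|$.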

Lemma 3.4 can be used as follows: when the degrees of most edges in $\bar M$ are "balanced", the quantity $\sum_{s} (\deg_F(s))^2$ will be close to $|\bar M| \cdot (\beta^-)^2/2$ and hence the bound on $\mu(H \cup U)$ will be almost $2 \cdot |\bar M|$; however, when degrees of edges in $\bar M$ are "unbalanced", the quantity $\sum_{s} (\deg_F(s))^2$ cannot decrease all the way to $|\bar M| \cdot (\beta^-)^2/2$, which gives a larger lower bound on $\mu(H \cup U)$ and breaks the $(2/3)$-approximation. We next lower bound $\sum_{s \in S} (\deg_F(s))^2$ in the RHS of Lemma 3.4, in the cases where the RHS of Lemma 3.2 is small.

Claim 3.5. Suppose $\sum_{(u, v) \in \bar M} \frac{\beta^-}{\max\{\deg_F(u), \deg_F(v)\}} = (2 - \gamma) \cdot |\bar M|$ for some $\gamma \in [0, 1)$; then
$$\sum_{s} (\deg_F(s))^2 \geq |\bar M| \cdot \frac{(2 + \gamma^2 - 2\gamma) \cdot (\beta^-)^2}{4 + \gamma^2 - 4\gamma}.$$

Proof. The intuition behind the proof is that the $\sum_{s} (\deg_F(s))^2$ term is a quadratic sum and is thus minimized in the most "balanced" case possible under the given constraints. Formally, we define the following vector of vertex degrees $d \in \mathbb{R}^S$ (recall that $S$ is the endpoints of matching $\bar M$):
• For any edge $(u, v) \in \bar M$, let $d_u := \frac{\beta^-}{2 - \gamma}$ and $d_v := \beta^- - d_u$.

Notice that these vertex degrees satisfy the first constraint of Eq (1) and that
$$\sum_{(u, v) \in \bar M} \frac{\beta^-}{\max\{d_u, d_v\}} = (2 - \gamma) \cdot |\bar M|,$$
thus satisfying the assumption of the claim as well. We now prove that these degrees minimize the quadratic sum, namely,
$$\sum_{s \in S} (\deg_F(s))^2 \geq \sum_{s \in S} d_s^2. \tag{3}$$

Suppose there is an edge $(u_1, v_1)$ where $\deg_F(u_1) > d_{u_1}$ and thus $\deg_F(v_1) < d_{v_1}$ (as both pairs satisfy Eq (1)). This also implies that there is another edge $(u_2, v_2)$ where $\deg_F(u_2) < d_{u_2}$ and $\deg_F(v_2) > d_{v_2}$ so that the sum of all degrees satisfies the condition of Eq (1).

Now consider a sufficiently small parameter $\theta_1 \in (0, 1)$ and the new "more balanced" degrees
$$\hat d_{u_1} := \deg_F(u_1) - \theta_1, \quad \hat d_{v_1} := \deg_F(v_1) + \theta_1, \quad \hat d_{u_2} := \deg_F(u_2) + \theta_2, \quad \hat d_{v_2} := \deg_F(v_2) - \theta_2,$$
where $\theta_2$ is defined using the following equation:
$$\frac{1}{\deg_F(u_1)} + \frac{1}{\deg_F(u_2)} = \frac{1}{\deg_F(u_1) - \theta_1} + \frac{1}{\deg_F(u_2) + \theta_2} = \frac{1}{\hat d_{u_1}} + \frac{1}{\hat d_{u_2}}.$$
Considering $\deg_F(u_1) > \deg_F(u_2)$, we have that $\theta_1 > \theta_2$. Note that these new degrees (assuming we keep the degrees of all other vertices unchanged) satisfy all the constraints as before. We have,
$$\sum_{s \in \{u_1, v_1, u_2, v_2\}} \deg_F(s)^2 = (\hat d_{u_1} + \theta_1)^2 + (\hat d_{v_1} - \theta_1)^2 + (\hat d_{u_2} - \theta_2)^2 + (\hat d_{v_2} + \theta_2)^2$$
$$\geq 2\theta_1 \cdot (\hat d_{u_1} - \hat d_{v_1}) - 2\theta_2 \cdot (\hat d_{u_2} - \hat d_{v_2}) + \hat d_{u_1}^2 + \hat d_{v_1}^2 + \hat d_{u_2}^2 + \hat d_{v_2}^2 \qquad \text{(by ignoring the positive } \theta_1^2, \theta_2^2 \text{ terms)}$$
$$> \hat d_{u_1}^2 + \hat d_{v_1}^2 + \hat d_{u_2}^2 + \hat d_{v_2}^2. \qquad \text{(as } \hat d_{u_1} - \hat d_{v_1} > \hat d_{u_2} - \hat d_{v_2} \text{ and } \theta_1 > \theta_2 \text{)}$$
Thus, this change reduces the value of the $\sum_{s \in S} \deg_F(s)^2$ term as expected. We can now repeatedly continue this until we converge to the degree distribution $\{d_s\}_{s \in S}$ defined earlier. This proves Eq (3).

By plugging in the bounds for $\{d_s\}_{s \in S}$ in the RHS of Eq (3), we have that
$$\sum_{s \in S} (\deg_F(s))^2 \geq \sum_{s \in S} d_s^2 = \sum_{(u, v) \in \bar M} \big(d_u^2 + d_v^2\big) = |\bar M| \cdot \Big(\frac{(\beta^-)^2}{(2 - \gamma)^2} + \Big(\beta^- - \frac{\beta^-}{2 - \gamma}\Big)^2\Big) = |\bar M| \cdot \frac{(2 + \gamma^2 - 2\gamma) \cdot (\beta^-)^2}{4 + \gamma^2 - 4\gamma},$$
as desired. □ (Claim 3.5)
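The closed form at the end of the proof of Claim 3.5 is a short algebraic identity, and it can be double-checked with exact rational arithmetic. The following Python sketch (with hypothetical function names and sample values) verifies, for a few values of $\gamma$, that the degrees $d_u = \beta^-/(2 - \gamma)$ and $d_v = \beta^- - d_u$ satisfy both the assumption of the claim and the stated value of $d_u^2 + d_v^2$.

```python
from fractions import Fraction


def balanced_degrees(beta_minus, gamma):
    """Degrees (d_u, d_v) used in the proof of Claim 3.5."""
    d_u = beta_minus / (2 - gamma)
    d_v = beta_minus - d_u
    return d_u, d_v


def per_edge_closed_form(beta_minus, gamma):
    """Claimed value of d_u^2 + d_v^2 for a single edge of M-bar."""
    return (2 + gamma**2 - 2 * gamma) * beta_minus**2 / (4 + gamma**2 - 4 * gamma)


def check_claim_3_5_identity(beta_minus, gamma):
    d_u, d_v = balanced_degrees(beta_minus, gamma)
    # The assumption of the claim: beta_minus / max{d_u, d_v} = 2 - gamma.
    assumption_holds = beta_minus / max(d_u, d_v) == 2 - gamma
    # The closed form for the quadratic sum of one edge.
    identity_holds = d_u**2 + d_v**2 == per_edge_closed_form(beta_minus, gamma)
    return assumption_holds and identity_holds


# Hypothetical sample values of gamma in [0, 1).
sample_gammas = [Fraction(0), Fraction(1, 10), Fraction(1, 2), Fraction(9, 10)]
all_ok = all(check_claim_3_5_identity(Fraction(120), g) for g in sample_gammas)
```

For instance, with $\beta^- = 120$ and $\gamma = 1/2$ the degrees are $80$ and $40$, and both sides of the identity equal $8000$.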

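Proof of Theorem 2. Let us pick $\gamma \in [0, 1)$ such that $\sum_{(u, v) \in \bar M} \frac{\beta^-}{\max\{\deg_F(u), \deg_F(v)\}} = (2 - \gamma) \cdot |\bar M|$ (as the max-term is at least $\beta^-/2$, such a $\gamma$ always exists). By plugging in the bound of Claim 3.5 in Lemma 3.4, we have that
$$\mu(H \cup U) \geq \frac{|\bar M|^2 \cdot (\beta^-)^2}{|\bar M| \cdot \beta^- \cdot \beta^+ - |\bar M| \cdot \frac{(2 + \gamma^2 - 2\gamma) \cdot (\beta^-)^2}{4 + \gamma^2 - 4\gamma}}$$
$$\geq (1 - \lambda) \cdot |\bar M| \cdot \frac{1}{1 - \frac{2 + \gamma^2 - 2\gamma}{4 + \gamma^2 - 4\gamma}} \qquad \text{(as } \beta^- \geq (1 - \lambda)\beta^+ \text{)}$$
$$= (1 - \lambda) \cdot |\bar M| \cdot \frac{4 + \gamma^2 - 4\gamma}{2 - 2\gamma} = (1 - \lambda) \cdot |\bar M| \cdot \Big(2 + \frac{\gamma^2}{2 - 2\gamma}\Big).$$
Considering $|\bar M| \geq \mu(G) - \mu(H \cup U)$ by Claim 3.1, we obtain that
$$\mu(H \cup U) \geq (1 - \lambda) \cdot \mu(G) \cdot \Big(\frac{2}{3} + \frac{\gamma^2}{18 - 18\gamma + 3\gamma^2}\Big) \geq (1 - \lambda) \cdot \mu(G) \cdot \Big(\frac{2}{3} + \frac{\gamma^2}{18}\Big).$$
If, for the parameter $\delta$ in Theorem 2, we already have $\gamma \geq \delta$, we obtain the second condition. Further, without loss of generality, we can assume that $|\bar M| \geq (\frac{1}{3} - \delta\,[\ldots]) \cdot \mu(G)$ as otherwise $\mu(H \cup M^*_U) \geq (\frac{2}{3} + \delta\,[\ldots]) \cdot \mu(G)$ by Claim 3.1, which is stronger than the second condition of Theorem 2.

Suppose then that $\gamma < \delta$ and $|\bar M| \geq (\frac{1}{3} - \delta\,[\ldots]) \cdot \mu(G)$. In this case, by the definition of $\gamma$ and Lemma 3.2,
$$\mu(H) \geq \frac{1}{1 + 4\lambda} \cdot (2 - \gamma) \cdot |\bar M| \geq \frac{1}{1 + 4\lambda} \cdot (2 - \delta) \cdot \Big(\frac{1}{3} - \delta\,[\ldots]\Big) \cdot \mu(G) \geq (1 - 4\lambda) \cdot \Big(\frac{2}{3} - \delta\Big) \cdot \mu(G),$$
thus satisfying the first condition. This concludes the proof. □ (Theorem 2)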
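The rearrangement step in the proof above (from the bound in terms of $|\bar M|$ to the bound in terms of $\mu(G)$) can be checked symbolically by hand, or numerically as in the sketch below. The sketch ignores the $(1 - \lambda)$ factor, i.e., it assumes $\lambda = 0$, and sets $a = 2 + \gamma^2/(2 - 2\gamma)$; from $x \geq a(1 - x)$ for $x = \mu(H \cup U)/\mu(G)$ one gets $x \geq a/(1 + a)$. The function names are illustrative.

```python
from fractions import Fraction


def ratio_after_rearranging(gamma):
    """a / (1 + a) for a = 2 + gamma^2 / (2 - 2 * gamma), with lambda = 0."""
    a = 2 + gamma**2 / (2 - 2 * gamma)
    return a / (1 + a)


def stated_closed_form(gamma):
    """The expression 2/3 + gamma^2 / (18 - 18 * gamma + 3 * gamma^2)."""
    return Fraction(2, 3) + gamma**2 / (18 - 18 * gamma + 3 * gamma**2)


def simplified_lower_bound(gamma):
    """The final, simpler bound 2/3 + gamma^2 / 18."""
    return Fraction(2, 3) + gamma**2 / 18


# Hypothetical grid of gamma values in [0, 1).
gamma_grid = [Fraction(i, 20) for i in range(20)]
identity_ok = all(ratio_after_rearranging(g) == stated_closed_form(g) for g in gamma_grid)
bound_ok = all(stated_closed_form(g) >= simplified_lower_bound(g) for g in gamma_grid)
```

On this grid the two expressions agree exactly and the simplified bound $2/3 + \gamma^2/18$ is never larger, which matches the two inequalities used in the proof.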

We now extend the results of Theorem 2 to general (non-bipartite) graphs following the probabilistic method technique of [AB19] for the original EDCS.

Corollary 3.6. Let $\lambda \in (0, 1/2)$ and $\beta^- \leq \beta^+$ be such that $\beta^+ \geq [\ldots]\,\lambda^{-[\ldots]} \cdot \log(1/\lambda)$ and $\beta^- \geq (1 - \lambda)\beta^+$. Suppose $G = (V, E)$ is any graph (not necessarily bipartite) and:
(i) $H$ is a subgraph of $G$ where for all $(u, v) \in H$: $\deg_H(u) + \deg_H(v) \leq \beta^+$; and
(ii) $U$ is the set of all edges $(u, v)$ in $G \setminus H$ such that $\deg_H(u) + \deg_H(v) < \beta^-$.
Then, for any parameter $\delta \in (0, 1)$, either:
$$\mu(H) \geq (1 - [\ldots]\lambda) \cdot \Big(\frac{2}{3} - \delta\Big) \cdot \mu(G) \qquad \text{or} \qquad \mu(H \cup U) \geq (1 - [\ldots]\lambda) \cdot \Big(\frac{2}{3} + \frac{\delta^2}{18}\Big) \cdot \mu(G).$$

Proof. The proof is based on the probabilistic method and Lovász Local Lemma. Let $M^*$ be a maximum matching of $G$. Consider the following randomly chosen bipartite subgraph $\tilde G = (L, R, \tilde E)$ of $G$ with respect to $M^*$, where $L \cup R = V$:
• For any edge $(u, v) \in M^*$, with probability $1/2$, $u$ belongs to $L$ and $v$ belongs to $R$, and with probability $1/2$, the opposite (the choices between different edges of $M^*$ are independent).
• For any vertex $w \in V$ not matched by $M^*$, we assign $w$ to $L$ or $R$ uniformly at random (again, the choices are independent across vertices).
• The set of edges in $\tilde E$ are all edges in $E$ with one end point in $L$ and the other one in $R$.

Note that by the definition of $\tilde G$, every edge of $M^*$ belongs to $\tilde G$ as well and thus $\mu(\tilde G) = \mu(G)$. Define $\tilde H := H \cap \tilde G$ and $\tilde U := U \cap \tilde G$. We prove that with non-zero probability:
(i) For all $(u, v) \in \tilde H$: $\deg_{\tilde H}(u) + \deg_{\tilde H}(v) \leq (1 + [\ldots]\lambda) \cdot \beta^+/2$;
(ii) $\tilde U$ is the set of all edges $(u, v)$ in $\tilde G \setminus \tilde H$ where $\deg_{\tilde H}(u) + \deg_{\tilde H}(v) < (1 - [\ldots]\lambda)\beta^-/2$.

[…] $\tilde G$ of $G$ and the sets $\tilde H$ and $\tilde U$. Since $\tilde G$ is bipartite and $\tilde H$ and $\tilde U$ satisfy the requirements of Theorem 2 for parameters $\tilde\beta^+ = (1 + [\ldots]\lambda) \cdot \beta^+/2$, $\tilde\beta^- = (1 - [\ldots]\lambda)\beta^-/2$, and $\tilde\lambda = \lambda/2$, we get either
$$\mu(\tilde H) \geq (1 - [\ldots]\lambda) \cdot \Big(\frac{2}{3} - \delta\Big) \cdot \mu(\tilde G) \qquad \text{or} \qquad \mu(\tilde H \cup \tilde U) \geq (1 - [\ldots]\lambda) \cdot \Big(\frac{2}{3} + \frac{\delta^2}{18}\Big) \cdot \mu(\tilde G).$$
As $\tilde H \subseteq H$, $\tilde U \subseteq U$, and $\mu(\tilde G) = \mu(G)$, we obtain the final result (notice that for this argument, we only need existence of $\tilde H$ and $\tilde U$ and not a way of finding them; as such, the non-zero probability guarantee completely suffices for us).

To prove either property, we need the following auxiliary claim.

Claim 3.7. With non-zero probability, for every vertex $v \in V$,
$$\big|\deg_{\tilde H}(v) - \deg_H(v)/2\big| < [\ldots]\lambda \cdot \beta^-.$$

Proof. Fix any vertex $v \in V$ and let $N_H(v) := \{u_1, \ldots, u_{\deg_H(v)}\}$ be the neighbors of $v$ in $H$. Let us assume $v$ is assigned to $L$ in $\tilde G$ (the other case is symmetric). Hence, the degree of $v$ in $\tilde H$ is exactly equal to the number of vertices in $N_H(v)$ that are chosen in $R$. By construction of $\tilde G$,
$$E\big[\deg_{\tilde H}(v)\big] = \begin{cases} (\deg_H(v) + 1)/2 & \text{if } v \text{ is incident on } M^* \cap H, \\ \deg_H(v)/2 & \text{otherwise.} \end{cases}$$
Also, if two vertices $u_i, u_j$ in $N_H(v)$ are matched by $M^*$, then exactly one of them will be a neighbor to $v$ in $\tilde H$; otherwise the choices are independent. Thus, by Chernoff bound (Proposition 2.1),
$$\Pr\Big(\big|\deg_{\tilde H}(v) - \deg_H(v)/2\big| \geq [\ldots]\lambda \cdot \beta^-\Big) \leq [\ldots]$$
(as $\beta^+ \geq [\ldots]\,\lambda^{-[\ldots]} \log(1/\lambda)$ and $\beta^- \geq (1 - \lambda)\beta^+$, we have $\beta^- \geq [\ldots]\,\lambda^{-[\ldots]} \cdot \log \beta^+$).

For every vertex $v \in V$, define:
• event $E_v$: the event that $|\deg_{\tilde H}(v) - \deg_H(v)/2| \geq [\ldots]\lambda \cdot \beta^-$.

The event $E_v$ depends only on the choice of vertices in $N_H(v)$ and hence can depend on at most $(\beta^+)^{[\ldots]}$ other events $E_u$ for vertices $u$ which are neighbors to $N_H(v)$. As such, we can apply Lovász Local Lemma (Proposition 2.2) to argue that with a non-zero probability, $\cap_{v \in V} \overline{E_v}$ happens, which concludes the proof. □ (Claim 3.7)

In the following, we condition on the non-zero probability event of Claim 3.7.

Proof of property ( i ) . For any edge ( u, v ) ∈ ˜ H , we have,deg ˜ H ( u ) + deg ˜ H ( v ) ≤ · (deg H ( u ) + deg H ( v )) + λ · β − ≤ β + / λ · β − ≤ (1 + λ ) · β + / , where the second to last inequality is because ( u, v ) ∈ H . As such all edge ( u, v ) ∈ ˜ H have thedesired bound on edge-degree. Proof of property ( ii ) . For any edge ( u, v ) ∈ ˜ G \ ˜ H with deg ˜ H ( u ) + deg ˜ H ( v ) < (1 − λ ) · β − / H ( u ) + deg H ( v ) ≤ · (cid:0) deg ˜ H ( u ) + deg ˜ H ( v ) (cid:1) + λ · β − < (1 − λ ) · β − + λ · β − < β − . This implies that this edge belongs to U and thus since ˜ U := ˜ G ∩ U , it also belongs to ˜ U . As aresult, any edge with “low” edge-degree belongs to U .This concludes the proof. Corollary 3.6 An Improved Algorithm via Augmentation

In this section, we show that the maximum matching of the subgraph $H$ constructed in the early part of the stream of Algorithm 1 can be augmented well via the remaining edges. Combined with Corollary 3.6 of Section 3, this completes the proof of Theorem 1. Namely, we show that for some parameter $\varepsilon_0 > 0$, there is a single-pass random-order streaming algorithm (formalized as Algorithm 2) that obtains a $(2/3 + \varepsilon_0)$-approximate maximum matching of $G$ using $O(n \log n)$ space with high probability $1 - 1/\mathrm{poly}(n)$.

Our starting point is Algorithm 1. Recall that this algorithm stores two subgraphs $H$ and $U$ of $G$ of size $O(n \log n)$. Subgraph $H$ is constructed early on, after merely observing $\varepsilon m$ edges of the stream. In addition to $H$ and $U$, here we store an additional subset of edges that we use to augment a matching of $H$. In particular, let $M_H$ be an arbitrary maximum matching of $H$. Having matching $M_H$ early on, our algorithm augments $M_H$ using the edges that arrive in the rest of the stream (i.e., Phase II), in parallel to storing $U$. The augmenting paths that we find may be of length up to five. This is crucial since we may not have enough augmenting paths of length smaller than five to go beyond a $(2/3)$-approximation. […] $H \cup U$ includes our desired approximation of strictly better than $2/3$, or $M_H$ is almost a $(2/3)$-approximation […] the $(1 - \varepsilon)m$ edges of Phase II into Phase II.A and Phase II.B. To do this, we first draw a random variable $\tau \sim B((1 - \varepsilon)m, \gamma)$. Phase II.A then proceeds on the edges that arrive up to the $\tau$-th edge of Phase II, and Phase II.B proceeds on the rest of the edges. Drawing the random variable $\tau$ (instead of having a fixed threshold) is particularly useful in the analysis: conditioned on the edges that are to arrive in Phase II (but not their ordering), each edge now belongs to Phase II.A independently with probability $\gamma$ and to Phase II.B otherwise. Note that with a fixed threshold, we do not get this independence.

Figure 2:

An example of an execution of Algorithm 2. Here the black zig-zagged edges are those in matching $M_H$, which is fixed by the end of Phase I and which we would like to augment. The black nodes are those matched by $M_H$ and the white ones are those left unmatched by $M_H$. The edges between white and black nodes (colored green) are the edges in $T$. Each black node has at most two edges in $T$ and the white nodes can have up to $b$. The red edges are those that arrive in Phase II.B. Three augmenting paths of length one, three, and five that are discoverable by the algorithm are also highlighted in the figure.

Algorithm 2. A random-order streaming matching algorithm with approximation ratio $> 2/3$.

Parameters: $\gamma = 2/3$, $b = 500$, and a sufficiently small constant $\varepsilon < 0.01$ to be fixed later.

(1) In Phase I of the algorithm, which consists of the first $\varepsilon m$ edges of the stream, we construct a subgraph $H$ of $G$ as in Phase I of Algorithm 1. At the end of Phase I, we fix an arbitrary maximum matching $M_H$ of $H$.

(2) In Phase II, which includes all the edges that arrive after Phase I, we store subgraph $U$ using Phase II of Algorithm 1. In addition, we store another subset of edges that we use to augment $M_H$. These edges are constructed in two sub-phases, Phase II.A and Phase II.B.

(3) Draw random variable $\tau$ from the Binomial distribution $B((1 - \varepsilon)m, \gamma)$. Note that this can be done in $O(m)$ time and $O(1)$ space as we only need a counter to count the successes.

(4) Phase II.A starts after Phase I and ends upon arrival of the $\tau$-th edge of Phase II.
    (a) Let $G_H(V_H, U_H, E_H)$ be a bipartite subgraph of $G$ where $V_H := V(M_H)$ is the set of vertices matched in $M_H$, $U_H := V \setminus V(M_H)$ is the set of vertices left unmatched in $M_H$, and $E_H$ is the set of edges of $G$ between $V_H$ and $U_H$ that arrive in Phase II.A.
    (b) We initialize $T \leftarrow \emptyset$ and upon arrival of an edge $e = (u, v)$ of $G_H$ with $u \in U_H$ and $v \in V_H$, if $\deg_T(v) < 2$ and $\deg_T(u) < b$ we add $e$ to $T$. That is, $T$ is a maximal $(2, b)$-matching of $G_H$, which requires $O(nb)$ space to store.

(5) Phase II.B starts after Phase II.A and continues to the end of the stream:
    (a) $M \leftarrow M_H$. Upon arrival of each edge $e$ in Phase II.B, we iteratively take an arbitrary augmenting path $P$ for $M$ of length up to five using the edges in $M \cup T \cup \{e\}$ and let $M \leftarrow M \oplus P$. We repeat this process until no more augmenting paths of length up to five exist in $M \cup T \cup \{e\}$; we then continue to the next edge of the stream in Phase II.B.

(6) Finally, we return a maximum matching of $M \cup H \cup U$.

For Phase II.A, let us define $G_H$ to be the subgraph of $G$ whose edges arrive in Phase II.A and have exactly one endpoint matched by $M_H$. Note that $G_H$ is bipartite (even though $G$ may not be), with one partition corresponding to vertices $V(M_H)$ and another to $V \setminus V(M_H)$.
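As a rough illustration of steps (3) and (4) of Algorithm 2, the following Python sketch draws the threshold with a single counter and then builds the edge set $T$ greedily. The function names, the representation of edges as pairs, and the `matched` set standing in for $V(M_H)$ are assumptions made for this sketch and do not come from the paper.

```python
import random
from collections import defaultdict


def draw_tau(num_phase2_edges, gamma, rng):
    """Sample tau ~ Binomial(num_phase2_edges, gamma) using one counter."""
    tau = 0
    for _ in range(num_phase2_edges):
        if rng.random() < gamma:
            tau += 1
    return tau


def greedy_2b_matching(phase2a_edges, matched, b):
    """Greedily build a maximal (2, b)-matching T of G_H.

    phase2a_edges: iterable of (x, y) pairs in arrival order (Phase II.A).
    matched: set of vertices matched by M_H (hypothetical stand-in for V_H).
    b: degree cap for vertices left unmatched by M_H (the side U_H).
    """
    deg = defaultdict(int)
    T = []
    for x, y in phase2a_edges:
        # G_H only contains edges with exactly one endpoint matched by M_H.
        if (x in matched) == (y in matched):
            continue
        v, u = (x, y) if x in matched else (y, x)
        # Step (4b): matched side has degree cap 2, unmatched side has cap b.
        if deg[v] < 2 and deg[u] < b:
            T.append((u, v))
            deg[u] += 1
            deg[v] += 1
    return T
```

Since every stored edge has an endpoint in $V(M_H)$ with degree at most 2 in $T$, the list `T` never holds more than $2 \cdot |V(M_H)|$ edges in this sketch.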
In Phase II.A, we only consider the edges of $G_H$ and greedily construct a maximal $(2, b)$-matching $T$ of $G_H$ (for some constant $b$); that is, the vertices in partition $V(M_H)$ of $G_H$ have maximum degree 2 in $T$ and those in the other partition can have degree up to $b$. In our analysis, we show that the edges of $T$ can be used as the two endpoint edges of many augmenting paths of length three or five for $M_H$ (see Figure 2).

In Phase II.B, we first let $M \leftarrow M_H$ and upon arrival of each edge $e$, we iteratively augment $M$ via length-up-to-five augmenting paths using the edges in $M \cup T \cup \{e\}$ until no such path is left. In our analysis, we use the edges of Phase II.B either as the middle edge of length-five augmenting paths or as the single edge of the length-one augmenting paths the algorithm may find (see Figure 2). At the end of the stream, we return a maximum matching of $M \cup H \cup U$. The algorithm outlined above is formalized as Algorithm 2.

Space Complexity

We know already from Lemma 2.5 that $|H \cup U| = O(n \log(n) \cdot \mathrm{poly}(1/\varepsilon)) = O(n \log n)$ for constant $\varepsilon$ with high probability. In addition, subgraph $T$ that we store in memory has maximum degree $b = O(1)$ and thus requires $O(n)$ space to store. Other than these, we only store a matching $M$ and augment it only using the edges stored in memory. Hence, overall, the space complexity of the algorithm is $O(n \log n)$ with high probability.

Analysis of Approximation Ratio

Let M (cid:63) be an arbitrary maximum matching of G ≥ εm . Fixing an arbitrary maximum matching of G , each of its edges appears in G ≥ εm with probability (1 − ε ), thus E | M (cid:63) | ≥ (1 − ε ) µ ( G ). Now solong as µ ( G ) ≥

20 log( n ) ε − and ε < / M (cid:63) via a Chernoﬀ bound on negativelyassociated random variables. See, e.g., [Ber20, Lemma 2.2] for the proof of the following: Observation 4.1. If µ ( G ) ≥

20 log( n ) ε − and ε < / , then Pr [ | M (cid:63) | ≥ (1 − ε ) µ ( G )] ≥ − n − . From now on, we condition on G <εm which ﬁxes subgraph H and matching M (cid:63) . We onlyassume that G <εm is chosen such that the high probability event of Observation 4.1 holds. Assumption 4.2. | M (cid:63) | ≥ (1 − ε ) µ ( G ) . Other than Assumption 4.2, we do not need any other assumption on how G <εm is chosen forthe rest of the analysis of the approximation ratio. By conditioning on the outcome of Phase I, theonly randomization that will be left, is the order with which the edges of G ≥ εm arrive in the stream.For brevity, we do not explicitly write the conditioning on G <εm for the rest of the section, but itshould be noted that all random statements are conditioned on the outcome of Phase I .Let P be the set of all augmenting paths of M H in S := M (cid:63) ∆ M H with length at most ﬁve.Note that since we regard H (and thus M H ) as given, the set P is deterministic (as it only dependson M H and M (cid:63) and not on the order of edges in G ≥ εm ). Observation 4.3.

We have $|\mathcal{P}| \geq |M^\star| - \frac{4}{3} \cdot \mu(H)$.

Proof. Let $\mathcal{P}'$ denote the set of augmenting paths of length larger than 5 in $S$. Note that there must be at least $|M^\star| - |M_H|$ augmenting paths for $M_H$ in $S$, hence $|\mathcal{P}| + |\mathcal{P}'| \geq |M^\star| - |M_H|$. Moreover, any augmenting path in $\mathcal{P}'$ must have at least 3 edges of $M_H$; thus $|\mathcal{P}'| \leq |M_H|/3$. Combining the two bounds gives

$$|\mathcal{P}| \geq |M^\star| - |M_H| - \tfrac{1}{3}|M_H| = |M^\star| - \tfrac{4}{3}|M_H| = |M^\star| - \tfrac{4}{3}\mu(H).$$

We use $G_{\mathrm{II.A}}$ to denote the subgraph of $G$ that arrives in Phase II.A and use $G_{\mathrm{II.B}}$ to denote the subgraph of $G$ that arrives in Phase II.B.

Definition 4.4.

We say an augmenting path $P \in \mathcal{P}$ is "lucky" under the following conditions:
1. If $P = \langle e_1 \rangle$ then $e_1 \in G_{\mathrm{II.B}}$.
2. If $P = \langle e_1, e_2, e_3 \rangle$ then $e_1, e_3 \in G_{\mathrm{II.A}}$.
3. If $P = \langle e_1, e_2, e_3, e_4, e_5 \rangle$ then $e_1, e_5 \in G_{\mathrm{II.A}}$ and $e_3 \in G_{\mathrm{II.B}}$.
We denote the set of lucky augmenting paths in $\mathcal{P}$ by $\mathcal{P}_L$.

Footnote: We note, however, that the randomization in $G_{<\varepsilon m}$ is crucial for arguing that the algorithm uses $O(n \log n)$ space. Here, however, we are only analyzing the approximation ratio.

The subset $\mathcal{P}_L$ of $\mathcal{P}$ is now random since it depends on the order of edges in $G_{\geq \varepsilon m}$. Lemma 4.5 below proves that a relatively large fraction of augmenting paths in $\mathcal{P}$ will turn out to be lucky with high probability. The proof is straightforward and is given in Section 4.3.

Lemma 4.5.

It holds that Pr (cid:16) |P L | ≤ γ (1 − γ ) |P| − (cid:112) µ ( G ) ln n (cid:17) ≤ n − . Next, observe that in Phase II.B of Algorithm 2 where we iteratively discover augmenting paths,we do not have the whole subgraph G II.A and have stored only a subgraph T of G II.A in the memory.In addition, when ﬁnding augmenting paths we use only the current edge e of G II.B in Algorithm 2.Therefore, not all lucky paths are actually discoverable by Algorithm 2. This motivates our nextdeﬁnition for “discoverable paths”.

Deﬁnition 4.6.

We say an augmenting path $P$ (not necessarily in $\mathcal{P}$) for $M_H$ is "discoverable" if $|P| \leq 5$, all edges of $P$ are in $M_H \cup T \cup G_{\mathrm{II.B}}$, and $P$ has at most one edge in $G_{\mathrm{II.B}}$.

The next lemma proves there are many vertex-disjoint discoverable augmenting paths, by relating them to the number of lucky augmenting paths $|\mathcal{P}_L|$. We provide the proof in Section 4.2.

Lemma 4.7.

There exists a set $\mathcal{Q}$ of vertex-disjoint discoverable augmenting paths for $M_H$ with

$$|\mathcal{Q}| \geq \frac{1}{2b + 3} \left( |\mathcal{P}_L| - \frac{4}{b} \cdot \mu(H) \right).$$

Observe that $\mathcal{Q}$ is only a set of vertex-disjoint discoverable augmenting paths. However, since Algorithm 2 applies augmenting paths greedily and in an arbitrary order, the set of applied augmenting paths may be very different from $\mathcal{Q}$. The next claim shows that we can nonetheless relate the number of augmenting paths that Algorithm 2 applies to the size of $\mathcal{Q}$.

Claim 4.8.

Let $\mathcal{Q}$ be as in Lemma 4.7. Algorithm 2 applies at least $|\mathcal{Q}|/6$ augmenting paths in Phase II.B. In other words, $|M| \geq \mu(H) + |\mathcal{Q}|/6$.

Proof. Take an augmenting path $P \in \mathcal{Q}$. Since $P$ is discoverable, there must be a moment during Phase II.B of Algorithm 2 where all the edges of $P$ are stored in memory. Note, however, that $P$ is by definition an augmenting path for $M_H$ whereas Algorithm 2 tries to augment matching $M$ (which is the result of iteratively augmenting $M_H$). The crucial observation here is that if $P$ is not an augmenting path for $M$, then at some point one of the augmenting paths that Algorithm 2 has applied on $M$ must have intersected $P$ (through a vertex). Now, recall that each augmenting path that Algorithm 2 applies has length at most five, and thus has at most six vertices. This means that any augmenting path that Algorithm 2 applies can intersect (and thus "destroy") at most six paths in $\mathcal{Q}$ (recall that $\mathcal{Q}$ is a collection of vertex-disjoint paths). Hence Algorithm 2 must apply at least $|\mathcal{Q}|/6$ augmenting paths on $M$. Since each augmenting path increases the size of $M$ by one and initially $M = M_H$, we have $|M| \geq |M_H| + |\mathcal{Q}|/6 = \mu(H) + |\mathcal{Q}|/6$.

Lemma 4.9.

There is an absolute constant ε (cid:48) > such that for any ε < . , if µ ( H ) ≤ . µ ( G ) then with probability − / poly( n ) , we have | M | ≥ µ ( H ) + ε (cid:48) · µ ( G ) .Proof. We have | M | Claim 4.8 ≥ µ ( H ) + 16 |Q| Lemma 4.7 ≥ µ ( H ) + |P L | − b µ ( H )6(2 b + 3) . (4)On the other hand, by Lemma 4.5 we know that with 1 − / poly( n ) probability, |P L | > γ (1 − γ ) |P| − (cid:112) µ ( G ) ln n (By Lemma 4.5)16 427 |P| − (cid:112) µ ( G ) ln n (Since γ = 2 / ≥ (cid:18) | M (cid:63) | − µ ( H ) (cid:19) − (cid:112) µ ( G ) ln n (By Observation 4.3) ≥ (cid:18) (1 − ε ) µ ( G ) − µ ( H ) (cid:19) − (cid:112) µ ( G ) ln n (By Assumption 4.2) > . µ ( G ) − (cid:112) µ ( G ) ln n ( ε < .

01 and µ ( H ) ≤ . µ ( G )) > . µ ( G ) . (Since µ ( G ) > c log n for any desirably large constant c .)Replacing this high probability lower bound for |P L | into (4) we get that w.h.p., | M | ≥ µ ( H ) + 0 . µ ( G ) − b µ ( H )6(2 b + 3) > µ ( H ) + 10 − µ ( G ) . (Replacing b = 500 and noting µ ( H ) ≤ . µ ( G ).)This completes the proof.We are now ready to prove that Algorithm 2, w.h.p., achieves a better-than-(2 /

3) approximation.

Lemma 4.10.

For some absolute constant $\varepsilon_0 > 0$, the matching returned by Algorithm 2 has, with probability $1 - 1/\mathrm{poly}(n)$, size at least $(2/3 + \varepsilon_0) \cdot \mu(G)$.

Proof. Let $M_O$ be the matching returned by Algorithm 2, which has size at least as large as the maximum of $|M|$ and $\mu(H \cup U)$; we thus get $|M_O| \geq \max\{|M|, \mu(H \cup U)\}$. Hence, from the lower bound of Lemma 4.9 for $|M|$, we get that there is a constant $\varepsilon' > 0$ such that, with probability $1 - 1/\mathrm{poly}(n)$,

$$|M_O| \geq \max\left\{ \mu(H) + \varepsilon' \cdot \mu(G),\ \mu(H \cup U) \right\}. \qquad (5)$$

In the next step, we employ Corollary 3.6 to argue that the lower bound above implies that $|M_O| \geq (2/3 + \Omega(1)) \cdot \mu(G)$. In particular, let us consider the subgraph $G'$ of $G$ which includes all the edges in $H$ as well as all the edges in $G_{>\varepsilon m}$. In other words, the only edges of $G$ that do not belong to $G'$ are those that arrive in Phase I and are not included in subgraph $H$. One can verify that $H$ and $U$ (constructed in Algorithm 2) satisfy the constraints of Corollary 3.6 for graph $G'$ (but not necessarily $G$, since the edges in $G - G'$ may have a small edge-degree). Corollary 3.6 thus implies that for any $\delta \in (0, 1)$, either

$$\mu(H) \geq (1 - \lambda) \cdot \left(\tfrac{2}{3} - \delta\right) \cdot \mu(G') \quad \text{or} \quad \mu(H \cup U) \geq (1 - \lambda) \cdot \left(\tfrac{2}{3} + \delta\right) \cdot \mu(G').$$

Recall that $M^\star$ is the maximum matching of $G_{>\varepsilon m}$, which is entirely included in $G'$. Also recall from Observation 4.1 that w.h.p. $|M^\star| \geq (1 - \varepsilon)\mu(G)$. Hence, w.h.p., $\mu(G') \geq (1 - \varepsilon)\mu(G)$, which combined with $\lambda = \varepsilon/128$ (Definition 2.4) simplifies the equation above to the following:

$$\mu(H) \geq (1 - O(\varepsilon)) \cdot \left(\tfrac{2}{3} - \delta\right) \cdot \mu(G) \quad \text{or} \quad \mu(H \cup U) \geq (1 - O(\varepsilon)) \cdot \left(\tfrac{2}{3} + \delta\right) \cdot \mu(G). \qquad (6)$$

Plugging (6) into (5) implies for any $\delta \in (0, 1)$ that

$$|M_O| \geq (1 - O(\varepsilon)) \cdot \min\left\{ \left(\tfrac{2}{3} - \delta\right)\mu(G) + \varepsilon' \mu(G),\ \left(\tfrac{2}{3} + \delta\right)\mu(G) \right\} = (1 - O(\varepsilon)) \cdot \min\left\{ \tfrac{2}{3} - \delta + \varepsilon',\ \tfrac{2}{3} + \delta \right\} \cdot \mu(G).$$

(Note that the inequality above takes the minimum of the two terms whereas (5) takes the maximum. This is because Corollary 3.6 only guarantees either the lower bound on $\mu(H)$ or that on $\mu(H \cup U)$, and we do not know which one holds for our instance.)

Now letting $\delta = \varepsilon'/2$, we get

$$|M_O| \geq (1 - O(\varepsilon)) \cdot \min\left\{ \tfrac{2}{3} + \tfrac{\varepsilon'}{2},\ \tfrac{2}{3} + \tfrac{\varepsilon'}{2} \right\} \cdot \mu(G) \geq (1 - O(\varepsilon)) \left( \tfrac{2}{3} + \tfrac{\varepsilon'}{2} \right) \mu(G).$$

Finally, noting that $\varepsilon$ can be made arbitrarily small (without affecting $\varepsilon'$), combined with the fact that $\varepsilon'$ is an absolute positive constant, we get that there must be some $\varepsilon_0 > 0$ such that $|M_O| \geq \left(\tfrac{2}{3} + \varepsilon_0\right)\mu(G)$ with probability $1 - 1/\mathrm{poly}(n)$.

Theorem 1 now follows immediately from this.

Observe that not all augmenting paths $P \in \mathcal{P}_L$ are discoverable. For example, if $P \in \mathcal{P}_L$ is of length five, despite its two endpoint edges $e_1$ and $e_5$ being part of $G_{\mathrm{II.A}}$ by Definition 4.4, it may still be the case that $e_1, e_5 \notin T$ and thus $e_1, e_5 \notin M_H \cup T \cup G_{\mathrm{II.B}}$, implying that $P$ may not be discoverable. To prove Lemma 4.7, however, we show in this section that for most augmenting paths $P \in \mathcal{P}_L$, we can modify $P$, particularly by changing its two endpoint edges (if any and if necessary), and turn $P$ into a discoverable augmenting path $\phi(P)$.

Take an augmenting path $P \in \mathcal{P}_L$ and recall from the definition that $\mathcal{P}_L \subseteq \mathcal{P}$ and thus $|P| \in \{1, 3, 5\}$. We define $\phi(P)$ as follows depending on the size of $P$:

• $|P| = 1$: In this case, we simply let $\phi(P) \leftarrow P$.
• $|P| = 3$: Let $\langle e_1, e_2, e_3 \rangle$ be the edges in $P$ and note that $e_2 \in M_H$ since $P$ is an augmenting path for $M_H$. If edges $e'_1, e'_3 \in T$ exist such that $\langle e'_1, e_2, e'_3 \rangle$ forms a length-three augmenting path for $M_H$, we let $\phi(P) \leftarrow \langle e'_1, e_2, e'_3 \rangle$. Otherwise, $\phi(P) \leftarrow \emptyset$.
• $|P| = 5$: Let $\langle e_1, e_2, e_3, e_4, e_5 \rangle$ be the edges in $P$. Note that $e_2, e_4 \in M_H$ since $P$ is an augmenting path for $M_H$, and $e_3 \in G_{\mathrm{II.B}}$ since $P \in \mathcal{P}_L$. Now if there are edges $e'_1, e'_5 \in T$ such that $\langle e'_1, e_2, e_3, e_4, e'_5 \rangle$ is an augmenting path for $M_H$, we let $\phi(P)$ denote this path. Otherwise, $\phi(P) \leftarrow \emptyset$.

The properties listed in Observation 4.11 are immediate consequences of the construction above:

Observation 4.11.

Let $P \in \mathcal{P}_L$ and suppose $\phi(P) \neq \emptyset$. It holds that:
1. $|\phi(P)| = |P|$.
2. If $P = \langle e_1, \ldots, e_k \rangle$ and $\phi(P) = \langle e'_1, \ldots, e'_k \rangle$ then $e_i = e'_i$ for any $2 \leq i \leq k - 1$.
3. The endpoint vertices of $\phi(P)$ are unmatched in $M_H$ since it is an augmenting path for $M_H$.
4. If $|\phi(P)| > 1$ then the two endpoint edges of $\phi(P)$ belong to $T$.
5. If $\phi(P) \neq \emptyset$, then $\phi(P)$ is discoverable.

We let $\Phi := \{\phi(P) \mid P \in \mathcal{P}_L,\ \phi(P) \neq \emptyset\}$. Although each element in $\Phi$ is a discoverable augmenting path for $M_H$, these augmenting paths may not necessarily be vertex-disjoint. In the first part of the proof, we show that a large fraction of paths in $\Phi$ are vertex-disjoint. In the second part, we show that $\Phi$ is itself large. The combination of these two gives that there is a large number of vertex-disjoint paths in $\Phi$.

A Large Fraction of Paths in Φ are Vertex-Disjoint

We first need an auxiliary claim:

Claim 4.12.

Let $P \in \mathcal{P}_L$ and $P' \in \mathcal{P}_L$ be such that $P \neq P'$, $\phi(P) \neq \emptyset$, and $\phi(P') \neq \emptyset$. Then:
1. If $\phi(P)$ and $\phi(P')$ intersect at some vertex $v$, then $v$ is an endpoint of both $\phi(P)$ and $\phi(P')$.
2. If $e \in \phi(P)$ then $e \notin \phi(P')$.

Proof. Note that $P$ and $P'$ are vertex-disjoint since both belong to $\mathcal{P}_L \subseteq \mathcal{P}$. By Observation 4.11 part 2, only the endpoint edges of $\phi(P)$ and $\phi(P')$ may differ from those of $P$ and $P'$, respectively. The combination of these two observations implies that any vertex $v$ that belongs to both $\phi(P)$ and $\phi(P')$ must be an endpoint of at least one of the two paths. Now using Observation 4.11 part 3, we get that $v$ cannot be an intermediate vertex of one path and an endpoint of the other, since an intermediate vertex must be matched in $M_H$ (as both $\phi(P)$ and $\phi(P')$ are augmenting paths for $M_H$). Hence, $v$ must be an endpoint of both $\phi(P)$ and $\phi(P')$.

To prove the second part, we know from the first part that if $e$ belongs to both $\phi(P)$ and $\phi(P')$, then both of the endpoints of $e$ must be endpoints of paths $\phi(P)$ and $\phi(P')$. This means that we should have $|\phi(P)| = |\phi(P')| = 1$ and $P = P'$, contradicting $P \neq P'$.

The next claim is the formal statement that a large fraction of paths in $\Phi$ are vertex-disjoint.

Claim 4.13.

There is a subset $\mathcal{Q} \subseteq \Phi$ such that all the augmenting paths in $\mathcal{Q}$ are vertex-disjoint and $|\mathcal{Q}| \geq \frac{1}{2b + 3} |\Phi|$, where we recall $b$ is the parameter of Algorithm 2.

Proof. We greedily construct $\mathcal{Q} \subseteq \Phi$ by iterating over the augmenting paths in $\Phi$ in an arbitrary order and including in $\mathcal{Q}$ any encountered augmenting path $\phi \in \Phi$ which does not intersect the augmenting paths already added to $\mathcal{Q}$.

Take an augmenting path $\phi(P) \in \Phi$. We know from Claim 4.12 part 1 that any other path $\phi(P') \in \Phi$ that intersects $\phi(P)$ must do so at an endpoint vertex of $\phi(P)$. Furthermore, by Claim 4.12 part 2, $\phi(P')$ and $\phi(P'')$ for $P' \neq P''$ cannot be connected to an endpoint of $\phi(P)$ via the same edge. Hence, any $\phi(P')$ intersecting $\phi(P)$ must do so via a unique edge to an endpoint of $\phi(P)$. Since the two endpoint edges of any path $\phi(P')$ of size larger than one belong to $T$ by Observation 4.11 part 4, and since the maximum degree of $T$ is $b$, there are at most $2b$ such paths intersecting $\phi(P)$. Moreover, at most one path $\phi(P')$ of length one can intersect each endpoint of $\phi(P)$, since $\phi(P') = P'$ for length-one paths and thus all of them are vertex-disjoint. Therefore, overall, $\phi(P)$ intersects at most $2b + 2$ other paths $\phi(P')$.

Now every time that we add a path $\phi(P)$ to $\mathcal{Q}$, let us remove the remaining paths in $\Phi$ that intersect $\phi(P)$. By our discussion above, every time we add a path to $\mathcal{Q}$, we remove at most $2b + 2$ other paths from $\Phi$. Hence $|\mathcal{Q}| \geq \frac{1}{2b + 3} |\Phi|$.

The Set Φ is Large

The main statement that $\Phi$ is large is formally given as Claim 4.16. Before proving it, we need two auxiliary claims, Claims 4.14 and 4.15.

Claim 4.14.

Let $P = \langle e_1, \ldots, e_k \rangle$ be an augmenting path of length three or five in $\mathcal{P}_L$. Let us denote the endpoints of $e_1$ and $e_k$ respectively by $(u_1, v_1)$ and $(v_k, u_k)$, where $v_1$ is the vertex connected to $e_2$ and $v_k$ is the vertex connected to $e_{k-1}$. If it holds that

$$\left( e_1 \in T \ \text{or} \ \deg_T(v_1) \geq 2 \right) \quad \text{and} \quad \left( e_k \in T \ \text{or} \ \deg_T(v_k) \geq 2 \right), \qquad (7)$$

then $\phi(P) \neq \emptyset$.

Proof. It suffices from our construction of $\phi(P)$ to show there are edges $e'_1, e'_k \in T$ such that $\langle e'_1, e_2, \ldots, e_{k-1}, e'_k \rangle$ is an augmenting path for $M_H$. We let $e'_1 \leftarrow e_1$ if $e_1 \in T$ and similarly let $e'_k \leftarrow e_k$ if $e_k \in T$. If $e_1 \notin T$ but still (7) holds, then $\deg_T(v_1) \geq 2$. Moreover, by construction of $T$ in Algorithm 2, the other endpoints of these two edges of $v_1$ are in $U_H$, i.e., among the vertices left unmatched by $M_H$. Note that neither of these two edges of $v_1$ is connected to the intermediate vertices of $P$, since $P$ is an augmenting path for $M_H$ and hence all of its intermediate vertices are matched by $M_H$ (and so do not belong to $U_H$). However, it could be that one of these edges is connected to the other endpoint of the augmenting path if the graph is non-bipartite. But this can happen for at most one of the edges of $v_1$ since there are no parallel edges in the graph, which leaves the other edge as a valid option for $e'_1$. In a similar way, if $e_k \notin T$, we get $\deg_T(v_k) \geq 2$ and can choose one of the edges of $v_k$ in $T$ to be $e'_k$ such that $\langle e'_1, e_2, \ldots, e_{k-1}, e'_k \rangle$ forms an augmenting path for $M_H$. This completes the proof of the claim that condition (7) suffices to get $\phi(P) \neq \emptyset$.

Claim 4.15.

Let $P \in \mathcal{P}_L$, $e_1 = (u_1, v_1)$, and $e_k = (v_k, u_k)$ be as in Claim 4.14. Suppose that condition (7) does not hold for $P$. Then $\deg_T(u_1) \geq b$ or $\deg_T(u_k) \geq b$.

Proof. We first argue that both $e_1$ and $e_k$ are part of graph $G_H$ of Phase II.A of Algorithm 2. Toward this, note that since $P \in \mathcal{P}_L$, we get from Definition 4.4 that $e_1, e_k \in G_{\mathrm{II.A}}$. Moreover, since $P$ is by definition an augmenting path for $M_H$, its endpoints $u_1, u_k$ must be unmatched in $M_H$ (implying $u_1, u_k \in U_H$) and vertices $v_1, v_k$, which are intermediate vertices of $P$, must be matched in $M_H$ (implying $v_1, v_k \in V_H$). Hence, both $e_1$ and $e_k$ must belong to $G_H$ (refer to Algorithm 2).

Now let us suppose that (7) is false because its first clause is false. That is, $e_1 \notin T$ and $\deg_T(v_1) < 2$. Since $e_1 \in G_H$, the fact that Algorithm 2 does not add $e_1$ to $T$ upon processing $e_1$ implies that either $\deg_T(v_1) \geq 2$ or $\deg_T(u_1) \geq b$ (see the description of Algorithm 2). The former cannot hold, or otherwise the first clause of (7) would not be false. Hence it should be the case that $\deg_T(u_1) \geq b$. The same argument implies that if (7) is false because of its second clause, then $\deg_T(u_k) \geq b$. The proof is thus complete.

Claim 4.16. $|\Phi| \geq |\mathcal{P}_L| - \frac{4}{b} \cdot \mu(H)$.

Proof. Let $\mathcal{X} := \{P \in \mathcal{P}_L \mid \phi(P) = \emptyset\}$. By definition, $\Phi = \mathcal{P}_L \setminus \mathcal{X}$, thus

$$|\Phi| = |\mathcal{P}_L| - |\mathcal{X}|. \qquad (8)$$

It, therefore, suffices to upper bound the size of $\mathcal{X}$. We do so by double counting the number of edges in $T$.

Recall that for any $P \in \mathcal{P}_L$, $|P| \in \{1, 3, 5\}$ by definition of $\mathcal{P}_L$. Moreover, if $|P| = 1$, then by construction $\phi(P) = P \neq \emptyset$ and thus $P \notin \mathcal{X}$. Hence for any $P \in \mathcal{X}$ it holds that $|P| \in \{3, 5\}$. Now, by Claim 4.14, condition (7) should not hold for any $P \in \mathcal{X}$. This further implies from Claim 4.15 that at least one of the endpoints of each $P \in \mathcal{X}$ must have at least $b$ edges in $T$. Since $\mathcal{X} \subseteq \mathcal{P}_L$ and all augmenting paths in $\mathcal{P}_L$ are vertex-disjoint, this means that the endpoints of paths in $\mathcal{X}$ collectively have at least $|\mathcal{X}| b$ edges in $T$. Moreover, all of these vertices must be on the $U_H = V \setminus V(M_H)$ partition of graph $G_H$, since each $P \in \mathcal{X} \subseteq \mathcal{P}_L$ is an augmenting path for $M_H$ by definition of $\mathcal{P}_L$.

Now we give an alternative way of counting the edges in $T$. Note that any vertex in partition $V_H = V(M_H)$ of $G_H$ has at most 2 edges in $T$ by construction of $T$ in Algorithm 2. Hence, the number of edges in $T$ can be upper bounded by $2 \cdot |V(M_H)| = 2 \cdot 2|M_H| = 4|M_H|$. As such, we get $|\mathcal{X}| b \leq 4|M_H|$ and thus $|\mathcal{X}| \leq 4|M_H|/b$. Plugging this upper bound for $|\mathcal{X}|$ into (8) and noting that $|M_H| = \mu(H)$ completes the proof.

We are finally ready to formally prove Lemma 4.7:

Proof of Lemma 4.7.

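Let $\mathcal{Q} \subseteq \Phi$ be as in Claim 4.13. All the paths in $\mathcal{Q}$ are vertex-disjoint. Also:

$$|\mathcal{Q}| \overset{\text{Claim 4.13}}{\geq} \frac{|\Phi|}{2b + 3} \overset{\text{Claim 4.16}}{\geq} \frac{1}{2b + 3} \left( |\mathcal{P}_L| - \frac{4}{b} \mu(H) \right).$$

The proof of Lemma 4.7 is thus complete.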
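The greedy selection used in the proof of Claim 4.13 can be sketched in Python as follows. Representing each path as a tuple of its vertices, and the names `greedy_vertex_disjoint` and `lemma_4_7_lower_bound`, are assumptions of this sketch; the proof itself only requires some arbitrary processing order.

```python
def greedy_vertex_disjoint(paths):
    """Scan paths in the given order and keep a path iff it shares no
    vertex with any path kept earlier (the greedy rule of Claim 4.13)."""
    used = set()
    kept = []
    for path in paths:
        if used.isdisjoint(path):
            kept.append(path)
            used.update(path)
    return kept


def lemma_4_7_lower_bound(num_lucky, mu_H, b):
    """Evaluate (|P_L| - (4/b) * mu(H)) / (2b + 3), the bound of Lemma 4.7."""
    return (num_lucky - 4 * mu_H / b) / (2 * b + 3)
```

With $b = 500$ the denominator $2b + 3$ equals 1003, so the guarantee is deliberately loose: the argument only needs a constant fraction of the lucky paths to survive.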

We first lower bound $\mathbb{E}|\mathcal{P}_L|$ and then prove Lemma 4.5 via a concentration bound.

Claim 4.17. $\mathbb{E}|\mathcal{P}_L| \geq \gamma^2 (1 - \gamma) |\mathcal{P}|$.

Proof. Recall again that we regard $\mathcal{P}$ as fixed, as we have conditioned on the outcome of Phase I. Now whether or not an augmenting path $P \in \mathcal{P}$ turns out to be lucky depends on the arrival ordering of the edges in $G_{\geq \varepsilon m}$. We first show that for any $P \in \mathcal{P}$,

$$\Pr[P \in \mathcal{P}_L] \geq \gamma^2 (1 - \gamma). \qquad (9)$$

(Where, recall, we hide the condition on Phase I for brevity in our probabilistic statements.)

The key insight is to note that once we condition on $G_{<\varepsilon m}$, an edge $e$ that is to arrive in Phase II belongs to $G_{\mathrm{II.A}}$ independently of the other edges of Phase II with probability $\gamma$, and belongs to $G_{\mathrm{II.B}}$ otherwise (i.e., with probability $1 - \gamma$). As already discussed at the start of Section 4, this follows from the fact that we do not fix the size of Phase II.A in Algorithm 2 but rather choose it from distribution $B((1 - \varepsilon)m, \gamma)$. Having this independence, we can prove (9) as follows:

Proof of Inequality (9). Take an augmenting path $P \in \mathcal{P}$. Since $\mathcal{P}$ includes augmenting paths of length up to five, $|P| \in \{1, 3, 5\}$. We prove (9) for all three cases one by one.

First, consider the case where $P$ is of length five and let $P = \langle e_1, e_2, e_3, e_4, e_5 \rangle$. By Definition 4.4, $P$ is lucky if $e_1, e_5 \in G_{\mathrm{II.A}}$ and $e_3 \in G_{\mathrm{II.B}}$. The former two events happen with probability $\gamma$ each and the latter happens with probability $1 - \gamma$. Since the three events, as discussed, are independent, we have

$$\Pr[P \in \mathcal{P}_L] = \gamma^2 (1 - \gamma) \qquad \forall P = \langle e_1, e_2, e_3, e_4, e_5 \rangle \in \mathcal{P}.$$

For length-three paths, only the two endpoint edges should appear in Phase II.A, hence

$$\Pr[P \in \mathcal{P}_L] = \gamma^2 \geq \gamma^2 (1 - \gamma) \qquad \forall P = \langle e_1, e_2, e_3 \rangle \in \mathcal{P}.$$

Similarly, for length-one paths, the single edge should appear in Phase II.B, hence

$$\Pr[P \in \mathcal{P}_L] = (1 - \gamma) \geq \gamma^2 (1 - \gamma) \qquad \forall P = \langle e_1 \rangle \in \mathcal{P}.$$

The combination of these cases completes the proof of inequality (9).
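For the value $\gamma = 2/3$ fixed in Algorithm 2, the three per-path probabilities above can be checked with exact arithmetic. The sketch below is only a sanity check with hypothetical variable names; the observation that $\gamma = 2/3$ maximizes $\gamma^2(1 - \gamma)$ on $[0, 1]$ is a calculus fact added here (the derivative $2\gamma - 3\gamma^2$ vanishes at $\gamma = 2/3$) and is not a claim made in the text.

```python
from fractions import Fraction

gamma = Fraction(2, 3)

# Probability of being lucky for each path length, as in Definition 4.4.
p_len5 = gamma**2 * (1 - gamma)  # both endpoint edges in II.A, middle edge in II.B
p_len3 = gamma**2                # both endpoint edges in II.A
p_len1 = 1 - gamma               # the single edge in II.B


def lucky_lower_bound(g):
    """The common lower bound gamma^2 * (1 - gamma) of inequality (9)."""
    return g**2 * (1 - g)


# Search a grid of step 1/300 on [0, 1] for the maximizer of the bound.
grid = [Fraction(i, 300) for i in range(301)]
best = max(grid, key=lucky_lower_bound)
```

Here `p_len5` evaluates to $4/27$, which is the constant that multiplies $|\mathcal{P}|$ when Lemma 4.5 is applied with $\gamma = 2/3$.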

Proof of Claim 4.17 via inequality (9). By linearity of expectation, we have

$$\mathbb{E}|\mathcal{P}_L| = \sum_{P \in \mathcal{P}} \Pr[P \in \mathcal{P}_L] \overset{(9)}{\geq} \sum_{P \in \mathcal{P}} \gamma^2 (1 - \gamma) = \gamma^2 (1 - \gamma) |\mathcal{P}|.$$

We are now ready to prove Lemma 4.5 via a simple Chernoff bound.

Proof of Lemma 4.5.

Whether or not an augmenting path P ∈ P turns out to be lucky depends onhow its odd edges belong to G II.A and G II.B . Since all the augmenting paths in P are by deﬁnitionvertex-disjoint, and since as discussed edges of G ≥ εm belong to G II.A and G II.B independently fromeach other, we get that the paths in P belong to P L independently from each other. By a simpleChernoﬀ bound (Proposition 2.1), letting δ = (cid:113)

15 ln n E |P L | >

0, we have Pr (cid:16) |P L | ≤ (1 − δ ) E |P L | = E |P L |− (cid:112) E |P L | ln n (cid:17) ≤ (cid:18) − δ · E |P L | (cid:19) ≤ − n ) = 2 n − . Since E |P L | ≥ γ (1 − γ ) |P| by Claim 4.17 and E |P L | ≤ |P| ≤ µ ( G ) this implies that Pr (cid:16) |P L | ≤ γ (1 − γ ) |P| − (cid:112) µ ( G ) ln n (cid:17) ≤ n − . We also prove a lower bound on the approximation ratio of semi-streaming algorithms for bipartitematching on random-order streams.

Theorem 3.

There is a parameter ε = Θ( / log n ) such that the following is true. Any streamingalgorithm that outputs a (1 − ε ) -approximation for maximum bipartite matching, in expectationor with constant probability, given one pass over a stream of edges of the input graph in a randomorder requires n / log log n ) space. Theorem 3 provides the ﬁrst non-trivial lower bound for approximating matching in random-order streams. Prior to our work, only a lower bound of Ω( n ) space was known for ﬁnding an exact maximum matching [CCM08].A direct corollary of this result is then the following. Corollary 5.1.

There is no semi-streaming algorithm for maximum bipartite matching that forevery ε > , achieves a (1 − ε ) -approximation in O (exp((1 /ε ) . ) · n · poly log ( n )) space. The rest of this section is dedicated to the proof of Theorem 3. The proof of this theorem isbased on a new lower bound for (robust) one-way communication complexity of matching that weprove in this paper. In the following, we ﬁrst provide the necessary background and preliminariesand then present the lower bound proof. 22 .1 Preliminaries for the Lower Bound

Ruzsa-Szemerédi graphs.

For any graph $G$, a matching $M$ of $G$ is an induced matching iff for any two vertices $u$ and $v$ that are matched in $M$, if $u$ and $v$ are not matched to each other, then there is no edge between $u$ and $v$ in $G$.

Definition 5.2 (Ruzsa-Szemerédi graph [RS78]). A graph $G$ is an $(r, t)$-Ruzsa-Szemerédi (RS) graph iff its edge set consists of $t$ pairwise disjoint induced matchings $M_1, \ldots, M_t$, each of size $r$.

RS graphs, first introduced by Ruzsa and Szemerédi [RS78], have been extensively studied as they arise naturally in property testing, PCP constructions, additive combinatorics, streaming lower bounds, etc. (see, e.g., [TV06, HW03, FLN+02, BLM93, AMS12, GKK12, Alo02, AS06, FHS17]).
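The definitions of an induced matching and of an $(r, t)$-RS graph can be checked mechanically on small examples. The following is a minimal sketch, not part of the paper: the function names `is_induced_matching` and `is_rs_decomposition` and the toy path graph are assumptions made for illustration.

```python
from itertools import combinations


def is_induced_matching(graph_edges, matching):
    """Return True iff `matching` is an induced matching of the graph."""
    edge_set = {frozenset(e) for e in graph_edges}
    m_edges = [frozenset(e) for e in matching]
    # Every matching edge must be an edge of the graph.
    if any(e not in edge_set for e in m_edges):
        return False
    # Matching edges must be vertex-disjoint.
    matched = [v for e in m_edges for v in e]
    if len(matched) != len(set(matched)):
        return False
    # No graph edge may join two matched vertices unless it is a matching edge.
    m_set = set(m_edges)
    for u, v in combinations(matched, 2):
        pair = frozenset((u, v))
        if pair in edge_set and pair not in m_set:
            return False
    return True


def is_rs_decomposition(graph_edges, matchings):
    """Check that `matchings` partition the edges into equal-size induced matchings."""
    edge_set = {frozenset(e) for e in graph_edges}
    listed = [frozenset(e) for m in matchings for e in m]
    partitions_edges = len(listed) == len(set(listed)) and set(listed) == edge_set
    equal_sizes = len({len(m) for m in matchings}) <= 1
    return partitions_edges and equal_sizes and all(
        is_induced_matching(graph_edges, m) for m in matchings
    )


# Toy example (hypothetical): the path 0-1-2-3.
path = [(0, 1), (1, 2), (2, 3)]
print(is_induced_matching(path, [(0, 1), (2, 3)]))  # False: edge (1, 2) joins matched vertices
print(is_rs_decomposition(path, [[(0, 1)], [(1, 2)], [(2, 3)]]))  # True: a (1, 3)-RS graph
```

The check is quadratic in the number of matched vertices, which is fine for toy inputs but is not meant for the large RS graphs used in the lower bound.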

Communication model.

We work in the standard two-party communication model of Yao [Yao79], and in particular in the one-way model (see the excellent textbook by Kushilevitz and Nisan [KN97] for the standard definitions). The only slight deviation is that we focus on randomly partitioned inputs, wherein the input graph is still chosen adversarially, but every edge in the graph is sent to one of the players chosen independently and uniformly at random. To our knowledge, this model was first introduced by [CCM08]. We note that the main resource of interest in this model is the communication, and in particular the players are assumed to be computationally unbounded.

In the communication problem we study for bipartite matching, we have an $n$-vertex bipartite graph $G = (L, R, E)$ whose edges are partitioned randomly into $E_A$ and $E_B$ given to Alice and Bob, respectively (both players know $L$ and $R$). The goal is to compute an approximate maximum matching of $G$ by Alice sending a single message to Bob and Bob outputting the solution. We want to understand the communication-approximation tradeoff for the problem.

We note that lower bounds on communication complexity in this model immediately imply space lower bounds for streaming algorithms in random-order streams; see, e.g., [CCM08].

Starting from [GKK12], all known super-linear-in-$n$ communication lower bounds for approximating the maximum matching problem [GKK12, Kap13, Kon15, AKLY16, AKL17, Kap21] are via constructions based on Ruzsa-Szemerédi (RS) graphs (Definition 5.2). Our work in this paper is no exception (see [GKK12] for a formal reason why RS graphs are necessary for any lower bound in the one-way model).
However, our key novelty is a way of making these constructions “robust” sothat they can be used even under the random partitioning of the input.In more details, the lower bound of [GKK12] gives Alice an RS graph with induced matchingsof size Θ( n ) each, and gives Bob an “outside” matching that matches all vertices of this RS graph,except for one of the induced matchings unknown to Alice; this construction is such that anybetter-than-(2 / i ) theRS graph edges are now partitioned between both players, and ( ii ) Alice receives a random subsetof edges in the outside matching. The ﬁrst challenge is not that problematic as Alice still receiveshalf the edges of the RS graph in expectation. But the second challenge is more serious as revealingeven a small fraction of edges in the outside matching is enough to identify the special inducedmatching to Alice, hence, enabling her to focus on sending those edges, breaking the lower bound. The only exception is the very recent work of [DK20] in a communication model that allows for edge deletions.

In order to circumvent this challenge, we replace edges of this outside matching with a new gadget based on the XOR function. We then show that if Alice misses at least one edge from every one of the XOR-gadgets during the random partitioning of the input, the identity of the special induced matching of the RS graph remains hidden to her. By picking these gadgets appropriately, we ensure that this event happens with a large probability and use this in a careful information-theoretic argument (instead of the combinatorial arguments in [GKK12]) to conclude the proof.
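The probability of the hiding event can be sanity-checked numerically. The sketch below is illustrative only and rests on stated assumptions: each bit of a gadget is encoded by two edges, each edge goes to Alice independently with probability half (the accounting used later in Claim 5.8), and the helper `illustrative_k` picks an odd $k$ of roughly $2\log_{4/3} N$, which is an assumed choice for this sketch rather than a value taken from the text.

```python
import math
import random


def illustrative_k(n_side):
    """Assumed gadget length for the sketch: an odd k of about 2 * log_{4/3}(N)."""
    return 2 * math.ceil(math.log(n_side, 4 / 3)) + 1


def gadget_fully_seen_prob(k):
    """Probability that Alice sees at least one edge of every bit of one gadget."""
    # A single bit is seen unless both of its edges go to Bob: 1 - (1/2)^2 = 3/4.
    return 0.75 ** k


def hide_failure_union_bound(num_gadgets, k):
    """Union bound on the probability that some gadget is fully seen by Alice."""
    return min(1.0, num_gadgets * gadget_fully_seen_prob(k))


def simulate_hide(num_gadgets, k, trials, seed=0):
    """Monte Carlo estimate of the probability that every gadget hides some bit."""
    rng = random.Random(seed)
    successes = 0
    for _ in range(trials):
        hidden_everywhere = True
        for _ in range(num_gadgets):
            # A bit is hidden from Alice iff both of its edges are sent to Bob.
            hides_a_bit = any(
                rng.random() < 0.5 and rng.random() < 0.5 for _ in range(k)
            )
            if not hides_a_bit:
                hidden_everywhere = False
                break
        successes += hidden_everywhere
    return successes / trials


N = 64                      # hypothetical number of RS-graph vertices per side
k = illustrative_k(N)       # 31 for N = 64
print(k, hide_failure_union_bound(2 * N, k), simulate_hide(2 * N, k, trials=200))
```

With these assumed parameters the union bound stays below $2/N$, which matches the qualitative claim that the hiding event fails only with vanishing probability as $N$ grows.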

We introduce the following gadget as a key component of our lower bound construction.

Definition 5.3 (XOR-Gadget). Let $k > 1$ be an odd integer and $(x_1, \ldots, x_k)$ be a $k$-tuple of bits. We define the XOR-gadget of $(x_1, \ldots, x_k)$ as the following graph $G^{xor}(x_1, \ldots, x_k)$:

• There are $2k$ vertices $\{s, a_1, b_1, a_2, b_2, \ldots, a_{k-1}, b_{k-1}, t\}$ in $G^{xor}$. We call $s$ the start vertex and $t$ the final vertex.

• There are $2k-2$ edges in $G^{xor}$ defined as follows using the bits $x_1, \ldots, x_k$:

  – $s$ is connected to $a_1$ if $x_1 = 0$ and otherwise is connected to $b_1$. Similarly, $t$ is connected to $a_{k-1}$ if $x_k = 0$ and otherwise is connected to $b_{k-1}$.

  – For any $i \in \{2, \ldots, k-1\}$, $a_{i-1}, b_{i-1}$ are connected to $a_i, b_i$, respectively, if $x_i = 0$ and to $b_i, a_i$ otherwise.

We use $E^{xor}(x_i)$ to denote the set of two edges in the gadget that depend on the bit $x_i$. Figure 3 gives an illustration of XOR-gadgets.

[Figure 3: (a) An example when $k = 7$, $(x_1, \ldots, x_7) = (0, 0, 1, 0, 0, 1, 0)$ and so $x_1 \oplus x_2 \oplus \cdots \oplus x_7 = 0$. (b) An example when $k = 7$, $(x_1, \ldots, x_7) = (1, 0, 1, 1, 1, 1, 0)$ and so $x_1 \oplus x_2 \oplus \cdots \oplus x_7 = 1$. Solid edges show a maximum matching of the gadget and dashed edges are the remaining edges.]
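To make Definition 5.3 concrete, the gadget can be built explicitly and its maximum matching computed by brute force. This is a sketch under assumptions: the vertex encoding (`"s"`, `"t"`, and pairs such as `("a", i)`) and the function names are hypothetical, and the exhaustive matching routine is only suitable for tiny gadgets.

```python
from itertools import product


def xor_gadget_edges(bits):
    """Edges of the XOR-gadget for an odd-length bit tuple, following Definition 5.3."""
    k = len(bits)
    assert k > 1 and k % 2 == 1, "k must be an odd integer larger than 1"
    edges = [("s", ("b" if bits[0] else "a", 1))]
    for i in range(2, k):  # bits x_2, ..., x_{k-1}
        if bits[i - 1] == 0:
            edges += [(("a", i - 1), ("a", i)), (("b", i - 1), ("b", i))]
        else:
            edges += [(("a", i - 1), ("b", i)), (("b", i - 1), ("a", i))]
    edges.append((("b" if bits[-1] else "a", k - 1), "t"))
    return edges


def max_matching_size(edges):
    """Brute-force maximum matching size (fine for the tiny gadgets used here)."""
    if not edges:
        return 0
    (u, v), rest = edges[0], edges[1:]
    without_edge = max_matching_size(rest)
    with_edge = 1 + max_matching_size([e for e in rest if u not in e and v not in e])
    return max(without_edge, with_edge)


def gadget_summary(bits):
    """Return (maximum matching size, whether every maximum matching matches t)."""
    edges = xor_gadget_edges(bits)
    size = max_matching_size(edges)
    # If deleting t strictly shrinks the optimum, every maximum matching uses t.
    size_without_t = max_matching_size([e for e in edges if "t" not in e])
    return size, size_without_t < size


print(gadget_summary((0, 0, 1, 0, 0, 1, 0)))  # XOR = 0 → (7, True)
print(gadget_summary((1, 0, 1, 1, 1, 1, 0)))  # XOR = 1 → (6, False)
```

The two printed tuples correspond to the two panels of Figure 3: when the XOR of the bits is 0 the matching has size $k$ and must use $t$, and when it is 1 the size drops to $k-1$ and $t$ can be left unmatched.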

The following two lemmas capture the main properties of XOR-gadgets for our purpose. The first lemma specifies the connection of XOR-gadgets to the maximum matching problem.

Lemma 5.4.

Let $k > 1$ be an odd integer and $G^{xor}(x_1, \ldots, x_k)$ be some XOR-gadget:

(i) if $x_1 \oplus \cdots \oplus x_k = 0$, then there is a unique maximum matching in $G^{xor}$ with size $k$ and this matching necessarily matches $t$;

(ii) if $x_1 \oplus \cdots \oplus x_k = 1$, then the maximum matching size of $G^{xor}$ is $k-1$, and there is a maximum matching in $G^{xor}$ that does not match $t$.

Proof. For this proof, it helps to refer to Figure 3. Consider the unique path $P$ starting from $s$ in $G^{xor}$. Each bit $x_i = 1$ changes the "parity" of the path from an $a$-vertex to a $b$-vertex or vice versa ($s$ and $t$ are considered $a$-vertices for the purpose of this discussion) and each $x_i = 0$ keeps this parity the same. As a result:

(i) if $x_1 \oplus \cdots \oplus x_k = 0$, then $P$ ends in $t$ and thus $G^{xor}$ consists of an odd-length path of length $k$ from $s$ to $t$ and another odd-length path of length $k-2$. The unique maximum matching of such a graph matches both $s$ and $t$ and has size $\lceil k/2 \rceil + \lceil (k-2)/2 \rceil = k$.

(ii) if $x_1 \oplus \cdots \oplus x_k = 1$, then $P$ does not end in $t$ and thus $G^{xor}$ consists of two even-length paths with $k-1$ edges each (the $2k-2$ edges of the gadget split evenly between them). Each such path necessarily leaves one of its vertices unmatched, and since $t$ is an endpoint of its path, this graph has a maximum matching of size $k-1$ that does not match $t$.

The second lemma specifies the "hiding" properties of these XOR-gadgets.

Lemma 5.5.

Let $G^{xor}(x_1, \ldots, x_k)$ be a random XOR-gadget obtained by picking each bit $x_i$ independently and uniformly at random. Suppose we partition the edges of $G^{xor}$ between Alice and Bob such that for at least one bit $x_i$, Alice has received neither of the edges in $E^{xor}(x_i)$. Then, the distribution of $x_1 \oplus \cdots \oplus x_k$ is still uniform over $\{0, 1\}$ even given Alice's edges.

Proof. Follows immediately from the fact that flipping any single bit in the XOR function, regardless of any fixed setting of the other bits, flips the value of the function.
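The hiding property of Lemma 5.5 can be verified by exhaustive enumeration for small $k$. The sketch below makes a simplifying, conservative assumption that is not stated in the paper: any bit for which Alice holds at least one edge is treated as fully known to her. The function name and the dictionary encoding of Alice's view are hypothetical.

```python
from itertools import product


def xor_counts_given_view(k, seen_bits):
    """Count bit tuples with XOR 0 and XOR 1 that agree with the bits Alice has seen.

    `seen_bits` maps 0-based bit positions to the values Alice knows.
    """
    counts = [0, 0]
    for bits in product((0, 1), repeat=k):
        if all(bits[i] == value for i, value in seen_bits.items()):
            counts[sum(bits) % 2] += 1
    return tuple(counts)


# Alice misses bits 2 and 4: both XOR values remain equally likely.
print(xor_counts_given_view(5, {0: 1, 1: 0, 3: 1}))              # → (2, 2)
# Alice sees every bit: the XOR value is fully determined.
print(xor_counts_given_view(5, {0: 1, 1: 0, 2: 0, 3: 1, 4: 1}))  # → (0, 1)
```

Because the bits are uniform and independent, equal counts translate directly into a uniform conditional distribution of the XOR value, which is the statement of the lemma.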

We now describe our distribution of input graphs. For the remainder of the proof, we will use thefollowing parameters (all parameters are deﬁned with respect to some integer N ): r := N/ , t := N / log log N ) , k := 2 · (cid:100) log (3 / N (cid:101) + 1 . (10)Let G rs be a bipartite ( r, t )-RS graph with N vertices on each side of the bipartition and inducedmatchings M rs , . . . , M rs t (this graph itself is known to both players). The existence of such RSgraph is guaranteed by the results of [FLN +

02] (see also [GKK12]). The hard distribution of theinputs is as follows; see Figure 4 for an illustration.

A hard distribution $\mathcal{G}$ of graphs.

1. Pick $j^\star \in [t]$ uniformly at random and let $M^{rs}_{j^\star}$ be the special induced matching of $G^{rs}$.

2. For any vertex $v \in G^{rs}$, let $y_v = 1$ if $v \in V(M^{rs}_{j^\star})$ and $y_v = 0$ otherwise; sample a $k$-tuple $(x_{v,1}, \ldots, x_{v,k})$ independently and uniformly at random conditioned on $x_{v,1} \oplus \cdots \oplus x_{v,k} = y_v$.

3. For any $v \in G^{rs}$, construct a vertex-disjoint XOR-gadget $G^{xor}_v(x_{v,1}, \ldots, x_{v,k})$ such that the final vertex of $G^{xor}_v$ is the same as the vertex $v$.

4. For any edge $e \in G^{rs}$, drop $e$ from the graph independently and with probability half. Let $G$ be the resulting graph.

The distribution $\mathcal{G}$ specifies the input graph $G$. The input to players is then determined by the distribution $\mathcal{P}$ that sends each edge to one of the players chosen uniformly at random.

[Figure 4: (a) A graph $G$ sampled from $\mathcal{G}$. (b) A maximum matching in $G$. An illustration of the distribution $\mathcal{G}$ of input graphs and their maximum matchings. The middle graph is the "base" RS graph and each box connected to vertices of this RS graph denotes an XOR-gadget.]

The following lemma specifies the key role of the special induced matching in this distribution.

Lemma 5.6.

For a graph $G \sim \mathcal{G}$:

(i) $\mathbb{E}[\mu(G)] \ge 2(N-r) \cdot k + 2r \cdot (k-1) + r/2$;

(ii) $\mu(G \setminus M^{rs}_{j^\star}) \le 2(N-r) \cdot k + 2r \cdot (k-1)$ with probability one.

Proof. For this proof, it helps to refer to Figure 4. Let us consider the graph $G \setminus M^{rs}_{j^\star}$ first. By Lemma 5.4, for every $v \in G^{rs} \setminus V(M^{rs}_{j^\star})$, $G^{xor}_v(x_{v,1}, \ldots, x_{v,k})$ has a matching of size $k$ since $x_{v,1} \oplus \cdots \oplus x_{v,k} = y_v = 0$ in this case. The remaining XOR-gadgets also have a matching of size $k-1$ such that no vertex of $V(M^{rs}_{j^\star})$ is matched in them (by part (ii) of Lemma 5.4). Considering these matchings are vertex-disjoint, we have a matching $M$ of size $2(N-r) \cdot k + 2r \cdot (k-1)$ in $G \setminus M^{rs}_{j^\star}$ with probability one that does not match any vertex of $M^{rs}_{j^\star}$. As a result:

(i) In $G$, there is a matching consisting of $M$ plus all edges of $M^{rs}_{j^\star}$ present in $G$. As each of the $r$ edges of $M^{rs}_{j^\star}$ is dropped w.p. half, we get the first part of the lemma.

(ii) In $G \setminus M^{rs}_{j^\star}$, the matching $M$ is already a maximum matching. This is because, by part (i) of Lemma 5.4, the unique maximum matching of each XOR-gadget $G^{xor}_v$ for $v \in G^{rs} \setminus V(M^{rs}_{j^\star})$ necessarily matches $v$; hence, if we instead match $v$ to some vertex in $G^{rs}$, there will be one unmatched vertex in $G^{xor}_v$ and thus the size of the matching does not change. As a result, the only vertices that can be matched inside $G^{rs}$ are $V(M^{rs}_{j^\star})$, but since $M^{rs}_{j^\star}$ consists of all edges between them (as $M^{rs}_{j^\star}$ is an induced matching), there is no edge left for these vertices in $G \setminus M^{rs}_{j^\star}$.

Auxiliary Random Variables and Input of Players

Let us now specify the random variables used in the distributions $\mathcal{G}$ and $\mathcal{P}$ explicitly:

• $\mathsf{J}$ and $\mathsf{Y} := \{\mathsf{Y}_v\}$ for all $v \in G^{rs}$: the index $j^\star$ of the special matching $M^{rs}_{j^\star}$ and the corresponding random bits $y_v$ for XOR-gadgets. Notice that $\mathsf{J}$ and $\mathsf{Y}$ uniquely identify each other.

• $\mathsf{X} := \{\mathsf{X}_v := (\mathsf{X}_{v,1}, \ldots, \mathsf{X}_{v,k})\}$ for all $v \in G^{rs}$: the bits in XOR-gadgets of each vertex $v$ of $G^{rs}$.

• $\mathsf{Z} := \{\mathsf{Z}_e\}$ for all $e \in G^{rs}$: $\mathsf{Z}_e = 1$ for any $e \in G^{rs}$ that was chosen in $G$ and $\mathsf{Z}_e = 0$ otherwise.

• $\mathsf{P}^{xor} := \{\mathsf{P}_e\}$ for all $e \in G^{xor}_v$ among all $v \in G^{rs}$: $\mathsf{P}_e = 1$ for any $e \in G^{xor}_v$ that was sent to Alice as part of the input under the random partitioning $\mathcal{P}$ and $\mathsf{P}_e = 0$ otherwise.

• $\mathsf{P}^{rs} := \{\mathsf{P}_e\}$ for all $e \in G^{rs}$: $\mathsf{P}_e = 1$ for any $e \in G^{rs}$ that was sent to Alice as part of the input under the random partitioning $\mathcal{P}$ and $\mathsf{P}_e = 0$ otherwise. Note that for technical reasons that will become evident shortly, we have defined $\mathsf{P}^{rs}$ as partitioning all edges of $G^{rs}$ and not only the ones with $\mathsf{Z}_e = 1$ that actually belong to the input graph.

Additionally, we have the following definitions:

• $\mathsf{X}_A$ and $\mathsf{X}_B$: we say that a bit $\mathsf{X}_{v,i}$ is represented in Alice's (resp. Bob's) input iff at least one of the edges in $E^{xor}(\mathsf{X}_{v,i})$ is given to Alice (resp. Bob) by $\mathsf{P}^{xor}$ in the partitioning of inputs (notice that $\mathsf{X}_{v,i}$ might be represented in both players' inputs); we use $\mathsf{X}_A$ and $\mathsf{X}_B$ to denote the bits represented in Alice's and Bob's inputs, respectively.

• $\mathsf{E}^{rs}_A$ and $\mathsf{E}^{rs}_B$: we say that $e \in G^{rs}$ is represented in Alice's (resp. Bob's) input iff $\mathsf{P}_e = 1$ (resp. $\mathsf{P}_e = 0$), i.e., the partitioning $\mathsf{P}^{rs}$ assigns $e$ to Alice (resp. Bob); we use $\mathsf{E}^{rs}_A$ and $\mathsf{E}^{rs}_B$ to denote the edges represented in Alice's and Bob's inputs, respectively (again, notice that by the definition of $\mathsf{P}^{rs}$ for all $e \in G^{rs}$, some edges are represented by Alice or Bob, but they may not belong to the graph $G$ to begin with).

• $\mathsf{M}^{rs}_A(j)$ and $\mathsf{M}^{rs}_B(j)$ for all $j \in [t]$: we define $\mathsf{M}^{rs}_A(j)$ and $\mathsf{M}^{rs}_B(j)$ analogously to $\mathsf{E}^{rs}_A$ and $\mathsf{E}^{rs}_B$ restricted to edges in $M^{rs}_j$; so $\mathsf{E}^{rs}_A = (\mathsf{M}^{rs}_A(1), \ldots, \mathsf{M}^{rs}_A(t))$ and $\mathsf{E}^{rs}_B = (\mathsf{M}^{rs}_B(1), \ldots, \mathsf{M}^{rs}_B(t))$.

• $\mathsf{Z}_A$ and $\mathsf{Z}_B$: we define $\mathsf{Z}_A := \{\mathsf{Z}_e\}$ for $e \in \mathsf{E}^{rs}_A$ and $\mathsf{Z}_B := \{\mathsf{Z}_e\}$ for $e \in \mathsf{E}^{rs}_B$, that is, the $\mathsf{Z}$-values for edges represented in Alice's and Bob's inputs, respectively. Similarly, for any $j \in [t]$, we define $\mathsf{Z}_A(j)$ and $\mathsf{Z}_B(j)$ analogously to $\mathsf{Z}_A$ and $\mathsf{Z}_B$ restricted to edges in $\mathsf{M}^{rs}_A(j)$ and $\mathsf{M}^{rs}_B(j)$.

We can now specify the input of Alice by the tuple $\mathsf{A} := (\mathsf{P}^{rs}, \mathsf{Z}_A, \mathsf{P}^{xor}, \mathsf{X}_A)$ and the input of Bob by $\mathsf{B} := (\mathsf{P}^{rs}, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J})$. We note that these tuples are more general than the actual input of players. In particular, $\mathsf{P}^{rs}$ specifies the partitioning of edges that may not even be part of the input and Bob is explicitly given the index $\mathsf{J}$; however, adding these more general inputs can only make our lower bounds stronger as Alice and Bob can always ignore this extra information.

Hiding property of XOR-gadgets.

The key role of XOR-gadgets in our construction is that they "hide" the identity of the special induced matching $M^{rs}_{j^\star}$ from Alice; we formalize this as follows. Define the following event:

• Event $\mathcal{E}_{hide}$: for all $v \in G^{rs}$, at least one of $(x_{v,1}, \ldots, x_{v,k})$ is not represented in Alice's input.

We note that $\mathcal{E}_{hide}$ is a deterministic function of the random variable $\mathsf{P}^{xor}$. We have,

Lemma 5.7.

Suppose event $\mathcal{E}_{hide}$ happens. Then, even conditioned on the input $\mathsf{A}$ of Alice, $\mathsf{J}$ is still chosen uniformly at random from $[t]$.

Proof. The only input of Alice which is, in principle, correlated with $\mathsf{J}$ is $\mathsf{X}_A$; in general, $\mathsf{Y}$ and $\mathsf{J}$ uniquely identify each other and $\mathsf{X}_A$ is used to determine $\mathsf{Y}$, namely, with a slight abuse of notation, $\mathsf{Y} = \mathsf{X}_A \oplus \mathsf{X}_B$. However, since under $\mathcal{E}_{hide}$, $\mathsf{X}_A$ "misses" at least one bit for every XOR-gadget, by Lemma 5.5, all choices of $\mathsf{Y}$-values (even the correlated ones obtained by picking $\mathsf{J}$) are equally likely conditioned on $\mathsf{X}_A$, proving the lemma.

Finally, an easy calculation shows that $\mathcal{E}_{hide}$ happens with high probability.

Claim 5.8. $\Pr(\mathcal{E}_{hide}) \ge 1 - o(1)$.

Proof. Fix any vertex $v \in G^{rs}$. Any bit $x_{v,i}$ is represented in Alice's input if at least one of the two edges in $E^{xor}(x_{v,i})$ is sent to Alice under $\mathcal{P}$, which happens with probability $3/4$. As such, the probability that all of $x_{v,1}, \ldots, x_{v,k}$ are represented in Alice's input is only $(3/4)^k \le 1/N^2$ by the choice of $k$ in Eq (10). A union bound on all $2N$ vertices in $G^{rs}$ finalizes the proof.

To start the analysis, we need to set up some notation.

Notation.

Throughout this section, we fix a deterministic protocol $\pi$ over $\mathcal{G}$ and random partitioning $\mathcal{P}$ with communication cost $o(r \cdot t)$. We further let $\delta$ denote the probability that $\pi$ outputs an edge that does not belong to $G$ (and thus errs). We use $\Pi$ to denote the random variable for the message sent by Alice to Bob in $\pi$. We also use $M_\pi$ to denote the random variable for the matching output by the protocol $\pi$. Considering the input of Bob is $\mathsf{B}$ and he additionally receives the message $\Pi$ from Alice, $M_\pi$ is a deterministic function of $(\mathsf{B}, \Pi)$. In the following, $H(\cdot)$ and $I(\cdot\,;\cdot)$ denote the Shannon entropy and mutual information; see Appendix A for more details.

We first bound the size of $M_\pi$ based on the information revealed by $\Pi$ to Bob about edges of the special matching $M^{rs}_{j^\star}$ that are present in Alice's input, i.e., $\mathsf{Z}_A(\mathsf{J})$. (In the following, $H$ is the binary entropy function, i.e., $H(\delta) := H(B(\delta))$ where $B(\delta)$ is a mean-$\delta$ Bernoulli random variable.)
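For intuition about the binary entropy function just defined, the following sketch evaluates $H(\delta)$ and the factor $(1 - H(\delta))^{-1}$ that multiplies the mutual-information term in the bound below. The function names are hypothetical, and the sample values of $\delta$ are chosen only for illustration.

```python
import math


def binary_entropy(delta):
    """Entropy (in bits) of a Bernoulli random variable with mean `delta`."""
    if delta in (0.0, 1.0):
        return 0.0
    return -delta * math.log2(delta) - (1 - delta) * math.log2(1 - delta)


def information_factor(delta):
    """The multiplier (1 - H(delta))^{-1}; it is undefined when H(delta) = 1."""
    h = binary_entropy(delta)
    if h >= 1.0:
        raise ValueError("H(delta) = 1, so the factor is unbounded")
    return 1.0 / (1.0 - h)


for delta in (0.01, 0.1, 0.25):
    print(delta, round(binary_entropy(delta), 4), round(information_factor(delta), 4))
```

The factor stays a constant as long as the error probability $\delta$ is bounded away from $1/2$, which is why an $o(r)$ bound on the mutual information suffices for the final argument.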

Lemma 5.9. $\mathbb{E}|M_\pi| \le 2(N-r) \cdot k + 2r \cdot (k-1) + r/4 + (1 - H(\delta))^{-1} \cdot I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{B})$.

Proof. By Lemma 5.6, $M_\pi$ can only have $2(N-r) \cdot k + 2r \cdot (k-1)$ edges outside of $M^{rs}_{\mathsf{J}}$. Hence, to prove the lemma, it suffices to bound $\mathbb{E}|M_\pi \cap M^{rs}_{\mathsf{J}}|$. By definition,

$$\mathbb{E}|M_\pi \cap M^{rs}_{\mathsf{J}}| = \mathbb{E}|M_\pi \cap \mathsf{M}^{rs}_A(\mathsf{J})| + \mathbb{E}|M_\pi \cap \mathsf{M}^{rs}_B(\mathsf{J})| \le \mathbb{E}|M_\pi \cap \mathsf{M}^{rs}_A(\mathsf{J})| + r/4,$$

as $\mathbb{E}|\mathsf{M}^{rs}_B(\mathsf{J})| = r/2$ (each of the $r$ edges goes to Bob w.p. half) and among these, again, in expectation half of them belong to $G$, i.e., have $\mathsf{Z}$-value 1 (note that we can assume without loss of generality that Bob never outputs an edge $e \in \mathsf{M}^{rs}_B(\mathsf{J})$ with $\mathsf{Z}_e = 0$ as this edge is not part of the input and thus makes the output wrong; moreover, unlike edges in Alice's input, here Bob directly knows $\mathsf{Z}_e$ and can simply remove all edges with $\mathsf{Z}_e = 0$ from $\mathsf{M}^{rs}_B(\mathsf{J})$).

To finalize the proof, we need to show

$$\mathbb{E}|M_\pi \cap \mathsf{M}^{rs}_A(\mathsf{J})| \le (1 - H(\delta))^{-1} \cdot I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{B}). \tag{11}$$

Let us condition on any choice of $\mathsf{P}^{rs} = P$ and $\mathsf{J} = j$ in $\mathsf{B} = (\mathsf{P}^{rs}, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J})$. This fixes $\mathsf{M}^{rs}_A(\mathsf{J})$ to some matching $M_A(j) \subseteq M^{rs}_j$, but $\{\mathsf{Z}_e\}$ for $e \in M_A(j)$ are still uniformly distributed as $\mathsf{Z} \perp \mathsf{P}^{rs}, \mathsf{J}$. Fix any edge $e \in M_A(j)$. For Bob to be able to output $e$ as part of $M_\pi$, the entropy of $\mathsf{Z}_e$ should be sufficiently small conditioned on $(\mathsf{B}, \Pi)$; otherwise, Bob is likely to output an edge that does not belong to the graph and thus errs. Formally, for any $e \in M_\pi \cap M_A(j)$, $\Pr(\mathsf{Z}_e = 0 \mid \Pi, \mathsf{B}) \le \delta$, which implies that

$$H(\mathsf{Z}_e \mid \Pi, \mathsf{B}) \le H(\delta). \tag{12}$$

We are going to use this to bound the information revealed about $\mathsf{Z}_A(\mathsf{J})$ by Alice's message. Let $L := L(P, j)$ denote the set of "low entropy" edges in $M_A(j)$, i.e., all edges $e \in M_A(j)$ that satisfy Eq (12) conditioned on $\mathsf{P}^{rs} = P$ and $\mathsf{J} = j$. As discussed,

$$\mathbb{E}|M_\pi \cap \mathsf{M}^{rs}_A(\mathsf{J})| \le \mathbb{E}_{P,j}|L(P, j)|. \tag{13}$$

We now bound the RHS above as follows. By the definition of $\mathsf{B} = (\mathsf{P}^{rs}, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J})$,

$$\begin{aligned}
I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{B}) &= \mathbb{E}_{P,j}\big[ I(\mathsf{Z}_A(j) ; \Pi \mid \mathsf{P}^{rs} = P, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J} = j) \big] \\
&= \mathbb{E}_{P,j}\big[ H(\mathsf{Z}_A(j) \mid \mathsf{P}^{rs} = P, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J} = j) - H(\mathsf{Z}_A(j) \mid \Pi, \mathsf{P}^{rs} = P, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J} = j) \big] \\
&= \mathbb{E}_{P,j}\big[ |M_A(j)| - H(\mathsf{Z}_A(j) \mid \Pi, \mathsf{P}^{rs} = P, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J} = j) \big] \\
&\qquad \text{(as $\{\mathsf{Z}_e\}$ for $e \in M_A(j)$ is uniformly distributed conditioned on the remaining variables)} \\
&\ge \mathbb{E}_{P,j}\Big[ |M_A(j)| - \sum_{e \in M_A(j)} H(\mathsf{Z}_e \mid \Pi, \mathsf{P}^{rs} = P, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J} = j) \Big] \\
&\qquad \text{(by the sub-additivity of entropy)} \\
&\ge \mathbb{E}_{P,j}\Big[ |M_A(j)| - \Big( |M_A(j)| - |L(P, j)| + \sum_{e \in L(P,j)} H(\mathsf{Z}_e \mid \Pi, \mathsf{P}^{rs} = P, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J} = j) \Big) \Big] \\
&\qquad \text{(by upper bounding the entropy of the terms not in $L(P, j)$ by one)} \\
&\ge \mathbb{E}_{P,j}\Big[ |M_A(j)| - \Big( |M_A(j)| - |L(P, j)| + \sum_{e \in L(P,j)} H(\delta) \Big) \Big] \\
&\qquad \text{(by the definition of $L(P, j)$ based on Eq (12))} \\
&= (1 - H(\delta)) \cdot \mathbb{E}_{P,j}|L(P, j)|.
\end{aligned}$$

Plugging in this bound in Eq (13) finalizes the proof.

The main part of the proof is to bound the mutual information term in the RHS of Lemma 5.9, i.e., show that a low communication protocol cannot reveal much information about $\mathsf{Z}_A(\mathsf{J})$ even conditioned on all the inputs of Bob.

Lemma 5.10. $I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{B}) = o(r)$.

Proof. Recall that $\mathsf{B} = (\mathsf{P}^{rs}, \mathsf{Z}_B, \mathsf{P}^{xor}, \mathsf{X}_B, \mathsf{J})$ and that any choice $P$ for $\mathsf{P}^{xor}$ determines whether or not the event $\mathcal{E}_{hide}$ happens for $\mathsf{P}^{xor} = P$. As such,

$$\begin{aligned}
I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{B}) &= \mathbb{E}_{P}\big[ I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs}, \mathsf{Z}_B, \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}) \big] \\
&\le \mathbb{E}_{P \mid \mathcal{E}_{hide}}\big[ I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs}, \mathsf{Z}_B, \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}) \big] + (1 - \Pr(\mathcal{E}_{hide})) \cdot r \\
&\qquad \text{(as this mutual information term can be at most $r$)} \\
&= \mathbb{E}_{P \mid \mathcal{E}_{hide}}\big[ I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs}, \mathsf{Z}_B, \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}) \big] + o(r),
\end{aligned} \tag{14}$$

where the final step is by Claim 5.8.

We now focus only on the cases when $\mathcal{E}_{hide}$ happens in the RHS above.
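Choosing a value $P'$ for $\mathsf{P}^{rs}$ determines $\mathsf{M}^{rs}_A(1), \ldots, \mathsf{M}^{rs}_A(t)$ and the partitioning of $\mathsf{Z}$ into $\mathsf{Z}_A$ and $\mathsf{Z}_B$. Thus, we have,

$$\begin{aligned}
\text{First term in the RHS of (14)} &= \mathbb{E}_{P,P' \mid \mathcal{E}_{hide}}\big[ I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{Z}_B, \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}) \big] \\
&\le \mathbb{E}_{P,P' \mid \mathcal{E}_{hide}}\big[ I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}) \big];
\end{aligned} \tag{15}$$

this is because the input of Alice conditioned on $\mathsf{B}$ is determined only by $\mathsf{Z}_A$ and $\mathsf{X}_A$ and both these variables are independent of $\mathsf{Z}_B$, which implies $\Pi \perp \mathsf{Z}_B \mid \mathsf{Z}_A(\mathsf{J}), \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}$, and thus we can apply Proposition A.3 to remove the conditioning on $\mathsf{Z}_B$.

Our goal now is to also remove the conditioning on $\mathsf{X}_B$. However, this is not as direct as the previous step as $(\mathsf{X}_B, \mathsf{J})$ together are correlated with the input of Alice (in particular, $\mathsf{X}_A$) and we cannot use the previous argument. Instead, we are going to show that we can in fact "switch" $\mathsf{X}_B$ with $\mathsf{X}_A$ in the conditioning above without decreasing the RHS. We claim that, for any $P, P'$,

$$I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}) \le I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}, \mathsf{X}_A);$$

this is because $\mathsf{Z}_A(\mathsf{J}) \perp \mathsf{X}_A \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}$ as $\mathsf{Z}$-values and $\mathsf{X}$-values are chosen independently (and none of the conditions correlate them); thus we can apply Proposition A.2. We can now remove $\mathsf{X}_B$ from the conditioning:

$$I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_B, \mathsf{J}, \mathsf{X}_A) \le I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{J}, \mathsf{X}_A);$$

this is because $\Pi \perp \mathsf{X}_B \mid \mathsf{Z}_A(\mathsf{J}), \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{J}, \mathsf{X}_A$ as $\Pi$ is only a function of $\mathsf{X}_A$ and $\mathsf{Z}_A$ after the conditioning, and in particular is independent of $\mathsf{X}_B$; thus we can apply Proposition A.3. By plugging in these bounds in the RHS of Eq (15), we obtain that

$$\begin{aligned}
\text{RHS of (15)} &\le \mathbb{E}_{P,P' \mid \mathcal{E}_{hide}}\big[ I(\mathsf{Z}_A(\mathsf{J}) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{J}, \mathsf{X}_A) \big] \\
&= \mathbb{E}_{P,P' \mid \mathcal{E}_{hide}}\Big[ \sum_{j=1}^{t} \Pr(\mathsf{J} = j \mid P, P') \cdot I(\mathsf{Z}_A(j) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{J} = j, \mathsf{X}_A) \Big] \\
&= \frac{1}{t} \cdot \mathbb{E}_{P,P' \mid \mathcal{E}_{hide}}\Big[ \sum_{j=1}^{t} I(\mathsf{Z}_A(j) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{J} = j, \mathsf{X}_A) \Big] \\
&\qquad \text{(as $\mathsf{J} \perp \mathsf{P}^{rs}, \mathsf{P}^{xor}$ and is uniform over $[t]$)} \\
&= \frac{1}{t} \cdot \mathbb{E}_{P,P' \mid \mathcal{E}_{hide}}\Big[ \sum_{j=1}^{t} I(\mathsf{Z}_A(j) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_A) \Big];
\end{aligned}$$

in the last step, we can drop the conditioning on the event $\mathsf{J} = j$ as the joint distribution of the remaining variables $(\mathsf{Z}_A(j), \Pi, \mathsf{X}_A)$ is independent of $\mathsf{J}$: this is because all these variables only depend on the input of Alice, while conditioned on the event $\mathcal{E}_{hide}$, by Lemma 5.7, the input of Alice is independent of $\mathsf{J}$.

We can continue the above calculations as follows:

$$\begin{aligned}
\text{RHS of (15)} &\le \frac{1}{t} \cdot \mathbb{E}_{P,P' \mid \mathcal{E}_{hide}}\Big[ \sum_{j=1}^{t} I(\mathsf{Z}_A(j) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_A) \Big] \\
&\le \frac{1}{t} \cdot \mathbb{E}_{P,P' \mid \mathcal{E}_{hide}}\Big[ \sum_{j=1}^{t} I(\mathsf{Z}_A(j) ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_A, \mathsf{Z}_A([1 : j-1])) \Big] \\
&\qquad \text{(by Proposition A.2 as $\mathsf{Z}_A(j) \perp \mathsf{Z}_A([1 : j-1]) \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_A$)} \\
&= \frac{1}{t} \cdot \mathbb{E}_{P,P' \mid \mathcal{E}_{hide}}\big[ I(\mathsf{Z}_A ; \Pi \mid \mathsf{P}^{rs} = P', \mathsf{P}^{xor} = P, \mathsf{X}_A) \big] \\
&\qquad \text{(by the chain rule of mutual information (Fact A.1-(5)))} \\
&= \frac{1}{t} \cdot \big[ I(\mathsf{Z}_A ; \Pi \mid \mathsf{P}^{rs}, \mathsf{P}^{xor}, \mathsf{X}_A) \big] \le \frac{1}{t} \cdot H(\Pi) \le o(r).
\end{aligned}$$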
(as H ( Π ) = o ( r · t ) since π only communicates o ( r · t ) many bits and by Fact A.1-(1))Plugging in this bound in Eq (15) and then in turn in Eq (14) ﬁnalizes the proof.Suppose the error probability of the protocol π , i.e., δ , is some constant bounded away fromzero. Then, Lemmas 5.9 and 5.10, together with the fact that H ( δ ) <

1, imply the following upperbound on the size of the matching output by Bob: E | M π | ≤ ( N − r ) · k + 2 r · ( k −

1) + r/ o ( r ) = E [ µ ( G )] − r/ o ( r ) , where the equality is by part ( i ) of Lemma 5.6. On the other hand, since E [ µ ( G )] ≤ (2 k + 1) · N (as number of vertices is twice this quantity), we have that E | M π | E [ µ ( G )] ≤ − r/ o ( r )(2 k + 1) · N := 1 − ε , for some ε = Θ( / log N ) (as r = N/ k = Θ(log N )).Finally, note that the number of vertices in the graph is n = (2 k + 1) · N = Θ( N · log N ). Assuch, we obtain that any deterministic protocol with communication cost o ( n / log log n ) ) which is o ( r · t ) = o ( N / log log N ) ) (for an appropriate choice of constant in the exponent), cannot outputa (1 − ε )-approximation to maximum matching in expectation (even if it is allowed to err withconstant probability by outputting an edge not in the graph). Moreover, we can immediately extendthis result to randomized protocols using the “easy direction” of Yao’s minimax principle (i.e., anaveraging argument).This concludes the proof of Theorem 3 by the connection between communication complexityand streaming lower bounds. Remark 5.11.

For simplicity of exposition, we compared the expected size of the matching of the protocol vs. a maximum matching of the input. However, a simple application of the Markov bound also implies the same result for protocols that output a $(1-\varepsilon)$-approximate matching with any constant probability of success.

Basically, the lower bound for $\mu(G)$ in Lemma 5.6 is highly concentrated (a simple Chernoff bound on edges of the special matching that belong to the graph). Also, the only term in our upper bound of $M_\pi$ in Lemma 5.9 which is not necessarily concentrated is the mutual information term, which is only $o(r)$ by Lemma 5.10; hence, by the Markov bound, $|M_\pi| \le \mu(G) - r/4 + o(r)$ with probability $1 - o(1)$ and not only in expectation.

Acknowledgements

We thank Aaron Bernstein for helpful conversations on the random-order streaming matchingproblem and several insightful comments that helped us in improving the presentation of the paper.

References

[AB19] Sepehr Assadi and Aaron Bernstein. Towards a unified theory of sparsification for matching problems. In , pages 11:1–11:20, 2019.

[ABB+19] Sepehr Assadi, MohammadHossein Bateni, Aaron Bernstein, Vahab S. Mirrokni, and Cliff Stein. Coresets meet EDCS: algorithms for matching and vertex cover on massive graphs. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, San Diego, California, USA, January 6-9, 2019, pages 1616–1635, 2019.

[AKL17] Sepehr Assadi, Sanjeev Khanna, and Yang Li. On estimating maximum matching size in graph streams. In Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2017, Barcelona, Spain, Hotel Porta Fira, January 16-19, pages 1723–1742, 2017.

[AKLY16] Sepehr Assadi, Sanjeev Khanna, Yang Li, and Grigory Yaroslavtsev. Maximum matchings in dynamic graph streams and the simultaneous communication model. In Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2016, Arlington, VA, USA, January 10-12, 2016, pages 1345–1364, 2016.

[Alo02] Noga Alon. Testing subgraphs in large graphs. Random Struct. Algorithms, 21(3-4):359–370, 2002.

[AMS12] Noga Alon, Ankur Moitra, and Benny Sudakov. Nearly complete graphs decomposable into large induced matchings and their applications. In Proceedings of the 44th Symposium on Theory of Computing Conference, STOC 2012, New York, NY, USA, May 19-22, 2012, pages 1079–1090, 2012.

[AS04] Noga Alon and Joel H Spencer. The probabilistic method. John Wiley & Sons, 2004.

[AS06] Noga Alon and Asaf Shapira. A characterization of easily testable induced subgraphs. Combinatorics, Probability & Computing, 15(6):791–805, 2006.

[Ber62] Claude Berge. The theory of graphs. Courier Corporation, 1962.

[Ber20] Aaron Bernstein. Improved bounds for matching in random-order streams. In , pages 12:1–12:13, 2020.

[BLM93] Yitzhak Birk, Nathan Linial, and Roy Meshulam. On the uniform-traffic capacity of single-hop interconnections employing shared directional multichannels. IEEE Transactions on Information Theory, 39(1):186–191, 1993.

[BS15] Aaron Bernstein and Cliff Stein. Fully dynamic matching in bipartite graphs. In Automata, Languages, and Programming - 42nd International Colloquium, ICALP 2015, July 6-10, 2015, Proceedings, Part I, pages 167–179, 2015.

[BS16] Aaron Bernstein and Cliff Stein. Faster fully dynamic matchings with small approximation ratios. In Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2016, January 10-12, 2016, pages 692–711, 2016.

[CCM08] Amit Chakrabarti, Graham Cormode, and Andrew McGregor. Robust lower bounds for communication and stream computation. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing, May 17-20, 2008, pages 641–650, 2008.

[CT06] Thomas M. Cover and Joy A. Thomas. Elements of information theory (2. ed.). Wiley, 2006.

[DK20] Jacques Dark and Christian Konrad. Optimal lower bounds for matching and vertex cover in dynamic graph streams. CoRR, abs/2005.11116. To appear in CCC 2020, 2020.

[FHM+20] Alireza Farhadi, Mohammad Taghi Hajiaghayi, Tung Mai, Anup Rao, and Ryan A. Rossi. Approximate maximum matching in random streams. In Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 1773–1785, 2020.

[FHS17] Jacob Fox, Hao Huang, and Benny Sudakov. On graphs decomposable into induced matchings of linear sizes. Bulletin of the London Mathematical Society, 49(1):45–57, 2017.

[FKM+05] Joan Feigenbaum, Sampath Kannan, Andrew McGregor, Siddharth Suri, and Jian Zhang. On graph problems in a semi-streaming model. Theor. Comput. Sci., 348(2-3):207–216, 2005.

[FLN+02] Eldar Fischer, Eric Lehman, Ilan Newman, Sofya Raskhodnikova, Ronitt Rubinfeld, and Alex Samorodnitsky. Monotonicity testing over general poset domains. In Proceedings on 34th Annual ACM Symposium on Theory of Computing, May 19-21, 2002, Montréal, Québec, Canada, pages 474–483, 2002.

[GKK12] Ashish Goel, Michael Kapralov, and Sanjeev Khanna. On the communication and streaming complexity of maximum bipartite matching. In Proceedings of the Twenty-third Annual ACM-SIAM Symposium on Discrete Algorithms, SODA '12, pages 468–485. SIAM, 2012.

[GKMS19] Buddhima Gamlath, Sagar Kale, Slobodan Mitrovic, and Ola Svensson. Weighted matchings via unweighted augmentations. In Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing, PODC 2019, Toronto, ON, Canada, July 29 - August 2, 2019, pages 491–500, 2019.

[Hal35] Philip Hall. On representatives of subsets. Journal of the London Mathematical Society, 1(1):26–30, 1935.

[HW03] Johan Håstad and Avi Wigderson. Simple analysis of graph tests for linearity and PCP. Random Struct. Algorithms, 22(2):139–160, 2003.

[Kap13] Michael Kapralov. Better bounds for matchings in the streaming model. In Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2013, New Orleans, Louisiana, USA, January 6-8, 2013, pages 1679–1697, 2013.

[Kap21] Michael Kapralov. Space lower bounds for approximating maximum matching in the edge arrival model. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2021, 2021.

[KMM12] Christian Konrad, Frédéric Magniez, and Claire Mathieu. Maximum matching in semi-streaming with few passes. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques - 15th International Workshop, APPROX 2012, and 16th International Workshop, RANDOM 2012, Cambridge, MA, USA, August 15-17, 2012. Proceedings, pages 231–242, 2012.

[KN97] Eyal Kushilevitz and Noam Nisan. Communication complexity. Cambridge University Press, 1997.

[Kon15] Christian Konrad. Maximum matching in turnstile streams. In Algorithms - ESA 2015 - 23rd Annual European Symposium, September 14-16, 2015, Proceedings, pages 840–852, 2015.

[Kon18] Christian Konrad. A simple augmentation method for matchings with applications to streaming algorithms. In , pages 74:1–74:16, 2018.

[RS78] Imre Z Ruzsa and Endre Szemerédi. Triple systems with no six points carrying three triangles. Combinatorics (Keszthely, 1976), Coll. Math. Soc. J. Bolyai, 18:939–945, 1978.

[Tut47] William T Tutte. The factorization of linear graphs. Journal of the London Mathematical Society, 1(2):107–111, 1947.

[TV06] Terence Tao and Van H Vu. Additive combinatorics, volume 105. Cambridge University Press, 2006.

[Yao79] Andrew Chi-Chih Yao. Some complexity questions related to distributive computing (preliminary report). In Proceedings of the 11th Annual ACM Symposium on Theory of Computing, April 30 - May 2, 1979, Atlanta, Georgia, USA, pages 209–213, 1979.

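A Tools from Information Theory

We shall use the following basic properties of entropy and mutual information throughout; the proofs can be found in [CT06, Chapter 2].

Fact A.1.

Let $\mathsf{A}$, $\mathsf{B}$, $\mathsf{C}$, and $\mathsf{D}$ be four (possibly correlated) random variables.

1. $0 \le H(\mathsf{A}) \le \log|\mathrm{supp}(\mathsf{A})|$. The right equality holds iff $\mathrm{dist}(\mathsf{A})$ is uniform.

2. $I(\mathsf{A} ; \mathsf{B}) \ge 0$. The equality holds iff $\mathsf{A}$ and $\mathsf{B}$ are independent.

3. Conditioning on a random variable reduces entropy: $H(\mathsf{A} \mid \mathsf{B}, \mathsf{C}) \le H(\mathsf{A} \mid \mathsf{B})$. The equality holds iff $\mathsf{A} \perp \mathsf{C} \mid \mathsf{B}$.

4. Subadditivity of entropy: $H(\mathsf{A}, \mathsf{B} \mid \mathsf{C}) \le H(\mathsf{A} \mid \mathsf{C}) + H(\mathsf{B} \mid \mathsf{C})$.

5. Chain rule for mutual information: $I(\mathsf{A}, \mathsf{B} ; \mathsf{C} \mid \mathsf{D}) = I(\mathsf{A} ; \mathsf{C} \mid \mathsf{D}) + I(\mathsf{B} ; \mathsf{C} \mid \mathsf{A}, \mathsf{D})$.

We also use the following two standard propositions.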

Proposition A.2.

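For random variables $\mathsf{A}, \mathsf{B}, \mathsf{C}, \mathsf{D}$, if $\mathsf{A} \perp \mathsf{D} \mid \mathsf{C}$, then $I(\mathsf{A} ; \mathsf{B} \mid \mathsf{C}) \le I(\mathsf{A} ; \mathsf{B} \mid \mathsf{C}, \mathsf{D})$.

Proof. Since $\mathsf{A} \perp \mathsf{D} \mid \mathsf{C}$, Fact A.1-(3) gives $H(\mathsf{A} \mid \mathsf{C}) = H(\mathsf{A} \mid \mathsf{C}, \mathsf{D})$ and $H(\mathsf{A} \mid \mathsf{B}, \mathsf{C}) \ge H(\mathsf{A} \mid \mathsf{B}, \mathsf{C}, \mathsf{D})$. Hence $I(\mathsf{A} ; \mathsf{B} \mid \mathsf{C}) = H(\mathsf{A} \mid \mathsf{C}) - H(\mathsf{A} \mid \mathsf{B}, \mathsf{C}) \le H(\mathsf{A} \mid \mathsf{C}, \mathsf{D}) - H(\mathsf{A} \mid \mathsf{B}, \mathsf{C}, \mathsf{D}) = I(\mathsf{A} ; \mathsf{B} \mid \mathsf{C}, \mathsf{D})$.

Proposition A.3.

For random variables $\mathsf{A}, \mathsf{B}, \mathsf{C}, \mathsf{D}$, if $\mathsf{A} \perp \mathsf{D} \mid \mathsf{B}, \mathsf{C}$, then $I(\mathsf{A} ; \mathsf{B} \mid \mathsf{C}) \ge I(\mathsf{A} ; \mathsf{B} \mid \mathsf{C}, \mathsf{D})$.

Proof.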