[PDF] On the sampling Lovász Local Lemma for atomic constraint satisfaction problems

Abstract

We study the problem of sampling an approximately uniformly random satisfying assignment for atomic constraint satisfaction problems i.e. where each constraint is violated by only one assignment to its variables. Let p denote the maximum probability of violation of any constraint and let \Delta denote the maximum degree of the line graph of the constraints. Our main result is a nearly-linear (in the number of variables) time algorithm for this problem, which is valid in a Lov\'asz local lemma type regime that is considerably less restrictive compared to previous works. In particular, we provide sampling algorithms for the uniform distribution on: (1) q-colorings of k-uniform hypergraphs with \Delta \lesssim q^{(k-4)/3 + o_{q}(1)}. The exponent 1/3 improves the previously best-known 1/7 in the case q, \Delta = O(1) [Jain, Pham, Vuong; arXiv, 2020] and 1/9 in the general case [Feng, He, Yin; STOC 2021]. (2) Satisfying assignments of Boolean k-CNF formulas with \Delta \lesssim 2^{k/5.741}. The constant 5.741 in the exponent improves the previously best-known 7 in the case k = O(1) [Jain, Pham, Vuong; arXiv, 2020] and 13 in the general case [Feng, He, Yin; STOC 2021]. (3) Satisfying assignments of general atomic constraint satisfaction problems with p\cdot \Delta^{7.043} \lesssim 1. The constant 7.043 improves upon the previously best-known constant of 350 [Feng, He, Yin; STOC 2021]. At the heart of our analysis is a novel information-percolation type argument for showing the rapid mixing of the Glauber dynamics for a carefully constructed projection of the uniform distribution on satisfying assignments. Notably, there is no natural partial order on the space, and we believe that the techniques developed for the analysis may be of independent interest.

Full PDF

aa r X i v : . [ c s . D S ] F e b ON THE SAMPLING LOVÁSZ LOCAL LEMMA FOR ATOMIC CONSTRAINTSATISFACTION PROBLEMS

VISHESH JAIN, HUY TUAN PHAM, AND THUY DUONG VUONG

Abstract.

We study the problem of sampling an approximately uniformly random satisfyingassignment for atomic constraint satisfaction problems i.e. where each constraint is violated by onlyone assignment to its variables. Let p denote the maximum probability of violation of any constraintand let ∆ denote the maximum degree of the line graph of the constraints.Our main result is a nearly-linear (in the number of variables) time algorithm for this problem,which is valid in a Lovász local lemma type regime that is considerably less restrictive compared toprevious works. In particular, we provide sampling algorithms for the uniform distribution on: • q -colorings of k -uniform hypergraphs with ∆ . q ( k − / o q (1) . The exponent / improves the previously best-known / in the case q, ∆ = O (1) [Jain,Pham, Vuong; arXiv, 2020] and / in the general case [Feng, He, Yin; STOC 2021]. • Satisfying assignments of Boolean k -CNF formulas with ∆ . k/ . . The constant . in the exponent improves the previously best-known in the case k = O (1) [Jain, Pham, Vuong; arXiv, 2020] and in the general case [Feng, He, Yin; STOC 2021]. • Satisfying assignments of general atomic constraint satisfaction problems with p · ∆ . . . The constant . improves upon the previously best-known constant of [Feng, He, Yin;STOC 2021].At the heart of our analysis is a novel information-percolation type argument for showing the rapidmixing of the Glauber dynamics for a carefully constructed projection of the uniform distributionon satisfying assignments. Notably, there is no natural partial order on the space, and we believethat the techniques developed for the analysis may be of independent interest. Introduction

Let X , . . . , X n denote a collection of independent random variables and let C = { C , . . . , C m } denote a collection of events depending on X , . . . , X n (here, the letter C is chosen to represent a“constraint”). For C ∈ C , let vbl( C ) ⊆ { X , . . . , X n } be such that C depends only on X i ∈ vbl( C ) .The celebrated Lovász Local Lemma (LLL) [EL73] (stated here in its variable version, symmetricform) asserts that e · p · ∆ ≤ ⇒ P [ ∧ i ∈ [ m ] C i ] ≥ (1 − e · p ) |C| > , (1.1)where C denotes the complement of the event C , e is the base of the natural logarithm, p = max i ∈ [ m ] P [ C i ] , (1.2)and ∆ ≥ satisﬁes { j ∈ [ m ] : vbl( C j ) ∩ vbl( C i ) = ∅} ≤ ∆ for all i ∈ [ m ] . (1.3)The original proof of (1.1) is non-constructive and does not provide an eﬃcient algorithm to ﬁnda point in ∧ i ∈ [ m ] C i (such a point is called a satisfying assignment). After much work over a periodof two decades (cf. [Bec91, Alo91, MR98, CS00, Sri08, Mos08, Mos09]), the landmark work of Moser nd Tardos [MT10] provided an eﬃcient (randomized) algorithm to ﬁnd a satisfying assignmentwhenever the LLL condition (i.e. the condition on the left hand side of (1.1)) is satisﬁed, providedthat one is able to eﬃciently sample from the distribution of X i , and eﬃciently able to determinethe set of constraints that are violated by a given realization of X , . . . , X n .In recent years, much attention (cf. [HSZ19, Moi19, GLLZ19, GJL19, FGYZ20, FHY21, JPV20])has been devoted to approximate counting and sampling variants of the algorithmic LLL: underconditions similar to the LLL condition, can we eﬃciently approximately count the total number ofsatisfying assignments? Can we eﬃciently sample from approximately the uniform distribution onsatisfying assignments?This problem turns out to be computationally harder than the problem of eﬃciently ﬁnding onesatisfying assignment. Indeed, consider the Boolean k - CNF-SAT problem, in which we are given n Boolean variables x , . . . , x n and m constraints C , . . . , C m such that each constraint depends onexactly k variables, and each constraint is violated by exactly one assignment (out of the k possibleassignments) to its variables. A direct application of the LLL shows that if each constraint sharesvariables with at most (approximately) k /e other constraints, then the formula has a satisfyingassignment, in which case, the algorithm of [MT10] eﬃciently ﬁnds such a satisfying assignment.However, it was shown by Bezáková et al. [BGG +

19] that it is NP -hard to approximately count thenumber of satisfying assignments for a Boolean k -CNF formula in which every variable is allowedto be present in ≥ · k/ constraints, even when the formula is monotone.On the algorithmic side, both deterministic and randomized algorithms have been devised forapproximate counting under LLL-like conditions. On the deterministic side, Moitra [Moi19] pro-vided a deterministic algorithm to approximately count the number of satisfying assignments of aBoolean k -CNF formula in which each constraint shares variables with at most ∆ . k/ otherconstraints (the . hides polynomial factors in k ), provided that k = O (1) . Moitra’s method wasextended by Guo, Liao, Lu, and Zhang [GLLZ19] to provide an eﬃcient deterministic algorithmfor approximately counting proper q -colorings of k -uniform hypergraphs with maximum degree d ,provided that q & d / ( k − and q, k = O (1) . Recently [JPV20], the authors of this paper showedthat for any instance of the variable version, symmetric form of the LLL, if each constraint dependson at most k variables and if each variable takes on at most q values, then there is an eﬃcientdeterministic algorithm to approximately count the number of satisfying assignments provided that q, k = O (1) and p ∆ . , where . hides polynomial factors in q, k . Here, p and ∆ are as in (1.2)and (1.3). In particular, this subsumes and improves upon [Moi19, GLLZ19]. We note that theseapproximate counting algorithms also lead to eﬃcient algorithms for sampling from approximatelythe uniform distribution on satisfying assignments.On the randomized side, algorithms have been devised for instances of the variable version, sym-metric LLL with atomic constraints. Here, an atomic constraint refers to a constraint which isviolated by exactly one assignment to its variables. For the special case of monotone Boolean k -CNF formulas, Hermon, Sly, and Zhang [HSZ19] showed that the Glauber dynamics mix rapidlyprovided that each variable is present in at most c k/ constraints, for some absolute constant c ;note that this matches the hardness regime from [BGG +

19] up to a constant factor. For extremal

Boolean k -CNF formulas (see [GJL19] for the deﬁnition) and for d in the entire LLL regime, themethod of partial rejection sampling due to Guo, Jerrum, and Liu [GJL19] allows eﬃcient perfect sampling from the uniform distribution on satisfying assignments. For general Boolean k -CNF for-mulas, Feng, Guo, Yin, and Zhang [FGYZ20] analyzed the Glauber dynamics on a certain “projectedspace” inspired by Moitra’s method, and obtained a near-linear time algorithm for sampling fromapproximately the uniform distribution on satisfying assignments provided that ∆ . k/ ( . hides olynomial factors in k ) – the motivation for constructing the projected space is that, while theoriginal space of satisfying assignments might not even be connected, by passing to an appropriateprojection, not only do we have connectivity, but no bottlenecks for the Glauber dynamics. At thesame time, since we want to be able to sample from the original distribution conditioned on therealisation of the projected assignment, the projection should be relatively ‘mild’ so as not to losetoo much information.Most relevant to this paper is the recent work of Feng, He, and Yin [FHY21], which introducedthe idea of ‘states compression’, thereby considerably expanding the applicability of the methodused in [FGYZ20]. We will survey their results in the next subsection when we compare themwith our own. Here, we only note that compared to the deterministic algorithms for approximatecounting, the randomized algorithms discussed here have two advantages: the running time is muchfaster (in fact, nearly linear in n ), and they are eﬃcient even when parameters such as k, q growwith n . On the other hand, the disadvantage is that so far, these methods are limited to atomicconstraints, whereas the algorithm of [JPV20] is applicable to general instances of the symmetricLLL.1.1. Our results.

We provide randomized algorithms for approximately counting the number ofsatisfying assignments and sampling from approximately the uniform distribution on satisfyingassignments for LLL instances with atomic constraints.We begin with our result for the following class of instances, which capture many interestingproblems such as Boolean k - CNF-SAT and k -hypergraph q -coloring. Later, in Theorem 1.5, wediscuss a result for general atomic constraints. Deﬁnition 1.1. A ( k, ∆ , q ) -CSP (constraint satisfaction problem) is an instance of the variableversion, symmetric LLL in which each variable X i is uniformly distributed on an alphabet Ω i ofsize q , each constraint depends on exactly k variables, and each constraint shares variables with atmost ∆ other constraints.As before, we say that a ( k, ∆ , q ) -CSP is atomic if every constraint is violated by exactly oneassignment to its variables. Note that for an atomic ( k, ∆ , q ) -CSP, the LLL asserts that if ∆ ≤ cq k ,for an absolute constant c , then there exists a satisfying assignment. Theorem 1.2.

Given an atomic ( k, ∆ , q ) -CSP on the variables X , . . . , X n , an error parameter ǫ ∈ (0 , / , and a parameter η ∈ (0 , , suppose that one of the following conditions holds.(T1) k ≥ , q ≥ q ( η ) , and ∆ ≤ c ( η ) · q ( k − / o q (1) .(T2) k = 2 , q ≥ q ( η ) , and ∆ ≤ c ( η ) · q / o q (1) .(T3) k ≥ , q ≥ , ∆ ≤ c ( η ) · q . k / ( k · q log q ) .(T4) k ≥ , q = 3 , ∆ ≤ c ( η ) · . k /k .(T5) k ≥ , q = 2 . ∆ ≤ c ( η ) · . k /k .Here q ( η ) and c ( η ) are constants depending only on η . Then, there is a randomized algorithm whichruns in time ˜ O ( n · (( n/ǫ ) η + ∆) · ∆ · k ) , where ˜ O hides polylogarithmic factors in n, ∆ , /ǫ , k , q , and outputs a random assignment X ∈ Q i ∈ [ n ] Ω i such that the distribution µ alg of X satisﬁes d TV ( µ alg , uniform-satisfying) ≤ ǫ, where uniform-satisfying denotes the uniform distribution on satisfying assignments and d TV denotesthe total variation distance between probability measures. emark. In [FHY21], analogs of (T1) and (T5) are considered. In these cases, they obtain analgorithm for sampling from approximately the distribution uniform-satisfying , and with a similarrunning time, under the more restrictive conditions:(T’1) [FHY21, Theorem 5,4] k ≥ , q ≥ q , and ∆ ≤ q ( k − / .(T’5) [FHY21, Theorem 5.5] k ≥ , q = 2 , ∆ ≤ c ( η ) · k/ . Remark.

We ﬁnd case (T1) of Theorem 1.2 remarkable since, prior to the work of Moser [Mos08],the best-known version of the existential algorithmic LLL due to Srinivasan [Sri08] required thecondition p ∆ ≤ c (for an absolute constant c , and with notation as in (1.2), (1.3)); in particular,[Sri08] does not guarantee eﬃciently ﬁnding even a single satsifying assignment in the regime (T1)(for suﬃciently large q ). The chief innovation of Moser was to use denser witness trees instead ofso-called -trees (Deﬁnition 4.3); however, in our work, we are able to bypass the ∆ barrier foratomic ( k, ∆ , q ) -CSPs, for suﬃciently large q , even while using -trees.We pause here to record a couple of particularly interesting corollaries of Theorem 1.2. Let H = ( V, E ) denote a k -uniform hypergraph with vertex set V and edge set E . Recall that a proper q -coloring of H is an assignment χ : V → [ q ] such that for every edge e , there exist u, v ∈ e with χ ( u ) = χ ( v ) . In words, no edge is monochromatic. The problem of properly q -coloring H can berecast as an atomic ( k, ∆ · q, q ) -CSP, where ∆ denotes the maximum number of edges that any edgeof H intersects. Indeed, we simply add q constraints for each edge, where constraint i for the edgeis violated if each vertex in the edge is colored with i . Then, by (T1), we have: Corollary 1.3.

Let H = ( V, E ) be a k -uniform hypergraph with k ≥ and let ∆ be deﬁned asabove. Then, for any ǫ, η ∈ (0 , , for q ≥ q ( η ) , and for ∆ ≤ c ( η ) · q ( k − / o q (1) , we can samplefrom a distribution which is ǫ -close in total variation distance to the uniform distribution on proper q -colorings of H , in time ˜ O ( n · (( n/ǫ ) η + ∆) · ∆ · k ) .Remark. This corollary improves upon [FHY21, Theorem 1.3] which requires ∆ ≤ q ( k − / o q (1) ,and on the previous best known regime (even in the bounded degree case) of ∆ . q ( k − / due to[JPV20].The next corollary follows from (T5). Corollary 1.4.

Consider a Boolean k - CNF-SAT instance on n variables x , . . . , x n such that eachconstraint shares variables with at most ∆ other constraints. Then, for any ǫ, η ∈ (0 , and for ∆ ≤ c ( η ) · . k /k , we can sample from a distribution which is ǫ -close in total variation distanceto the uniform distribution on satisfying assignments, in time ˜ O ( n · (( n/ǫ ) η + ∆) · ∆ · k ) .Remark. The constant . in the exponent is within a factor of less than of the hardness regimefrom [BGG + . from [FHY21, Theorem 1.4] and on theprevious best known constant (even in the bounded degree case) of . due to [JPV20].We now present our result for general atomic instances of the LLL. Theorem 1.5.

Given an atomic instance of the LLL, let k denote an upper bound on the numberof variables in any constraint, and let q denote an upper bound on the size of the support of anyvariable X i . Let ǫ, η ∈ (0 , . Let ∆ be as in (1.3) , p ≤ p ( η ) be as in (1.2) , and suppose that p · ∆ . o p (1) ≤ . Then, there is an algorithm which runs in time ˜ O ( n · (( n/ǫ ) η + ∆) · ∆ · k ) , here ˜ O hides polylogarithmic factors in n, ∆ , /ǫ, k, q , and outputs a random assignment X suchthat the distribution µ alg of X satisﬁes d TV ( µ alg , uniform-satisfying) ≤ ǫ. Remark.

The constant . (which has not been completely optimized and may be slightly lowered)improves upon the constant from [FHY21, Theorem 1.1]. Moreover, given a CSP for which everyconstraint is violated by at most N assignments to its variables, we can construct an atomic CSPwith at most N atomic constraints for every original constraint, and thereby obtain a result similarto Theorem 1.5, with ∆ replaced by ∆ N . We further note that the constant . is almost thesame as the constant in [JPV20]; while Theorem 1.5 only applies to atomic CSPs, its advantageis the much faster running time, as well as an LLL type condition which does not depend on k or q .1.2. Approximate counting.

Theorems 1.2 and 1.5 also imply eﬃcient algorithms for approxi-mately counting the number of satisfying assignments in the same regime. Indeed, by using thesimulated annealing reduction in [FGYZ20], one can easily show that if T ( ǫ ) is the time to obtainone sample (from a distribution which is ǫ -close in total variation distance to the uniform distri-bution), then for any δ ∈ (0 , , there is a randomized algorithm for approximately counting thenumber of satisfying assignments within a multiplicative factor of (1 + δ ) , which runs in time ˜ O (cid:16) mδ T ( ǫ m,δ ) (cid:17) , where m denotes the number of constraints i.e. m = |C| , and ǫ m,δ = Θ (cid:18) δ m log( m/δ ) (cid:19) . Techniques.

In [HSZ19], the authors showed that for a Boolean k -CNF formula which is monotone , the Glauber dynamics on the space of satisfying assignments mix rapidly outside (aconstant factor of) the hardness regime identiﬁed in [BGG + k -CNF Boolean formulas, the space of satisfying assignments may not even be connected. Toovercome this barrier, and inspired by Moitra’s approach [Moi19] of ‘marking’ variables, the work[FGYZ20] introduced the following two step procedure for sampling a uniformly random satisfyingassignment: ﬁrst, sample from the induced distribution on the so-called unmarked variables, andthen, given such a sample Y , sample from the uniform distribution on the satisfying assignmentsconditioned on the assignment to the unmarked variables being Y . The reason the last step iseasy is that, given a typical assignment to the unmarked variables, the remainder of the formulafactors into logarithmic-sized connected components, so that ordinary rejection sampling succeedswith high probability. The key, therefore, is to sample from the induced distribution on unmarkedvariables.The recent work [FHY21] introduced the idea of ‘states-compression’, which generalizes the mark-ing procedure of Moitra. Now, for each variable v with domain Ω v , one constructs a suitable map π v : Ω v → Q v , and the assignment Y now lives in Q v ∈ V Q v . Once again, the projection is to bechosen so that given a typical realisation of Y , the remainder of the formula factors into logarithmic-sized connected components on which ordinary rejection sampling succeeds with high probability,and the main part is showing that one can eﬃciently sample Y from the corresponding distribution.In order to sample Y , both [FGYZ20,FHY21] show that the Glauber dynamics for the distributionon the space Q v ∈ V Q v , induced by the uniform distribution on satisfying assignments, mix rapidly.For this, both works employ a one-step path coupling argument based on, and extending, the ar-gument from [Moi19]. However, showing that the one-step path coupling is contracting requiresadditional assumptions on the relationship between p and ∆ , and indeed, this is the main reasonfor the degradation of the dependence between p and ∆ in the ﬁnal results of [FGYZ20, FHY21] and also in [Moi19, GLLZ19]). Furthermore, establishing contraction of the one-step path couplingrequires considerable case analysis for diﬀerent ranges of the parameters – for instance, the mixingfor the regimes corresponding to Theorem 1.2 and Theorem 1.5 are analyzed separately in [FHY21].The main contribution of our work is to completely dispense with the path coupling analysis of this‘projected Glauber dynamics’ and instead, to devise a novel information-percolation based argumentwhich avoids the need to consider worst-case neighborhoods of a vertex. Such an argument is alsothe crux of [HSZ19] (which, in turn, is inspired by the argument in [LS16]). However, compared to[HSZ19], we critically do not have monotonicity at our disposal. This makes certain ‘sandwiching’arguments inaccessible, and consequently, necessitate developing a careful and somewhat elaboratenotion of combinatorial structures, which we call minimal discrepancy checks (Deﬁnition 5.6). Ina nutshell, the information-percolation argument is based on the fact that if the maximal one-step coupling of the Glauber dynamics fails to couple at some time, then there must already beanother discrepancy between the conﬁgurations at that time. By tracking the evolution of thesediscrepancies back in time, we show, in fact, that the origin of the failure of the one-step maximalcoupling can be attributed to the appearance of a minimal discrepancy check. By analyzing theseminimal discrepancy checks with considerable care, we then show that they occur with probabilitywhich is essentially just low enough (Proposition 5.9) so as to overcome the union bound on thenumber of possible minimal discrepancy checks. We expect this part of our argument to also beuseful in other contexts.Introducing minimal discrepancy checks allows us to handle the projected Glauber dynamics in allregimes in a uniﬁed manner. Another component which facilitates this, and also contributes to ourimproved quantitative estimates, is the notion of admissible projection schemes (Deﬁnition 3.2) – incontrast to [FHY21], the conditions we demand of the projections π v : Ω v → Q v are seemingly morecomplicated, but these are exactly the conditions which show up in the analysis of the algorithm,and therefore, avoid unnecessary degradation of the parameters. Once the correct conditions forthe admissible projection schemes are identiﬁed (and this is the non-trivial part), the argument forthe existence itself is a standard (although a necessarily quite careful) probabilistic argument.1.4. Organization.

The remainder of this paper is organized as follows. In Section 2, we recordsome preliminary notions related to the Lovász local lemma and constraint satisfaction problems.In Section 3, we introduce admissible projection schemes and state the result (Proposition 3.3)guaranteeing the existence of admissible projection schemes in the LLL regime. The proof ofProposition 3.3, which is initiated in Section 3 is completed in Section 6. In Section 4, we present ourmain sampling algorithm. The main result in this section is Theorem 4.1, which implies Theorem 1.2and Theorem 1.5. The key ingredient required for the proof of Theorem 4.1 is Proposition 4.2. Thisis proved in Section 5, which is the key section of the paper.2.

Preliminaries

Lovász Local Lemma.

The LLL provides a suﬃcient condition guaranteeing that the prob-ability of avoiding a collection C of “bad events” in a probability space is positive. When the LLLcondition (1.1) is satisﬁed, the so-called LLL distribution, µ S [ · ] := P [ · | ∧ C ∈C C ] is well-deﬁned (here, the subscript S is chosen to represent “satisfying”). For later use, we recorda standard comparison between the LLL distribution µ S [ · ] and the original distribution P [ · ] on theprobability space i.e. the product distribution on X , . . . , X n . For any event B in the probabilityspace, let Γ( B ) = { C ∈ C : vbl( B ) ∩ vbl( C ) = ∅} . heorem 2.1 (cf. [HSS11, Theorem 2.1]) . Under (1.1) , for any event B in the probability space, µ S [ B ] ≤ P [ B ] Y C ∈ Γ( B ) (1 − e · P [ C ]) − . We also record here the following algorithmic version of the Lovász Local Lemma, which followsdirectly from Moser-Tardos algorithm [MT10] and is also used in [Moi19, FGYZ20, FHY21].

Theorem 2.2 ([MT10]) . Under (1.1) , for any δ ∈ (0 , , there exists a randomized algorithm whichoutputs, with probability at least − δ , a satisfying assignment in time O ( n ∆ k log(1 /δ )) , where k = max C ∈C | vbl( C ) | .Proof. By [MT10], under (1.1), there exists a randomized algorithm which ﬁnds a satisfying assign-ment in at most |C| ∆ ≤ n steps in expectation, where each step has time complexity O (∆ k ) . ByMarkov’s inequality, if we run this algorithm for n steps, then with probability at least / , thealgorithm returns a satisfying assignment. The desired conclusion now follows by running log(1 /δ ) independent copies of this algorithm for n steps each. (cid:3) Constraint satisfaction problems.

Let V denote a collection of variables with ﬁnite domains (Ω v ) v ∈ V satisfying | Ω v | ≥ for all v ∈ V . A constraint on V is a map C : Y v ∈ V Ω v → { True , False } . We say that C depends on a variable v ∈ V if there exist σ , σ ∈ Q v ∈ V Ω v diﬀering only in v suchthat C ( σ ) = C ( σ ) . For every constraint C , we ﬁx vbl( C ) ⊆ V containing all variables that C depends on. A constraint satisfaction problem (CSP) is speciﬁed by Φ = ( V, (Ω v ) v ∈ V , C ) , where C isa collection of constraints. Given a constraint satisfaction problem, we say that σ ∈ Q v Ω v satisﬁes Φ if and only if C ( σ ) = True for all C ∈ C . We deﬁne the degree ∆ of a CSP to be ∆ = max C ∈C |{ C ′ ∈ C : vbl( C ) ∩ vbl( C ′ ) = ∅}| . We say that C is an atomic constraint if | C − (False) | = 1 . A CSP Φ is said to be atomic if every C ∈ C is an atomic constraint. Popular examples of atomicconstraint satisfaction problems are: • k -CNF-SAT. Here, Ω v = { , } for all v ∈ V and | vbl( C ) | = k for all C ∈ C . • k -Hypergraph q -coloring. Let H = ( V, E ) denote a k -uniform hypergraph. To each vertex v ∈ V , we assign a color in [ q ] such that no hyperedge is monochromatic. This correspondsnaturally to an atomic CSP Φ = ( V, (Ω v ) v ∈ V , C ) with Ω v = [ q ] for all v ∈ V and C = { C e,i : e ∈ E , i ∈ [ q ] } where for σ ∈ [ q ] V , C e,i ( σ ) = False ⇐⇒ σ ( w ) = i ∀ w ∈ e. To every CSP, we associate an instance of the LLL as follows: the random variables are X , . . . , X v ,where each X i is uniformly distributed on Ω i . To each constraint C ∈ C , we associate the event ( σ ∈ Y v ∈ V Ω v : C ( σ ) = False ) . We will abuse notation and denote this event by C and the collection of all such events by C . inally, for a CSP Φ , we let µ Φ denote the LLL distribution of the associated LLL instance i.e. µ Φ is the uniform distribution on satisfying assignments of Φ . When the underlying CSP is clearfrom context, we will omit the subscript and denote µ Φ simply by µ .3. Projection schemes

Preliminaries.

Given a CSP

Φ = ( V, (Ω v ) v ∈ V , C ) , a projection scheme is a collections of maps π v : Ω v → Q v , where Q v is a ﬁnite alphabet with | Q v | ≥ . We will frequently denote the collection ( π v ) v ∈ V simply by π . We let P π denote the product distribution on Q v ∈ V Q v induced via π by the uniformdistribution on Q v ∈ V Ω v . We also let µ π denote the distribution on Q v ∈ V Q v induced via π by µ = µ Φ .Let Φ be an atomic CSP. Recall that this means that for each C ∈ C , there exists some C ∈ Q v ∈ vbl( C ) Ω v such that X ∈ Q v ∈ V Ω v does not satisfy C if and only if X ( v ) = C ( v ) ∀ v ∈ vbl( C ) . Given an atomic CSP Φ and a projection scheme π , for every C ∈ C , we deﬁne C π ∈ Q v ∈ vbl( C ) Q v by C π ( v ) = π v ( C ( v )) ∀ v ∈ vbl( C ) . This naturally leads to a CSP Φ π = ( V, ( Q v ) v ∈ V , C π ) , where for each C ∈ C , there is a constraint C π ∈ C π such that for Y ∈ Q v ∈ V Q v , C π ( Y ) = False ⇐⇒ Y ( v ) = C π ( v ) ∀ v ∈ vbl( C ) . Motivated by this, for a constraint C π ∈ C π , v ∈ vbl( C π ) := vbl( C ) and Y ∈ Q v ∈ V Q v , we say that Y ( v ) does not satisfy C π if and only if Y ( v ) = C π ( v ) .For a constraint C ∈ C , let b ( C ) := max Y ∈ Q v ∈ V Q v Y u ∈ vbl( C ) P [ X ( u ) = C ( u ) | Y ] , where Y = π ( X ) . Equivalently, b ( C ) = Y u ∈ vbl( C ) | π − u ( C π ( u )) | − . Let b := max C ∈C b ( C ) . Also, let q := max v ∈ V,Y ∈ Q u ∈ V Q u d TV ( P π [value( v ) = · ] , µ π [value( v ) = · | Y − v ]) , where Y − v denotes the | V | − dimensional vector obtained by removing Y ( v ) from Y .The following useful bound on the conditional marginals of µ π follows from Theorem 2.1. Lemma 3.1.

Let Φ be an atomic CSP and let π be a projection scheme. Suppose that e · b · ∆ ≤ .Then for any v ∈ V and any partial assignment Z ∈ Q u ∈ V \{ v } Q u , µ π [value( v ) = · | Z ] ≤ (1 − b ) − ∆ P π [value( v ) = · ] . roof. Consider the product distribution P Z on Q v ∈ V Ω v where each coordinate u ∈ V \ { v } is dis-tributed according to P [ X ( u ) = ·| π u ( X ( u )) = Z ( u )] and the v th coordinate is uniformly distributedon Ω v . The P Z probability that a constraint C ∈ C is not satisﬁed is at most b ( C ) by deﬁnition. Let µ Z,S denote the distribution on satisfying assignments of Φ induced by P Z . Then, since e · b · ∆ ≤ ,we have by Theorem 2.1 that µ π [value( v ) = · | Z ] = P [ π v ( X ( v )) = · | π u ( X ( u )) = Z ( u ) , X satisﬁes all C ∈ C ]= µ Z,S [ π v ( X ( v )) = · ] ≤ P Z [ π v ( X ( v )) = · ] Y C ∈C : v ∈ vbl( C ) (1 − e · P Z [ C ]) − ≤ P π [value( v ) = · ](1 − b ) − ∆ . (cid:3) Let Φ be an atomic CSP and let π be a projection scheme. For each constraint C ∈ C , let vbl( C ) denote the set of variables v in C for which | Q v | > . Also, for C ∈ C , let ζ ( C ) := max v ∈ vbl( C ) (cid:18) , min (cid:18) (1 − b ) ∆ q P π [value( v ) = C π ( v )] , (cid:19)(cid:19) Admissible projection schemes.

The next deﬁnition isolates the class of projection schemeswe will be interested in.

Deﬁnition 3.2.

Let Φ be an atomic CSP and let π be a projection scheme. Let η ∈ (0 , / . Wesay that π is admissible if(A1) b ≤ η/ (300∆) .(A2) There exists κ ≥ such that for any C ∈ C , | vbl( C ) | · κ · ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ (60000∆) − . Furthermore, κ ≤ K (log ∆ + log q + log k ) for a universal constant K , for q = max v ∈ V | Q v | ,and for k = max C ∈C | vbl( C ) | .(A3) For any v ∈ V and C, C ′ ∈ C with v ∈ vbl( C ) ∩ vbl( C ′ ) , P π [value( v ) = C π ( v )] ≤ P π [value( v ) = C ′ π ( v )] ≤ P π [value( v ) = C π ( v )] . (A4) For v ∈ V , π ( v ) can be computed in time K log | Ω v | , and for any q ∈ Q v a uniform value in π − v ( q ) can be sampled in time K log | Ω v | , where K is a universal constant. Remark.

Note that the condition b ≤ η/ (300∆) for η < / in (A1) guarantees that (1 − b ) − ∆ ≤ b ∆ ≤ η/ . The following is the main result of this section.

Proposition 3.3.

Let

Φ = ( V, (Ω v ) v ∈ V , C ) be an atomic CSP. Suppose that at least one of thefollowing holds:(1) | Ω v | = A ≥ A for all v ∈ V , | vbl( C ) | = k ≥ for all C ∈ C , and ∆ ≤ A g ( k ) − o A (1) , where A is a constant depending only on η , and g ( k ) = max (cid:8) k − , k (cid:9) .(2) | Ω v | = A = 2 for all v ∈ V , | vbl( C ) | = k ≥ for all C ∈ C , and ∆ ≤ cA . k /k where c is a constant depending only on η .(3) | Ω v | = A = 3 for all v ∈ V , | vbl( C ) | = k ≥ for all C ∈ C , and ∆ ≤ cA . k /k where c isa constant depending only on η . | Ω v | = A ≥ for all v ∈ V , | vbl( C ) | = k ≥ for all C ∈ C , and ∆ ≤ cA . k − / ( k log A ) ,where c is a constant depending only on η .(5) ∆ ≤ p − . o p (1) , where p ≤ c for a constant c depending only on η .Then, there exists an admissible projection scheme π = ( π v ) v ∈ V with π v : Ω v → Q v . Moreover, forany δ ∈ (0 , , this projection scheme can be constructed, with probability at least − δ , in time O ( n ∆ k log(1 /δ )) , where k = max C ∈C | vbl( C ) | .Proof. We give here the complete proof of

Case 1 . The proofs of the remaining cases are deferredto Section 6.Let R = ⌊ A / ⌋ . For each v ∈ V , we let Q v = [ R ] and deﬁne the projection π v arbitrarily so thatthe preimage of each element in Q v has size either ⌊ A/R ⌋ or ⌈ A/R ⌉ . Clearly this projection schemecan be constructed in time O (1) , π ( v ) can be computed in time O (log A ) , and for any q ∈ Q v , auniformly random element of π − v ( q ) can be returned in time O (log A ) . This conﬁrms (A4).Assuming that A is a suﬃciently large constant depending on η and that ∆ ≤ A g ( k ) − o A (1)

In this section, we present and analyze our main sampling algorithm

Main (Φ , π, ǫ ) .Let Φ = ( V, (Ω v ) v ∈ V , C ) be an atomic constraint satisfaction problem and let π = ( π v ) v ∈ V with π v : Ω v → Q v be an admissible projection scheme. In addition to the notation introduced inSections 2 and 3, we will also make use of the following notation. For a subset V ′ ⊆ V and a partialassignment Y ∈ Q v ∈ V ′ Q v , we let C ( Y ) denote the set of constraints which are not satisﬁed by Y .Recall that this is the set of constraints C π such that Y ( v ) = C π ( v ) for all v ∈ vbl( C ) ∩ V ′ . We let G ( Y ) denote the graph whose vertex set is C ( Y ) and such that C = C ′ ∈ C ( Y ) are connected if andonly if vbl( C ) ∩ vbl( C ′ ) = ∅ . Also, let H ( Y ) denote the graph whose vertex set is V and such that u = v ∈ V are connected if and only if there exists some C ∈ C ( Y ) for which { u, v } ∈ C . Finally,for each connected component H ′ of H ( Y ) , let C ( H ′ ) = { C ∈ C ( Y ) : vbl( C ) ⊆ H ′ } .The following, which is similar to the algorithm considered in [FGYZ20, FHY21], is our main sam-pling algorithm. Throughout, we assume that ∆ , n ≥ c . , where c . is an absolute constantdetermined by Proposition 4.2. Main(Φ , π, ǫ ) : The algorithm takes as input an atomic CSP Φ , an admissible projection scheme π , and an error parameter ǫ ∈ (0 , / , and outputs either ERROR or a satisfying assignment X ∈ Q v ∈ V Ω v .(M1) Initialize Y ∈ Q v ∈ V Q v by sampling Y ( v ) independently and uniformly at random from Q v for each v ∈ V .(M2) Let T = C . κn log( n ∆ /ǫ ) , where C . is an absolute constant determined by Proposition 4.2.For each t ∈ { , . . . , T } , given Y t − , generate Y t by choosing v t ∈ V uniformly at random,setting Y t ( u ) = Y t − ( u ) for all u = v t , and setting Y t ( v t ) = Sample( Y t − , v ) .(M3) Return InvSample( Y T ) .The main result of this section, which, together with Proposition 3.3, immediately implies Theorems 1.2and 1.5 is the following. Theorem 4.1.

Let Φ be an atomic CSP and let π be an admissible projection scheme as inDeﬁnition 3.2. Let q = max v ∈ V | Ω v | , k = max C ∈C | vbl( C ) | . Then, for any ǫ ∈ (0 , / , Main(Φ , π, ǫ ) runs in time ˜ O ( n · (( n/ǫ ) η + ∆) · ∆ · k ) , where ˜ O hides polylogarithmic factors in n, ∆ , /ǫ, k, q , and outputs either a random satisfying as-signment of Φ or ERROR . Denoting by µ alg the distribution produced by the algorithm and by µ Φ the uniform distribution on satisfying assignments of Φ (trivially extended to take on the value ERROR with probability ), we have that d TV ( µ alg , µ Φ ) ≤ ǫ. The algorithm

Main(Φ , π, ǫ ) uses two subroutines, Sample( Y t − , v ) and InvSample( Y T ) , which wenow describe in detail. Sample(

Y, v ) : This subroutine takes as input Y ∈ Q v ∈ V Q v and v ∈ V , and returns an elementof Q v .Consider the partial assignment Y − v and the corresponding graph H ( Y − v ) . Let H v denote the(maximal) connected component of v in H ( Y − v ) . S1) If |C ( H v ) | >

20∆ log( nκ/ǫ ) , output a uniformly random element of Q v . If we are in this case,we say that Sample(

Y, v ) fails due to (S1).(S2) Otherwise, |C ( H v ) | ≤

20∆ log( nκ/ǫ ) . Let S = 10( κn/ǫ ) η log( nκ/ǫ ) .For each s = 1 , . . . , S , do the following: • For each u ∈ H v , u = v , sample independently and uniformly at random a value X ( u ) ∈ π − u ( Y ( u )) . Sample independently and uniformly at random from Ω v a value X ( v ) . Denote the resulting | H v | -dimensional vector by X ( H v ) . • If X ( H v ) satisﬁes C ( H v ) , then terminate and output π v ( X ( v )) . Otherwise, (i) if s = S ,then go to the next bullet point, (ii) if s < S , then skip the next bullet point and increment s by . • If we reach this bullet point (i.e. X ( H v ) does not satisfy C ( H v ) for all s = 1 , . . . , S ), thenoutput a uniformly random element of Q v . In this case, we say that Sample(

Y, v ) failsdue to (S2). InvSample( Y ) : This subroutine takes as input Y ∈ Q v ∈ V Q v and returns either ERROR or anassignment in Q v ∈ V Ω v .Consider Y ∈ Q v ∈ V Q v and the graph H ( Y ) .(I1) If any (maximal) connected component H ′ of H ( Y ) has |C ( H ′ ) | >

20∆ log( nκ/ǫ ) , output ERROR . In this case, we say that

InvSample fails due to (I1).(I2) Otherwise, for each (maximal) connected component H ′ of H ( Y ) , we have |C ( H ′ ) | ≤

20∆ log( nκ/ǫ ) .Let S = 10( κn/ǫ ) η log( nκ/ǫ ) . Let H , . . . , H ℓ denote an enumeration of the (maximal) con-nected components of H ( Y ) .For j = 1 , . . . , ℓ , do the following:For each s = 1 , . . . , S , do the following: • For each u ∈ H j , sample independently and uniformly at random a value X ( u ) ∈ π − u ( Y ( u )) .Denote the resulting | H j | -dimensional vector by X ′ ( H j ) . • If X ′ ( H j ) satisﬁes C ( H j ) , then set X | H j = X ′ ( H j ) . Return to the outermost for loop (in j ) with j incremented by . • If X ′ ( H j ) does not satisfy C ( H j ) and s < S , then return to the for loop in s with s incremented by . • If X ′ ( H j ) does not satisfy C ( H j ) and s = S , then terminate both for loops and return ERROR . In this case, we say that

InvSample( Y ) has failed for H j .(I3) Output X .We say that InvSample( Y ) fails due to (I2) if it fails for any connected component H , . . . , H ℓ .4.1. The distribution of Y t . The main step in the analysis of the algorithm is the followingproposition, which shows rapid mixing for the Glauber dynamics for the distribution µ π . Comparedto the works [FGYZ20, FHY21], we are able to establish rapid mixing of the Glauber dynamics fora much wider class of projection schemes – this is done by abandoning the path coupling approachof [FGYZ20, FHY21] and instead devising an information-percolation type argument extending theapproach in [HSZ19] from the monotone Boolean case to general ﬁnite alphabets and without anymonotonicity assumption. Proposition 4.2.

There exist absolute constants c . , C . ≥ for which the following holds. Let ( Y t ) t ≥ denote the Glauber dynamics for µ π starting at an arbitrary initial state Y . Then, for any δ ∈ (0 , / and for T = C . κn log( n ∆ /δ ) , the total variation distance between the distribution of Y T and the distribution µ π is at most δ , provided that n ≥ c . . The proof of this key proposition is the content of Section 5 and constitutes the bulk of thispaper. .2. Connected components of the projected CSP.

To control failures due to (S1) and (I1),we will show that, with high probability, the projected chain Y t satisﬁes the property that forevery connected component H ′ of H ( Y t ) , we have |C ( H ′ ) | ≤

20∆ log( nκ/ǫ ) . As in [Moi19, FGYZ20,FHY21,JPV20], our analysis uses -trees, which were ﬁrst used in a similar context by Alon [Alo91]. Deﬁnition 4.3.

Let G = ( V, E ) be a graph and let d G ( · , · ) denote the graph metric. A set ofvertices T ⊆ V is called a -tree if for every u = v ∈ T , d G ( u, v ) ≥ , and such that if we addan edge between all pairs of vertices u, v ∈ T with d G ( u, v ) ≤ , then the resulting graph on T isconnected.We will need the following result on the number of -trees of a prescribed size which contain agiven vertex. Lemma 4.4 (cf. [FGYZ20, Corollary 5.7]) . Let G = ( V, E ) be a graph with maximum degree ∆ .For any v ∈ V , the number of -trees in G which contain v and have size ℓ is at most ( e ∆ ) ℓ − . Here, e is the base of the natural logarithm.The next (standard) lemma shows that large -trees exist in graphs of bounded maximum degree.We include the proof for the reader’s convenience. Lemma 4.5.

Let G = ( V, E ) be a graph with maximum degree ∆ . Let H = ( V ( H ) , E ′ ) be aconnected subgraph of G and let v ∈ V ( H ) . Then, there exists a -tree T with v ∈ T ⊆ V ( H ) suchthat |T | ≥ | V ( H ) | / (∆ + 1) .Proof. We construct such a -tree greedily. Let T = { v } , H = V ( H ) , and v = v . In the i th -step,for i ≥ , let H i = H i − \ ( { v i − }∪ N G ( v i − )) , and choose (if possible) v i to be some vertex in H i suchthat d G ( T i − , v i ) = 2 . We then let T i = T i − ∪ { v i } . Observe that H i = V ( H ) \ ( T i − ∪ N G ( T i − )) .We claim that if we cannot ﬁnd v i satisfying d G ( T i − , v i ) = 2 , then it must be the case that H i = ∅ . Indeed, assume that H i is nonempty. Since H i = V ( H ) \ ( T i − ∪ N G ( T i − )) is notempty, and since H is connected, there must exist some u ∈ H i for which d G ( T i − , u ) > and some u ′ ∈ T i − ∪ N G ( T i − ) such that there is an edge between u and u ′ . Note that u ′ / ∈ T i − , since u ′ ∈ T i − means that u ∈ N G ( T i − ) , a contradiction. Hence, d G ( T i − , u ′ ) = 1 so that d G ( T i − , u ) ≤ , whichcombined with the previous lower bound gives d G ( T i − , u ) = 2 .Thus, when our construction terminates, we have a -tree T satisfying T ∪ N G ( T ) ⊇ V ( H ) . Sincethe maximum degree in G is ∆ , it follows in particular that |T | ≥ | V ( H ) | / (∆ + 1) , as desired. (cid:3) With this preparation, we are ready to bound the probability of failure due to (S1) or (I1).

Proposition 4.6.

Fix ≤ t ≤ T . Then, with probability at least − ( ǫ/κn ) , for every connectedcomponent H ′ of H ( Y t ) , we have |C ( H ′ ) | <

20∆ log( nκ/ǫ ) .Proof. By the law of total probability, it suﬃces to prove the result after conditioning on the choiceof variables v , . . . , v t chosen to be updated in the ﬁrst t steps. Suppose for contradiction that thereexists a connected component H ′ of H ( Y t ) with |C ( H ′ ) | ≥

20∆ log( nκ/ǫ ) =: α . Then, by deﬁni-tion, there must exist a connected component of G ( Y t ) of size at least α containing a constraint C ∗ ∈ C ( H ′ ) and a variable v ∗ with v ∗ ∈ vbl( C ∗ ) (in particular, v ∗ ∈ H ′ ). Since the maximum degreeof G ( Y t ) is ∆ , it follows from Lemma 4.5 that there exists a -tree T ∗ of constraints in C ( Y t ) suchthat C ∗ ∈ T ∗ and |T ∗ | ≥ α/ (∆ + 1) . We now proceed to bound the probability of appearance ofsuch a -tree.By Lemma 4.4, the number of -trees in G ( Y t ) which are rooted at C ∗ and which have size ℓ is atmost ( e ∆ ) ℓ − . The main observation is the following: ﬁx any such -tree T . Then, by deﬁnition,for every constraint C ∈ T and for every v ∈ vbl( C ) , we must have Y t ( v ) = C π ( v ) . Moreover, or any variable v , letting t v denotes the last time before (and including) t that the value of v wasupdated (note that t v is determined by our conditioning), we have (by the assumed distribution of Y and the description of the subroutine Sample(

Y, u ) ) that the value of v at t v is chosen from oneof two distributions: • The uniform distribution on Q v , or • The distribution µ π [value( v ) = · | Y − vt v − ] .In either case, the probability that Y t v ( v ) = C π ( v ) is at most (1 − b ) − ∆ P π [value( v ) = C π ( v )] –indeed, this is true for the uniform distribution even without the (1 − b ) − ∆ factor, whereas for theother case, this follows immediately from Lemma 3.1. Since the last time before (and including) t that each variable is updated is determined by our conditioning, and since diﬀerent constraints in T are disjoint, it follows that the probability that none of the constraints in T are satisﬁed is atmost Y C ∈T Y v ∈ vbl( C ) ((1 − b ) − ∆ P [ Y ( v ) = C π ( v )]) ≤ Y C ∈T (3000∆) − ≤ (3000∆) − ℓ , where the ﬁrst inequality uses condition (A2) of Deﬁnition 3.2. Therefore, by the union bound, itfollows that the probability that there exists some C ∗ ∈ C ( Y t ) and a -tree T ∗ rooted at C ∗ suchthat |T ∗ | ≥ α/ (∆ + 1) := ℓ , and such that none of the constraints in T ∗ are satisﬁed, is at most n ∆ · ( e ∆ ) ℓ − · (3000∆) − ℓ , where the ﬁrst factor accounts for the number of choices for C ∗ and the second factor is fromLemma 4.4. Since by our choice of ℓ , n ∆ · ( e ∆ ) ℓ − · (3000∆) − ℓ ≤ ( ǫ/κn ) , we have the required assertion. (cid:3) Rejection sampling.

Having seen that the probability of failure due to (S1) or (I1) is verylow, we now show that the probability of failure due to (S2) or (I2) is also very low.

Proposition 4.7.

Let V ′ ⊆ V and let Y ∈ Q v ∈ V ′ Q v be a partial assignment. Let H ′ be a connectedcomponent of H ( Y ) and suppose that the size of H ′ is at most

20∆ log( nκ/ǫ ) . Let X be obtained bysampling each X ( v ) independently and uniformly from π − v ( Y ( v )) for each v ∈ H ′ ∩ V ′ , and from Ω v for each v ∈ H ′ ∩ ( V \ V ′ ) . Then, the probability that X satisﬁes all constraints C ∈ C ( H ′ ) is atleast ( nκ/ǫ ) − η . Here, η is the parameter appearing in Deﬁnition 3.2. Proof.

By deﬁnition of X , the probability that a constraint C ∈ C ( H ′ ) is not satisﬁed by X is atmost b ( C ) = Y v ∈ vbl( C ) | π − v ( C π ( v )) | . Let G denote the event that X satisﬁes all constraints C ∈ C ( H ′ ) . Since b ( C ) ≤ b for all C andsince b ≤ η by (A1) of Deﬁnition 3.2, it follows from Theorem 2.1 and (A1) that P [ G ] ≥ (1 − b ) |C ( H ′ ) | ≥ (1 − b )

20∆ log( nκ/ǫ ) ≥ ( nκ/ǫ ) − η . (cid:3) We immediately obtain the following corollary.

Corollary 4.8.

Fix ≤ t ≤ T . The probability that Sample fails due to (S2) at time t is at most ( ǫ/κn ) . Moreover, the probability that InvSample fails due to (I2) is at most ( ǫ/κn ) . roof. Recall that S = 10( nκ/ǫ ) η log( nκ/ǫ ) . By Proposition 4.7, the probability that Sample failsdue to (S2) at time t is at most (1 − ( κn/ǫ ) − η ) S ≤ exp (cid:0) − κn/ǫ ) − η ( κn/ǫ ) η log( nκ/ǫ ) (cid:1) = ( ǫ/κn ) . Moreover, again by Proposition 4.7, the probability that

InvSample fails due to any connectedcomponent H ′ with |C ( H ′ ) | ≤

20∆ log( nκ/ǫ ) is at most (1 − ( κn/ǫ ) − η ) S ≤ ( ǫ/κn ) . Therefore, by the union bound over all (at most n ) maximal connected components of H ( Y ) , theprobability that InvSample fails due to (I2) is at most ( ǫ/κn ) . (cid:3) Analysis of the main algorithm.

The proof of Theorem 4.1 now follows readily.

Proof of Theorem 4.1.

Let µ alg be the distribution on { satisfying assignments of Φ } ∪ { ERROR } given by the output of the algorithm Main(Φ , π, ǫ ) . Also, let µ alg ′ be the distribution on { satisfying assignments of Φ } ⊆ Y v ∈ V Ω v given by µ alg ′ [ x ] := µ S [ X = x | π ( X ) = Y T ] , where Y T is generated by running the Glauber dynamics for µ π for T steps, starting from Y (where Y is as in (M1)).The relation between the distributions µ alg and µ alg’ is as follows: let G T denote the event thatnone of the calls to Sample fail (either due to (S1) or (S2)) and that the call to

InvSample also doesnot fail (either due to (I1) or (I2)). Observe that µ alg | G T = µ alg ′ . Therefore, by the characterization of the total variation distance in terms of coupling (cf. [LP17,Proposition 4.7]), we have d TV ( µ alg , µ alg ′ ) ≤ Pr[ G T ] ≤ T ( ǫ/κn ) + ( ǫ/κn ) + ( T + 1)( ǫ/κn ) ≤ ( ǫ/κn ) , where the second line follows from Proposition 4.6 and Corollary 4.8, and the third line follows fromthe value of T and since κ ≥ log(∆) ((A2) of Deﬁnition 3.2).Moreover, by Proposition 4.2, we know that d TV ( µ π , Y T ) ≤ ǫ , from which it immediately follows (again by [LP17, Proposition 4.7]) that d TV ( µ alg ′ , µ Φ ) ≤ ǫ . Therefore, by the triangle inequality we have that d TV ( µ alg , µ Φ ) ≤ ǫ. It remains to analyze the running time of the algorithm. Let q = max v ∈ V | Ω v | , k = max C ∈C | vbl( C ) | . ach call to Sample takes time ˜ O ((( n/ǫ ) η + ∆) · ∆ · k · log q ) , where ˜ O hides polylogarithmic factors in n, ∆ , ǫ − . This is because we require ˜ O (∆ · k · log q ) timefor checking whether or not |C ( H v ) | ≤

20∆ log( nκ/ǫ ) , and in case the upper bound holds, thenﬁnding this component. In the latter case, for each iteration, we require time ˜ O ( k · ∆ · log q ) tosample X ( H v ) and time ˜ O ( k · ∆ · log q ) to check whether X ( H v ) satisﬁes C ( H v ) . Therefore, (M1)and (M2) take time ˜ O ( n · (( n/ǫ ) η + ∆) · ∆ · k · log q ) Moreover, by a similar analysis as for

Sample , the call to

InvSample also takes time ˜ O ( n · (( n/ǫ ) η + ∆) · ∆ · k · log q ) , so that the running time of the algorithm is ˜ O ( n · (( n/ǫ ) η + ∆) · ∆ · k · log q ) , as desired. (cid:3) Glauber dynamics for the projected distribution: Proof of Proposition 4.2

Preliminaries.

Throughout this section, we ﬁx an atomic CSP

Φ = ( V, (Ω v ) v ∈ V , C ) and anadmissible projection scheme π = ( π v ) v ∈ V with π v : Ω v → Q v . Recall that µ π is the distribution on Q v ∈ V Q v induced via π by the uniform distribution on satisfying assignments, µ = µ Φ on Q v ∈ V Ω v .In this section, which is the main innovation of our work, we study the mixing of the Glauberdynamics for the distribution µ π . Recall that the Glauber dynamics is a discrete time Markov chainon the state space Q v ∈ V Q v whose transitions are as follows: given the current state Y , choose auniformly random vertex v ∈ V and move to the state Y ′ where Y ′ ( w ) = Y ( w ) ∀ w = vY ′ ( v ) ∼ µ π [value( v ) = · | Y − v ] . It is standard that this chain is aperiodic and reversible with respect to µ π and by using the condition e · b · ∆ < along with the LLL, it is also easily seen (cf. [FHY21, Proposition 8.1]) that this chainis irreducible. Therefore (cf. [LP17, Corollary 1.17]), µ π is the unique stationary distribution of thischain.We denote the Glauber dynamics for µ π by ( Z t ) t ≥ . Given X , Y , let ( X t , Y t ) t ≥ denote acoupling of two copies of Z t starting from X and Y . For this coupling, let τ couple = min { t ≥ X t = Y t } . It is well known (cf. [LP17, Theorem 5.4]) that max Z d TV ( Z t , µ π ) ≤ max X ,Y inf couplings P [ τ couple ≥ t ] , (5.1)where the inﬁmum is taken over all couplings ( X t ) t ≥ and ( Y t ) t ≥ of two copies of the Glauberdynamics with initial states X and Y respectively. Thus, our goal in this section is to show thatfor any X , Y , there is a coupling ( X t , Y t ) t ≥ which coalesces quickly with high probability.In fact, the coupling that we will analyze is the optimal one-step coupling of the chains. Recallthat this coupling is constructed as follows: given the current state ( X t − , Y t − ) , we choose auniformly random vertex v ∈ V (in this case, we say that v is updated at time t ) and move to thestate ( X t , Y t ) where X t ( w ) = X t − ( w ) ∀ w = vY t ( w ) = Y t − ( w ) ∀ w = v X t ( v ) , Y t ( v )) is sampled from the optimal coupling of ( µ π [value( v ) = · | X − vt − ] , µ π [value( v ) = · | Y − vt − ]) . Hence, throughout the remainder of this section, ( X t , Y t ) t ≥ will always denote the optimal one-step coupling of two copies of Z t starting at X and Y respectively.We partition time into blocks of size H = 100 κ · n, where κ ≥ is the parameter appearing in (A2) of Deﬁnition 3.2. By time block K , we mean thetime interval [ HK, H ( K + 1)) . We will also need the following notation. Let C = C × Z ≥ . Recallthat G ( C ) denotes the graph whose vertex set consists of constraints C ∈ C and there is an edgebetween C = C ′ ∈ C if and only if vbl( C ) ∩ vbl( C ′ ) = ∅ Let U = { ( v, t ) ∈ V × Z ≥ : v is updated at time t } . Recall that this means that v is the vertex chosen by the Glauber dynamics when the current statesare X t − and Y t − . Recall also that by the deﬁnition of the optimal one-step coupling, the samevertex is chosen to be updated at a given time t in both chains. Let D = { ( v, t ) ∈ V × Z ≥ : X t ( v ) = Y t ( v ) } . We call D the set of discrepancies .Given a time interval I , we denote by V ( I ) the set of variables v ∈ V which are not updated in I and by V + ( I ) the set of variables v ∈ V which are updated at least κ | I | /n times in I .Finally, for Z ∈ Q v ∈ V Q v , for S ⊆ V , and for C ∈ C , we say that S does not satisfy C in Z if Z ( v ) = C π ( v ) ∀ v ∈ vbl( C ) ∩ S, and that a partial assignment Z ′ does not satisfy C if Z ′ ( v ) = C π ( v ) ∀ v ∈ vbl( C ) for which Z ′ ( v ) is deﬁned . Discrepancy checks.

Our argument for bounding τ couple will be based on showing that itis very unlikely for certain combinatorial structures, which we call minimal discrepancy checks , toarise from the randomness of the choice of updates driving the Glauber dynamics. To this end, webegin by deﬁning the notion of a discrepancy check . Deﬁnition 5.1.

Let v ∈ V and T be an integer in [ HK, H ( K +1)) (in particular, T is in time block K ). For i ≥ , let T i = H ( K − i ) . A discrepancy check D starting at ( v , T ) consists of a sequenceof elements ( C , T ) , ( C , T ) , . . . , ( C K − , T K − ) ∈ C , a sequence of elements v , . . . , v K − ∈ V , acollection of induced oriented paths P , . . . , P K − in G ( C ) , and a collection of Boolean variables f , . . . , f K − satisfying the following properties.(D1) ( v , T ) ∈ D , v ∈ vbl( C ) and vbl( C ) \ { v } does not satisfy C in at least one of X − v T − and Y − v T − .(D2) f = 1 , v ∈ vbl( C ) , and P is an induced path oriented from C to C . Additionally, thefollowing properties are satisﬁed. • ( v , T ) ∈ D and either X T ( v ) or Y T ( v ) is equal to ( C ) π ( v ) . • For each constraint C ′ in P \ { C } , there exists T ′ ∈ [ T , T ) such that at least one ofthe following holds. – The subset vbl( C ′ ) does not satisfy C ′ in at least one of X T ′ and Y T ′ . – There exists v ′ ∈ vbl( C ′ ) such that ∗ The subset vbl( C ′ ) \ { v ′ } does not satisfy C ′ in at least one of X T ′ and Y T ′ ,and ∗ v ′ is updated in ( T ′ , T ) , and ∗ The ﬁrst update at time t ′ > T ′ of v ′ results in a discrepancy, and ∗ There exists some C ′′ ∈ C such that either X t ′ ( v ′ ) or Y t ′ ( v ′ ) is equal to C ′′ π ( v ′ ) . e call the induced oriented path P from C to C the -leg of the discrepancy check.(D3) For each ≤ i ≤ K − , given C i , T i , v i , and P i , if v i is not updated in ( T i +1 , T i ] , then f i +1 = 0 , C i +1 = C i , v i +1 = v i , and P i +1 = { C i } = { C i +1 } .(D4) For each ≤ i ≤ K − , given C i , T i , v i , and P i , if v i is updated in ( T i +1 , T i ] , then f i +1 = 1 and P i +1 is an induced path oriented from C i +1 to C i +1 . We require that the following propertiesare satisﬁed. • v i +1 ∈ vbl( C i +1 ) , ( v i +1 , T i +1 ) ∈ D , and either X T i +1 ( v i +1 ) or Y T i +1 ( v i +1 ) is equal to ( C i +1 ) π ( v i +1 ) . • C i +1 shares a variable with some constraint in P i . None of the constraints C ′ ∈ P i +1 \{ C i +1 } share variables with any constraints in P i . • For each C ′ ∈ P i +1 , there exists T ′ ∈ [ T i +1 , T i ) such that at least one of the followingholds. – The subset vbl( C ′ ) does not satisfy C ′ in at least one of X T ′ or Y T ′ . – There exists v ′ ∈ vbl( C ′ ) such that ∗ The subset vbl( C ′ ) \ { v ′ } does not satisfy C ′ in at least one of X T ′ and Y T ′ ,and ∗ v ′ is updated in ( T ′ , T i ) , and ∗ The ﬁrst update at time t ′ > T ′ of v ′ results in a discrepancy, and ∗ There exists some C ′′ ∈ C such that either X t ′ ( v ′ ) or Y t ′ ( v ′ ) is equal to C ′′ π ( v ′ ) .We call the induced oriented path P i +1 from C i +1 to C i +1 the ( i + 1) -leg of the discrepancycheck.For later use, we record the following simple lemma. Lemma 5.2.

Let D be a discrepancy check starting from ( v , T ) , where T is in time block K .Then, for all ≤ i ≤ K − , ( v i , T i ) ∈ D .Proof. By assumption, ( v , T ) ∈ D and ( v , T ) ∈ D . Suppose for contradiction that there is some ≤ i ≤ K − such that ( v i , T i ) / ∈ D and let i ∗ denote the smallest such index. Then, by theﬁrst bullet point of (D4), we cannot have f i ∗ = 1 . Therefore, we must have f i ∗ = 0 , in whichcase v i ∗ = v i ∗ − . But since ( v i ∗ − , T i ∗ − ) ∈ D by the minimality of i ∗ and since v i ∗ = v i ∗ − is notupdated in ( T i ∗ , T i ∗ − ] due to the condition f i ∗ = 0 , it follows that necessarily, ( v i ∗ , T i ∗ ) ∈ D , whichcontradicts the deﬁnition of i ∗ . (cid:3) Constructing a discrepancy check.

In this subsection, we show that whenever ( v , T ) ∈ U ∩ D , there must exist a discrepancy check starting at ( v , T ) . Proposition 5.3.

Let ( v , T ) ∈ U ∩ D . Then there exists a discrepancy check D starting at ( v , T ) . We divide the proof into a couple of lemmas.

Lemma 5.4.

Under the optimal one-step coupling of the Glauber dynamics for µ π , if ( v, t ) ∈ U ∩ D ,then there exists a path C , C , . . . , C k in G ( C ) such that • v ∈ vbl( C ) . • Each C i is not satisﬁed in at least one of X − vt − and Y − vt − . • vbl( C k ) contains some u = v satisfying X t − ( u ) = Y t − ( u ) .Remark. Since C k is not satisﬁed by at least one of X − vt − and Y − vt − and since u = v , it follows inparticular that either X t − ( u ) or Y t − ( u ) is equal to C k π ( u ) . Proof.

Let C ′ denote those constraints which are not satisﬁed by at least one of X − vt − and Y − vt − . Let G ′ ( C ′ ) be graph on the vertex set C ′ induced by the graph G ( C ) . It is clear that the distributions π [value( v ) = · | X − vt − ] and µ π [value( v ) = · | Y − vt − ] depend only on the restrictions of X − vt − (respec-tively Y − vt − ) to the connected component of v in G ′ ( C ′ ) . Therefore, if the connected component of v in G ( C ′ ) does not contain any variable u = v for which X t − ( u ) = Y t − ( u ) , then under the opti-mal coupling of the Glauber dynamics, we must necessarily have X t ( v ) = Y t ( v ) , which contradicts ( v, t ) ∈ U ∩ D . (cid:3) The next lemma, which is more involved, shows how to inductively build a discrepancy check.

Lemma 5.5.

For ≤ ℓ ≤ K − , given ( C ℓ , T ℓ ) ∈ C , v ℓ ∈ V , and P ℓ satisfying the properties inDeﬁnition 5.1, there exist ( C ℓ +1 , T ℓ +1 ) ∈ C , f ℓ +1 , v ℓ +1 ∈ V and P ℓ +1 satisfying the properties of the ( ℓ + 1) -leg in Deﬁnition 5.1.Proof. If v ℓ is not updated in ( T ℓ +1 , T ℓ ] , then the choice f ℓ +1 = 0 , v ℓ +1 = v ℓ , C ℓ +1 = C ℓ and P ℓ +1 = { c ℓ +1 } satisﬁes (D3) and we are done.Otherwise, v ℓ is updated in ( T ℓ +1 , T ℓ ] . We claim that there exists an induced oriented path ofconstraints P = ( C , . . . , C k ) with the following properties:(Q1) v ℓ ∈ vbl( C ) .(Q2) There exists some v ℓ +1 ∈ vbl( C k ) such that ( v ℓ +1 , T ℓ +1 ) ∈ D and either X T ℓ +1 ( v ℓ +1 ) or Y T ℓ +1 ( v ℓ +1 ) is equal to ( C k ) π ( v ℓ +1 ) .(Q3) For each constraint C ∈ P , there exists some T ′ ∈ [ T ℓ +1 , T ℓ ) such that at least one of thefollowing holds. • The subset vbl( C ) does not satisfy C in at least one of X T ′ and Y T ′ . • There exists v ′ ∈ vbl( C ) such that – The subset vbl( C ) \ { v ′ } does not satisfy C in at least one of X T ′ and Y T ′ , and – v ′ is updated in ( T ′ , T ℓ ) , and – The ﬁrst update at time t ′ > T ′ of v ′ results in a discrepancy, and – There exists some C ′′ ∈ C such that either X t ′ ( v ′ ) or Y t ′ ( v ′ ) is equal to C ′′ π ( v ′ ) .We now show that such a path exists. This is the main step in the proof.Let T ′ ℓ be the last update of v ℓ in the interval ( T ℓ +1 , T ℓ ] . Since ( v ℓ , T ℓ ) ∈ D (Lemma 5.2), itfollows that ( v ℓ , T ′ ℓ ) ∈ D and hence in ( v ℓ , T ′ ℓ ) ∈ U ∩ D . Therefore, letting T = T ′ ℓ − , it followsby Lemma 5.4 that there exists a path P = ( C , . . . , C s ) in G ( C ) such that v ℓ ∈ vbl( C ) , each C j is not satisﬁed by at least one of X − v ℓ T and Y − v ℓ T , and there exists v = v ℓ with v ∈ vbl( C s ) and ( v , T ) ∈ D . In particular, either X T ( v ) or Y T ( v ) is equal to ( C s ) π ( v ) . By choosing such apath of minimum length, we may assume that P is an induced oriented path in G ( C ) . We havethe following two cases. Case I: T = H ( K − ℓ −

1) = T ℓ +1 . Then, P ℓ +1 = P satisﬁes (Q1), (Q2), and (Q3). Indeed,(Q1) and (Q2) (with v ℓ +1 = v ) are clear. Moreover, the constraints C j for j ≥ satisfy theﬁrst bullet point of (Q3). Finally, the constraint C satisﬁes the second bullet point of (Q3) with v ′ = v ℓ , T ′ = T ℓ +1 = T , t ′ = T + 1 = T ′ ℓ , and C ′′ = C ℓ . Indeed, we know by the ﬁrst bulletpoint of (D4) that either X T ℓ ( v ℓ ) or Y T ℓ ( v ℓ ) is equal to ( C ℓ ) π ( v ℓ ) and since t ′ = T ′ ℓ is the time ofthe last update to v ℓ before T ℓ , it must be the case that either X t ′ ( v ′ ) or Y t ′ ( v ′ ) is equal to ( C ℓ ) π ( v ′ ) . Case II: T > H ( K − ℓ −

1) = T ℓ +1 . By induction, suppose that for j ≥ , we have an inducedoriented path P j in G ( C ) with P j = ( C j , . . . , C js j ) , a variable v j ∈ V , and T ℓ ≥ T j > H ( K − ℓ − with the following properties:(R1) v ℓ ∈ vbl( C j ) . R2) v j ∈ vbl( C js j ) , ( v j , T j ) ∈ D , and either X T j ( v j ) or Y T j ( v j ) is equal to ( C js j ) π ( v j ) .(R3) For each constraint C ∈ P j , there exists some T ′ ∈ [ T ℓ +1 , T ℓ ) such that at least one of thefollowing holds. • The subset vbl( C ) does not satisfy C ′ in at least one of X T ′ or Y T ′ . • There exists v ′ ∈ vbl( C ) such that – The subset vbl( C ) \ { v ′ } does not satisfy C in at least one of X T ′ and Y T ′ , and – v ′ is updated in ( T ′ , T ℓ ) , and – The ﬁrst update at time t ′ > T ′ of v ′ results in a discrepancy, and – There exists some C ′′ ∈ C such that either X t ′ ( v ′ ) or Y t ′ ( v ′ ) is equal to C ′′ π ( v ′ ) .Let T j be the last update of v j with T j ≤ T j . We have two cases. Case 1 : If T j ≤ T ℓ +1 , then the path P j satisﬁes the required properties (Q1), (Q2), (Q3).Indeed, (R1) implies (Q1), (R3) implies (Q3), and (R2) implies (Q2) since T j ≤ T ℓ +1 implies that X T j ( v j ) = X T ℓ +1 ( v j ) and Y T j ( v j ) = Y T ℓ +1 ( v j ) . Case 2 : T j > T ℓ +1 . Since ( v j , T j ) ∈ D by (R2), we must have ( v j , T j ) ∈ U ∩ D . Therefore, byLemma 5.4, there exists an induced path P ′ j = ( C ′ j , . . . , C ′ js ′ j ) with v j ∈ vbl( C ′ j ) , each C ′ ji is notsatisﬁed by at least one of X − v j T j +1 and Y − v j T j +1 , where T j +1 = T j − , and there exists v j +1 = v j with v j +1 ∈ vbl( C ′ js ′ j ) and ( v j +1 , T j +1 ) ∈ D . Concatenating P j with P ′ j gives an oriented path from C j to C ′ js ′ j . By taking a sub-path between these endpoints which is an induced oriented path in G ( C ) ,we get P j +1 , which satisﬁes properties (R1), (R2), and (R3) with j + 1 . Indeed, (R1) follows fromthe assumption (R1) for P j , (R2) follows from Lemma 5.4 and the remark following it whereas (R3)follows from Lemma 5.4 and the assumptions (R2) and (R3) for P j .Note that, by construction, we have T ℓ +1 ≤ T j +1 < T j . If T j +1 = T ℓ +1 , then, as before, P j +1 satisﬁes the properties (Q1), (Q2), and (Q3). If T j +1 > T ℓ +1 , then we repeat, noting that theprocess is guaranteed to terminate in ﬁnitely many steps since the sequence ( T j ) j ≥ is strictly de-creasing before termination. This completes the proof of our claim about the existence of an inducedoriented path satisfying (Q1), (Q2), and (Q3).Now, let P = ( C , . . . , C k ) be an induced oriented path in G ( C ) satisfying (Q1), (Q2), and (Q3).Let C ℓ +1 be the last (according to the orientation) constraint in P which shares a variable with anyconstraint in P ℓ . Let P ℓ +1 denote the part of P starting from C ℓ +1 . Also, denote the last constraintin P ℓ +1 by C ℓ +1 and let C ℓ +1 = ( C ℓ +1 , T ℓ +1 ) . We claim that P ℓ +1 satisﬁes the properties of the ( ℓ + 1) -leg of the discrepancy check. Indeed, the ﬁrst bullet point in (D4) follows from (Q2), thesecond bullet point of (D4) follows from the construction of P ℓ +1 , and the third bullet point of (D4)follows from (Q3). (cid:3) Given the preceding two lemmas, the proof of Proposition 5.3 follows easily.

Proof of Proposition 5.3.

Since ( v , T ) ∈ U ∩ D , it follows by Lemma 5.4 that there exists aninduced oriented path P = ( C , . . . , C s ) in G ( C ) such that v ∈ vbl( C ) , each C j is not satisﬁedby at least one of X − v T − , Y − v T − , and there exists v = v with v ∈ vbl( C s ) and ( v , T − ∈ D .Then, by the same argument as in the proof of Lemma 5.5 (the only diﬀerence is that we slightlyweaken the condition (R3) and require it only for C ∈ P j \ { C j } ), we can show that there exists aninduced oriented path of constraints P = ( C , . . . , C k ) with C = C and C k = C and such thatthe following properties hold.(Q’1) v ∈ vbl( C ) . Q’2) There exists some v ∈ vbl( C ) such that ( v , T ) ∈ D and either X T ( v ) or Y T ( v ) is equalto ( C ) π ( v ) .(Q’3) For each constraint C ∈ P \ { C } , there exists some T ′ ∈ [ T , T ) such that at least one ofthe following holds. • The subset vbl( C ) does not satisfy C in at least one of X T ′ and Y T ′ . • There exists v ′ ∈ vbl( C ) such that – The subset vbl( C ) \ { v ′ } does not satisfy C in at least one of X T ′ and Y T ′ , and – v ′ is updated in ( T ′ , T ℓ ) , and – The ﬁrst update at time t ′ > T ′ of v ′ results in a discrepancy, and – There exists some C ′′ ∈ C such that either X t ′ ( v ′ ) or Y t ′ ( v ′ ) is equal to C ′′ π ( v ′ ) .For such a path, note that C = ( C , T ) ∈ C , f = 1 , v ∈ V , and P satisfy the properties inDeﬁnition 5.1. Now, a direct (repeated) application of Lemma 5.5 gives a discrepancy check startingat ( v , T ) . (cid:3) Minimal discrepancy checks.

In order to carry out the union bound argument later (bothto control the size of the union as well as to control the probabilities of individual events in theunion), it will be convenient to focus on minimal discrepancy checks . Deﬁnition 5.6.

Let v ∈ V and T an integer in [ HK, H ( K + 1)) (in particular, T is in time block K ). For ≤ i ≤ K − , let T i = H ( K − i ) . A minimal discrepancy check M starting at ( v , T ) consists of a sequence of induced oriented paths P , . . . , P K − in G ( C ) and a collection of Booleanvariables f = 1 , f , . . . , f K − such that the following properties are satisﬁed.(M1) ( v , T ) ∈ D .(M2) The ﬁrst constraint of P , which we denote by C , satisﬁes v ∈ vbl( C ) . Moreover, vbl( C ) \{ v } does not satisfy C in at least one of X − v T − and Y − v T − .(M3) For i ≥ satisfying f i +1 = 0 , let j ( i + 1) = max { j : j ≤ i + 1 , f j = 1 } . Then, there exists some v j ( i +1) ∈ vbl( C j ( i +1) ) , where C j ( i +1) is the last constraint in P j ( i + ) , such that ( v j ( i +1) , T i ) ∈ D and v j ( i +1) is not updated in ( T i +1 , T i ) . Moreover, P i +1 = { C i } .(M4) For i ≥ satisfying f i +1 = 1 , the following properties hold. • The last constraint of P i and the ﬁrst constraint of P i + have non-empty intersection.Any other pair of constraints in P i and P i +1 are disjoint. • For any C ′ ∈ P \ { C } (in case i = 0 ) and for any C ′ ∈ P i +1 (in case i ≥ ), let vbl ( C ′ ) := vbl( C ′ ) ∩ V (( T i +2 , T i +1 )) , and vbl + ( C ′ ) := vbl( C ′ ) ∩ V + (( T i +1 , T i )) . Then, there exists some T ′ ∈ [ T i +1 , T i ) such that at least one of the following holds. – The subset vbl( C ′ ) \ (vbl ( C ′ ) ∪ vbl + ( C ′ )) does not satisfy C ′ in at least one of X T ′ or Y T ′ . – There exists v ′ ∈ vbl( C ′ ) \ (vbl ( C ′ ) ∪ vbl( C ′ )) such that ∗ The subset vbl( C ′ ) \ ( { v ′ } ∪ vbl ( C ′ ) ∪ vbl + ( C ′ )) does not satisfy C ′ in at least one of X T ′ and Y T ′ , and ∗ v ′ is updated in ( T ′ , T i ) , and ∗ The ﬁrst update at time t ′ > T ′ of v ′ results in a discrepancy, and ∗ There exists some C ′′ ∈ C such that either X t ′ ( v ′ ) or Y t ′ ( v ′ ) is equal to C ′′ π ( v ′ ) . or a minimal discrepancy check M , we denote the number of constraints in P i by r i and refer toit as the length of leg i .The next lemma shows how to modify a discrepancy check in order to obtain a minimal discrep-ancy check. Lemma 5.7.

Suppose there exists a discrepancy check D starting at ( v , T ) . Then, there exists aminimal discrepancy check M starting at ( v , T ) .Proof. Let D be a discrepancy check starting at ( v , T ) . For each i ≥ , let ˜ C i denote the ﬁrst(according to the orientation) constraint in P i for which vbl( ˜ C i ) ∩ vbl( C i +1 ) = ∅ . Let P i denote thepart of P i from the starting point until and including ˜ C i . We claim that the paths P , . . . , P K − and the Boolean variables f , . . . , f K − from D constitute a minimal discrepancy check startingat ( v , T ) , where we use the variables v i from D for each i satisfying f i +1 = 0 in order to checkcondition (M3).Indeed, (M1) and (M2) follow from (D1). (M3) follows from (D3) and Lemma 5.2. The ﬁrstbullet point of (M4) follows from the the second bullet point of (D4), the construction of P i s, andthe fact that the P j s are induced paths. The second bullet point of (M4) follows from the secondbullet point of (D2) and the third bullet point of (D4). (cid:3) Combining this lemma with Proposition 5.3, we have the following.

Proposition 5.8.

Let ( v , T ) ∈ U ∩ D . Then there exists a minimal discrepancy check M startingat ( v , T ) . To prepare for the next subsection, we introduce some more notation. To every minimal discrep-ancy check M , we associate a graph G ( M ) = ( V ( M ) , E ( M )) deﬁned as follows. The vertex set V ( M ) consists of pairs ( C, i ) where ≤ i ≤ K − and C ∈ P i . The vertices ( C, i ) and ( C ′ , i ) are connected to each other if and only if C and C ′ are adjacent in P i . Moreover, if C is the lastconstraint in P i and C ′ is the ﬁrst constraint in P i +1 , then ( C, i ) and ( C ′ , i + 1) are adjacent. Giventhe Boolean variables f , . . . , f K − of M , we can ﬁnd some ≤ ℓ ≤ K − and disjoint intervals L , . . . , L ℓ ⊂ { , . . . , K − } with the following properties. • For all a < b , max L a < min L b . • For every ≤ s ≤ ℓ and for every i ∈ L s , f i = 1 . • For every i such that f i = 1 , there exists some ≤ s ≤ ℓ such that i ∈ L s .Given L , . . . , L ℓ , we deﬁne oriented induced paths b P , . . . , b P ℓ in G ( M ) where b P contains all thepoints { ( C, a ) : a ∈ L , C ∈ P a } \ ( C , and for ≤ j ≤ ℓ , b P j contains all the points { ( C, a ) : a ∈ L j , C ∈ P a } . For ≤ j ≤ ℓ , let b r j = X a ∈ L j r a . We say that ℓ is the eﬀective parameter of the minimal discrepancy check and b r , . . . , b r ℓ are its eﬀective leg lengths .Finally, for each ≤ j ≤ ℓ , by taking every other point b P j starting with the very last point, weobtain an independent set I ( M ) in G ( M ) such that | I ( M ) | ≥ ℓ X j =1 b r j / . We have two cases.(Ind1) P ℓj =1 b r j ≥ P K − i =1 r i / . In this case, we deﬁne I ( M ) := ∅ and I ( M ) := I ( M ) . Ind2) P ℓj =1 b r j < P K − i =1 r i / . In this case, observe that we can ﬁnd an independent set I ( M ) in G ( M ) such that every element ( C, i ) ∈ I ( M ) satisﬁes f i = 0 , | I ( M ) | ≥ P K − i =1 r i / , and I ( M ) := I ( M ) ∪ I ( M ) is also an independent set in G ( M ) .5.5. Probability of a minimal discrepancy check.

In the previous subsection, we showed thatif ( v , T ) ∈ U ∩ D , then there must exist a minimal discrepancy check starting at ( v , T ) . In thissubsection, we will bound the probability (under the randomness driving the Glauber dynamics)of seeing a minimal discrepancy check with given leg lengths. This will then be combined with aunion bound argument in the next subsection.It will be convenient to use the following description of the Glauber dynamics for µ π . To eachvertex v and each time t , we associate an independent uniform random variable U ( v, t ) ∼ Unif[0 , .At time t (recall that this means that the current states are X t − , Y t − ), we choose a variable v toupdate from the uniform distribution on V and choose X t ( v ) and Y t ( v ) according to the one-stepmaximal coupling of the corresponding conditional marginal distributions, with the realisation of ( X t ( v ) , Y t ( v )) determined using U ( v, t ) in the natural manner. With this notation, observe thefollowing.(O1) For each t, X t − , Y t − , v , there exist subsets I d ( v, X t − , Y t − ) ⊆ [0 , of measure at most q such that if ( v, t ) ∈ U ∩ D , then U ( v, t ) ∈ I d ( v ) . Here, as was deﬁned in Section 3, q = max v ∈ V,Y ∈ Q v ∈ V Q v d TV ( P π [value( v ) = · ] , µ π [value( v ) = · | Y − v ]) . (O2) Suppose e · b · ∆ ≤ . Then, by Lemma 3.1, for each v, C , there exist subsets I s ( v, C ) ⊆ [0 , of measure at most (1 − b ) − ∆ P π [value( v ) = C π ( v )] such that if v does not satisfy C in at least one of X t and Y t and if t ′ is the last update of v before time t , then U ( v, t ′ ) ∈ I s ( v, C ) . Here, recall that b = max C ∈C b ( C ) , where b ( C ) = Y u ∈ vbl( C ) | π − u ( C π ( u )) | − . Let P denote the probability measure corresponding to the randomness of the Glauber dynamicsi.e. the choice of vertex to update at time t and the i.i.d. random variables U ( v, t ) ∼ Unif[0 , . Themain result of this subsection is the following. Proposition 5.9.

Let Φ be an atomic CSP and let π be an admissible projection scheme. Let M be a minimal discrepancy check starting from ( v , T ) , where T ≥ HK ( H = 100 κn ), and with leglengths ( r , . . . , r K − ) . Further, let ℓ be the eﬀective parameter of M and let its eﬀective leg lengthsbe ( b r , . . . , b r ℓ ) . Then, P ( M ) ≤ (3000∆) − P ℓi =1 ˆ r i · (3000∆) − | I ( M ) | Proof.

Recall the deﬁnition of the graph G ( M ) associated to M and the independent set I ( M ) = I ( M ) ∪ I ( M ) in this graph. Recall also that I ( M ) does not contain ( C , . For ≤ i ≤ K − ,let C i = { ( C, i ) ∈ I ( M ) } . Also, let V ( C, i ) = V (( T i +1 , T i )) ∩ vbl( C ) , V + ( C, i ) = V + (( T i , T i − )) ∩ vbl( C ) , nd let V i = ∪ ( C,i ) ∈ C i V ( C, i ) , V + i = ∪ ( C,i ) ∈ C i V + ( C, i ) . Note that for ≤ i ≤ K − , the sets V + i and V i − are completely determined by the choice ofvertices selected to be updated between times ( T i , T i − ) =: Int i − . We say that Int i − is exceptional if | V + i | + | V i − | ≥ n . It follows from a straightforward application of the Chernoﬀ bound that for any ≤ i ≤ K − , P [Int i − is exceptional ] ≤ − H/ ≤ exp( − κn ) . (5.2)Moreover, for disjoint subsets V and V + of V such that | V | + | V + | ≤ n/ , we have P [ V i − = V , V + i = V + ] = P [ V + i = V + | V i − = V ] P [ V i − = V ] (5.3) ≤ exp (cid:18) − H | V + | n (cid:19) · (cid:18) − | V | n (cid:19) H ≤ e − κ · ( | V | + | V + | ) , where in the second line, we have used that conditioned on V i − = V , the vertex to be updatedat each step is chosen uniformly from a set of size at least n/ , and further, membership in V + i fordiﬀerent vertices is negatively dependent.Let I ⊂ { , . . . , K − } denote the (random) set of i such that Int i − is exceptional. By the lawof total probability, it suﬃces to show that P [ M ∩ { I = ˆ J } ] ≤ (3000∆) − P ℓi =1 b r i · (3000∆) − | I ( M ) | . for every subset ˆ J of { , . . . , K − } . Therefore, for the remainder of the proof, we ﬁx such a choiceof ˆ J and let ˆ I = { i ∈ { , . . . , K − } : { i − , i, i + 1 } / ∈ ˆ J } . Also, for each i ∈ ˆ I , let T i − denote the sigma-algebra generated by the random variables whichrecord the choice of vertex to update for times between [ T i , T i − ) . Note that V i and V + i aremeasurable with respect to T i and T i − respectively. We denote the realizations of these randomsets, given the relevant sigma algebras, by V i ( T i ) and V + i ( T i − ) respectively. Note that, by thedeﬁnition of ˆ I , it is necessarily the case that | V + i ( T i − ) | + | V i − ( T i − ) | ≤ n/ . We also let ˆ I := { i ∈ ˆ I : f i = 1 } , and ˆ I := { i ∈ ˆ I : f i = 0 , ∃ C ∈ C such that ( C, i ) ∈ I ( M ) } . Observe that, by construction of the set I ( M ) , we have that for any i ∈ ˆ I and any j ∈ ˆ I , Int i − ∩ (Int j − ∪ Int j ) = ∅ . (5.4)Now, consider i ∈ ˆ I . Conditioning on T i − , T i ﬁxes V + i = V + i ( T i − ) and V i = V i ( T i ) . More-over, conditioning on T i ﬁxes, for each b C = ( C, i ) ∈ C i , the set T ( b C ) ⊆ [ T i , T i − ) consisting of T i and all the update times of each variable v ∈ vbl( C ) \ V + i . Observe that |T ( b C ) | ≤ κH/n · | vbl( C ) | and that the following holds. If there exists T ′ ∈ [ T i , T i − ) such that C is not satisﬁed in at least one of X T ′ and Y T ′ by vbl( C ) \ ( V i ∪ V + i ) , then there exists some t ∈ T ( b C ) such that in the last update t v ≤ t ofeach variable v ∈ vbl( C ) \ ( V i ∪ V + i ) before time t , we necessarily have U ( v, t v ) ∈ I s ( v, C ) .Indeed, t can be taken to simply be the maximum of T i and the last time before (andincluding) T ′ that any variable in vbl( C ) \ ( V i ∪ V + i ) is updated. Call this event E ( b C ) .Since conditioned on T i and T i − , t v is determined by t for each v ∈ vbl( C ) \ V i , we have P [ E ( b C ) | T i , T i − ] ≤ |T ( b C ) | Y v ∈ vbl( C ) \ ( V i ∪ V + i ) P [ U ( v, t v ) ∈ I s ( v, C ) | T i , T i − , t ] ≤ | vbl( C ) | κ · Y v ∈ vbl( C ) \ ( V i ∪ V + i ) (1 − b ) − ∆ P π [value( v ) = C π ( v )] . • If there exists T ′ ∈ [ T i , T i − ) and v ′ ∈ vbl( C ) \ ( V i ∪ V + i ) such that – C is not satisﬁed in X T ′ or Y T ′ by vbl( C ) \ { v ′ } , and – the ﬁrst update t ′ > T ′ of v ′ results in a discrepancy, and – there exists some C ′ ∈ C with v ′ ∈ vbl( C ′ ) such that either X t ′ ( v ′ ) or Y t ′ ( v ′ ) is equalto C ′ π ( v ′ ) ,then there exists some t ∈ T ( b C ) (again, t can be taken to be the maximum of T i and thelast time before (and including) T ′ that any variable in vbl( C ) \ ( V i ∪ V + i ) is updated) suchthat – in the last update t v ≤ t of each variable v ∈ vbl( C ) \ ( { v ′ } ∪ V i ∪ V + i ) , we necessarilyhave U ( v, t v ) ∈ I s ( v, C ) , and – U ( v ′ , t ′ ) ∈ I d ( v ′ ) ∩ I s ( v ′ , C ′ ) .Call this event E ( b C ) .Let T ≤ i denote the sigma algebra generated by T , . . . , T i . Note that, conditioned on T ≤ i , t ∈ T ( b C ) determines t v (for all v ∈ vbl( C ) \ V i ), v ′ and t ′ . Therefore, letting V ′ ( b C ) := vbl( C ) \ ( V i ∪ V + i ∪ { v ′ } ) , we have P [ E ( b C ) | T ≤ i ] ≤ |T ( b C ) | · P [ U ( v ′ , t ′ ) ∈ I d ( v ′ ) ∩ I s ( v ′ , C ′ ) | T ≤ i , t ] · Y v ∈ V ′ ( b C ) P [ U ( v, t v ) ∈ I s ( v, C ) | T ≤ i , t ] ≤ | vbl( C ) | κ · min (cid:18) q, ∆(1 − b ) − ∆ max C ′ : v ′ ∈ vbl( C ′ ) P π [value( v ′ ) = C ′ π ( v ′ )] (cid:19) · Y v ∈ V ′ ( b C ) (1 − b ) − ∆ P π [value( v ) = C π ( v )] . Here, the factor of ∆ in the second term in the parentheses is to account for the choice of C ′ given v ′ .Note that by the deﬁnition of a minimal discrepancy check, if f i = 1 (in particular, if i ∈ ˆ I ),then for every such b C ∈ I ( M ) at least one of the events E ( b C ) and E ( b C ) holds. Let E ( b C ) = E ( b C ) ∪ E ( b C ) . hen, by the deﬁnition of ζ ( C ) and property (A3) of Deﬁnition 3.2, we have P [ E ( b C ) | T ≤ i ] ≤ | vbl( C ) | κ · ζ ( C ) · Y v ∈ vbl( C ) \{ V i ∪ V + i } (1 − b ) − ∆ P π [value( v ) = C π ( v )] . (5.5)Let Θ( V i , V + i ) := Y C i  | vbl( C ) | κ · ζ ( C ) · Y v ∈ vbl( C ) \{ V i ∪ V + i } (1 − b ) − ∆ P π [value( v ) = C π ( v )]  , and let b Θ i := Y C i  | vbl( C ) | κ · ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) . Since for any ( C, i ) , ( C ′ , i ) ∈ C i , we necessarily have vbl( C ) ∩ vbl( C ′ ) = ∅ , it follows by expandingthe term inside the parentheses in the deﬁnition of ˆΘ i that ˆΘ i ≥ X V i ,V + i ⊆ V,V i ∩ V + i = ∅ Θ( V i , V + i ) · e − κ ( | V i | + | V + i | ) / . (5.6)Moreover, by property (A2) of Deﬁnition 3.2, we have that ˆΘ i · Y C i | vbl( C ) |  ≤ (3000∆) − | C i | . (5.7)At this point, we are almost done. Note that(N1) The time intervals Int j − for j ∈ ˆ J are disjoint from the time intervals Int i , Int i +1 , Int i − for i ∈ ˆ I .(N2) For each i ∈ ˆ I , we have f i = 0 . Recall the deﬁnition of the sets L , . . . , L ℓ ⊆ { , . . . , K − } associated to M . Let j ( i ) be the largest index such that j ( i ) ≤ i and j ( i ) ∈ L a for some ≤ a ≤ ℓ . Let C ∗ j ( i ) denote the last constraint in P j ( i ) . Then, by construction of I ( M ) ,we necessarily have ( C ∗ j ( i ) , j ( i )) ∈ I ( M ) ⊆ I ( M ) . Also, by (M3), there must exist some v ∗ j ( i ) ∈ vbl( C ∗ j ( i ) ) such that v ∗ j ( i ) is not updated in ( T i , T i − ) = Int i − .(N3) For each i ∈ ˆ I , for each ( C, i ) ∈ I ( M ) , for each v ∈ vbl( C ) , at least one of the followingholds: (i) v ∈ V + i (ii) v ∈ V i , (iii) there is a term corresponding to v in the expression for Θ( b C, V i , V + i ) .(N4) By construction, for every ( C, i ) , ( C ′ , j ) ∈ I ( M ) with | j − i | ≤ , we have vbl( C ) ∩ vbl( C ′ ) = ∅ .Let ˆ I e denote the set of indices ≤ i ≤ K − for which there exists some j ∈ ˆ I with | i − j | ≤ ,and let V denote the collection of all subsets { V i − , V + i } i ∈ ˆ I e subject to the restriction that | V + i | + | V i − | ≤ n/ ∀ i ∈ ˆ I e . Also, let W denote the collection of the variables v ∗ j ( i ) (with notation as in (N2)) for each i ∈ ˆ I .Then, from the above discussion, we have P [ M ∩ { I = ˆ J } ] ≤ X V X W e − κn | ˆ J | ·  Y i ∈ ˆ I Θ( V i , V + i ) · e − κ · ( | V i | + | V + i | )  · Y ˆ I e − κ e − κn | ˆ J | · X V  Y i ∈ ˆ I Θ( V i , V + i ) · e − κ · ( | V i | + | V + i | )  · X W Y ˆ I e − κ ≤ e − κn | ˆ J | · Y i ∈ ˆ I ˆΘ i ·  Y j ∈{ j ( i ): i ∈ ˆ I } | vbl( C ∗ j ) |  ·  Y i ∈ ˆ I e − κ  ≤ e − κn | ˆ J | · Y i ∈ ˆ I  ˆΘ i · Y C i | vbl( C ) |  · Y i ∈ ˆ I e − κ ≤ e − κn | ˆ J | · e − κ | ˆ I | · Y i ∈ ˆ I (3000∆) − | C i | ≤ (3000∆) − | I ( M ) | ≤ (3000∆) − P ℓi =1 b r i · (3000∆) − | I ( M ) | . Let us explain this chain of inequalities. In the ﬁrst line, we have used (5.2), (5.3), (5.4), (5.5),(N1)-(N4), and the law of total probability; the third line follows from (5.6) and (N2); the fourthline follows again from (N2); the ﬁfth line follows from (5.7); the sixth line follows upon notingthat each of the leg lengths r , . . . , r K − is at most n since P , . . . , P K − are induced paths andthat κ ≥ by (A2) on Deﬁnition 3.2; and the last line follows by the construction of I ( M ) . (cid:3) Rapid mixing of the Glauber dynamics.

We are now in a position to prove Proposition 4.2,which will follow as a consequence of the next lemma.

Lemma 5.10.

Let K ≥ and let B K be the event that there exists a minimal discrepancy checkstarting from ( v , T ) with T in time block K . Then, P ( B K ) ≤ K · (4 / − ( K − . Proof.

This follows from a union bound. First, note that the graph G ( C ) whose vertices are C = C × { , , . . . , K } and which has an edge between ( C, i ) and ( C ′ , j ) if and only if | j − i | ≤ and vbl( C ) ∩ vbl( C ′ ) = ∅ has maximum degree at most . From this, it readily follows that thenumber of minimal discrepancy checks starting from time block K , with eﬀective parameter ℓ , andwith eﬀective leg lengths ( b r , . . . , b r ℓ ) is at most (3∆) · (3∆) b r + ··· + b r ℓ · K − , where the factor K − accounts for the choice of f i and the ﬁrst factor of (3∆) accounts for thechoice of the vertex in G ( C ) adjacent to ( C , . By Proposition 5.9, we know that for a minimaldiscrepancy check with eﬀective parameter ℓ and eﬀective leg lengths ( b r , . . . , b r ℓ ) , P [ M ] ≤ (3000∆) − P ℓi =1 b r i · (3000∆) − | I ( M ) | . Therefore, by the union bound, we have P [ B K ] ≤ (3∆) · K − X ℓ =1 X b r ,..., b r ℓ (3000∆) − P ℓi =1 b r i · (3∆) b r + ··· + b r ℓ · (3000∆) − | I ( M ) | · K − ≤ K − · (3∆) · K − X ℓ =1 X b r ,..., b r ℓ − P ℓi =1 b r i · − P ℓi =1 b r i · (3000∆) − | I ( M ) | K − · (3∆) · K − X ℓ =1 X b r ,..., b r ℓ − P ℓi =1 b r i · − | I ( M ) | ≤ K − · (3∆) · K − X ℓ =1 X b r ,..., b r ℓ − P ℓi =1 b r i · − K − / ≤ (2 . − ( K − · (3∆) · K · K − ≤ K · (4 / − ( K − . Here, the third line follows by the construction of I ( M ) and the fourth line follows from (Ind1),(Ind2), and P K − i =1 r i ≥ K − . (cid:3) We are now in a position to prove Proposition 4.2.

Proof of Proposition 4.2.

Consider arbitrary initial states X , Y and couple the Markov chains ( X t ) t ≥ , ( Y t ) t ≥ as above. Let t ∗ = 10 H log( n ∆ /δ ) + 10 n log(1 /δ ) , where recall that H = 100 κn . If τ couple ≥ t ∗ then in particular, there exists some v ∈ V such that ( v, t ∗ ) ∈ D . Let t v denote the last time before (and including) t ∗ that v was updated. Note that ( v, t v ) ∈ U ∩ D . Therefore, by Proposition 5.3, there exists a minimal discrepancy check M startingat ( v, t v ) . Note that for any v ∈ V , P [ t v ≤ t ∗ − n log(1 /δ )] ≤ δ . On the other hand, if t v > t ∗ − n log(1 /δ ) , then in particular, t v ≥ H log( n ∆ /δ ) so that t v is in time block K for K ≥

10 log( n ∆ /δ ) . By Lemma 5.10, the probability of having a minimal discrepancy check starting from some ( v, t v ) ∈ V × [ t ∗ ] with K − legs is at most · n · t ∗ · (4 / −

10 log( n ∆ /δ ) ≤ δ . The desired conclusion now follows from (5.1). (cid:3) Finishing the proof of Proposition 3.3

We conclude by completing the proof of Proposition 3.3.

Proof of Proposition 3.3.

Case 2.

Let α ∈ [0 , be a parameter to be chosen momentarily via anoptimization problem. We will use the marked/unmarked scheme of [Moi19]. Namely, for each v ∈ V , independently, with probability α , we set Q v = { } , and with probability − α , we set Q v = [ A ] = [2] . Clearly, this satisﬁes (A3) and (A4). Let V be the set of v ∈ V for which | Q v | = 1 .Let V f be the set of v ∈ V for which | Q v | = A = 2 .Let θ , θ f ∈ (0 , and γ > be parameters such that(1) γ ≤ θ < α .(2) γ ≤ θ f < − α .(3) D ( θ , α ) ≥ γ log A .(4) D ( θ f , − α ) ≥ γ log A . ere, D ( x, y ) = x log( x/y ) + (1 − x ) log((1 − x ) / (1 − y )) is the Kullback-Leibler divergence. Our goal is to maximize γ . Solving this optimization problemfor A = 2 , we can ﬁnd parameters θ , θ f , α, γ such that γ ≥ . .Assume that ∆ ≤ cA γk /k for a suﬃciently small constant c depending only on η . We nextshow that there exists a choice of V and V f such that for all C ∈ C , |V ∩ vbl( C ) | ≥ γk and |V f ∩ vbl( C ) | ≥ γk .For this, we will use the LLL. Note that if V is a random set in which each variable is includedindependently with probability α , then by the Chernoﬀ-Hoeﬀding bound, Pr[ |V ∩ C | < θ | vbl( C ) | ] ≤ e − D ( θ ,α ) | vbl( C ) | and Pr[ |V f ∩ C | < θ f | vbl( C ) | ] ≤ e − D ( θ f , − α ) | vbl( C ) | . By our assumptions on the parameters (i.e. they satisfy the conditions of the optimization problem),we have max (cid:16) e − D ( θ ,α ) | vbl( C ) | , e − D ( θ f , − α ) | vbl( C ) | (cid:17) ≤ e − k log A · γ < (2 e ∆) − . Thus, by the LLL, there exists a choice of assignments of v ∈ V to V and V f such that for all C ∈ C , |V ∩ vbl( C ) | ≥ θ k, |V f ∩ vbl( C ) | ≥ θ f k. Moreover, by Theorem 2.2, with probability at least − δ , this assignment can be constructed intime O ( n ∆ k log(1 /δ )) .Under a choice of V and V f with the above properties, for C ∈ C , b ( C ) = A −|V ∩ vbl( C ) | ≤ A − γk ≤ min (cid:0) (300∆ /η ) − , (600 k ∆) − (cid:1) , so that (A1) holds. It remains to verify (A2). Note that ζ ( C ) ≤ max(1 , A ) ≤ . Let κ = 12 log(3000( k + ∆)) . Then, we have | vbl( C ) | κ · ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ |V f ∩ vbl( C ) | · (12 log(3000( k + ∆))) · (cid:18) / (100 k ) A + 12 kA (cid:19) |V f ∩ vbl( C ) | ≤ k · (12 log(3000( k + ∆))) · (cid:18) / (2 k ) A (cid:19) |V f ∩ vbl( C ) | ≤ k · (12 log(3000( k + ∆))) · · A − γk ≤ (60000∆) − , by the assumed upper bound on ∆ . Case 3.

For each v ∈ V , let Q v := { , } and choose a uniformly random projection π v from [ A ] to Q v such that | π − v (1) | = 1 and | π − v (2) | = 2 . Clearly, this satisﬁes (A3) and (A4). For each C ∈ C , let V ( C ) be the set of v ∈ vbl( C ) for which | π − v ( C π ( v )) | = 1 and let V ( C ) be the set of v ∈ vbl( C ) for which | π − v ( C π ( v )) | = 2 . Note that for each v ∈ vbl( C ) , the probability (over thechoice of π v ) that v ∈ V ( C ) is / and the probability that v ∈ V ( C ) is / .We have b ( C ) = 2 −|V ( C ) | . ote that ζ ( C ) ≤ max(1 , A ) = 3 . Let κ = 12 log(3000(∆ + k )) . Then, we have | vbl( C ) | κ · ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ k · (12 log(3000(∆ + k ))) · (1 + 1 / (100 k ) + 1 / (2 k )) k (cid:18) (cid:19) |V ( C ) | (cid:18) (cid:19) |V ( C ) | ≤ k · (12 log(3000(∆ + k ))) · · − k · |V ( C ) | . Let γ = 0 . . By Markov’s inequality, we have Pr[ b ( C ) > − γk ] = Pr[(3 / |V ( C ) | · |V ( C ) | > (1 − γ ) k ] ≤ min θ> (3 (1 − γ ) k ) − θ (cid:18)

23 (3 / θ + 13 3 θ (cid:19) k ! . Similarly, we have

Pr[2 |V ( C ) | > (1 − γ ) k ] ≤ min θ> (3 (1 − γ ) k ) − θ (cid:18)

23 2 θ + 13 (cid:19) k ! . For γ = 0 . , solving the above optimization problem in θ , one ﬁnds that Pr[ b ( C ) > − γk ] ≤ − γk , and Pr[2 |V ( C ) | > (1 − γ ) k ] ≤ − γk . Using the above bounds, assuming that ∆ ≤ c γk /k , by the LLL, there exists a choice of theprojection so that for all C ∈ C , b ( C ) ≤ min (cid:0) (300∆ /η ) − , (600 k ∆) − (cid:1) , and | vbl( C ) | κ · ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ (60000∆) − . Furthermore, by Theorem 2.2, such projection maps can be constructed in time O ( n ∆ k log(1 /δ )) with probability at least − δ . Case 4 . First, we consider the case

A / ∈ { , } . By taking the constant c > in the statementof the proposition to be suﬃciently small, we may assume that A k is suﬃciently large for variousinequalities below to go through. We will use the inequality inf A ≥ ,A/ ∈{ , } (cid:26) max R ∈ [2 ,A ] (cid:20) min (cid:18)

12 log( A/ ⌈ A/R ⌉ )log A , log( ⌊ A/R ⌋ )log A (cid:19)(cid:21)(cid:27) := α ∗ ≥ . , which may be veriﬁed numerically. For A ≥ , A / ∈ { , } , let R := arg max R ′ ∈ [2 ,A ] (cid:20) min (cid:18)

12 log( A/ ⌈ A/R ′ ⌉ )log A , log( ⌊ A/R ′ ⌋ )log A (cid:19)(cid:21) . As in Case 1, let Q v = [ R ] for all v ∈ V and deﬁne π v arbitrarily so that the preimage of eachelement in Q v has size either ⌊ A/R ⌋ or ⌈ A/R ⌉ . As before, this satisﬁes (A4). ecall that ∆ ≤ cA α ∗ ( k − / ( k log A ) for some small absolute constant c (depending only on η ).Then, (A1) holds since b ≤ (cid:18) ⌊ A/R ⌋ (cid:19) k ≤ A − α ∗ k ≤ η/ (300∆) , and (A3) holds since by the choice of R and since A is suﬃciently large, A − α ∗ ≤ ⌊ A/R ⌋ A ≤ P π [value( v ) = C π ( v )] ≤ ⌈ A/R ⌉ A ≤ A − α ∗ . It remains to verify (A2). By the choice of R and the upper bound on ∆ , we have (1 − b ) − ∆ ≤ b ∆ ≤ / (100 k ) . Therefore, as in Case 1, we have that ζ ( C ) ≤ A α ∗ . Let κ = 12 log(3000(∆ + Ak )) . Then, ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ A α ∗ · (1 + 1 / (2 k )) k · A − α ∗ k ≤ · A − α ∗ ( k − . Thus, | vbl( C ) | · κ · ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ · k · (12 log(3000(∆ + Ak ))) A − α ∗ ( k − ≤ (60000∆) − , where in the last inequality, we used the assumption ∆ ≤ cA α ∗ ( k − / ( k log A ) .Next, we consider the case A = 5 . Here, our construction is similar to Case 3. Let x, y ≥ be parameters to be chosen later via an optimization problem, which satisfy x + y = 1 . For each v ∈ V , with probability x , we choose the projection π v to be a uniformly random partition of [ A ] into two parts of sizes and , and with probability y , we choose the projection π v to be a uniformlyrandom partition of [ A ] into three parts of sizes , , and . For C ∈ C , let V x, ( C ) be the set ofvariables v ∈ vbl( C ) for which π v is a partition of [ A ] into parts of size , , and | π − v ( C π ( v )) | = 3 .Similarly, we deﬁne V x, ( C ) , V y, ( C ) and V y, ( C ) . Note that for v ∈ vbl( C ) , Pr[ v ∈ V x, ( C )] = 3 x/ , Pr[ v ∈ V x, ( C )] = 2 x/ , Pr[ v ∈ V y, ( C )] = 4 y/ , and Pr[ v ∈ V y, ( C )] = y/ .We have b ( C ) = 3 −|V x, ( C ) | · −|V x, ( C ) | · −|V y, ( C ) | . Note that ζ ( C ) ≤ max(1 , A ) = 5 . Let κ = 12 log( k/c + 3000∆) . Then, we have | vbl( C ) | κ · ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ k · (12 log( k/c + 3000∆)) · (1 + 6 c/k + 1 / (2 k )) k (cid:18) (cid:19) |V x, ( C ) | (cid:18) (cid:19) |V x, ( C ) | (cid:18) (cid:19) |V y, ( C ) | (cid:18) (cid:19) |V y, ( C ) | ≤ k · (12 log( k/c + 3000∆)) · · − k · |V x, ( C ) | · |V x, ( C ) | · |V y, ( C ) | . et γ = 0 . . By Markov’s inequality, we have Pr[ b ( C ) > − γk ] = Pr h (5 / |V x, ( C ) | · (5 / |V x, ( C ) | · (5 / |V y, ( C ) | · |V y, ( C ) | > (1 − γ ) k i ≤ min θ> (5 (1 − γ ) k ) − θ (cid:18) x / θ + 2 x / θ + 4 y / θ + y θ (cid:19) k ! . Similarly, we have

Pr[3 |V x, ( C ) | · |V x, ( C ) | · |V y, ( C ) | > (1 − γ ) k ] ≤ min θ> (5 (1 − γ ) k ) − θ (cid:18) x θ + 2 x θ + 4 y θ (cid:19) k ! . For γ = 0 . , solving the above optimization problem in θ , one ﬁnds that Pr[ b ( C ) > − γk ] ≤ − γk , and Pr[3 |V x, ( C ) | · |V x, ( C ) | · |V y, ( C ) | > (1 − γ ) k ] ≤ − γk . Using the above bounds, assuming that ∆ ≤ c γk /k , by the LLL, there exists a choice of theprojection so that for all C ∈ C , b ( C ) ≤ min (cid:0) (300∆ /η ) − , (600 k ∆) − (cid:1) , and | vbl( C ) | κ · ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ (60000∆) − . Furthermore, by Theorem 2.2, such projection maps can be constructed in time O ( n ∆ k log(1 /δ )) with probability at least − δ .The case A = 7 can be done similarly, using partitions of [ A ] into sets of size (3 , , or (2 , , , .By using the same analysis as above, one can show that an admissible projection scheme exists when ∆ ≤ c γk /k for γ = 0 . , and moreover, that such a projection scheme can be constructed intime O ( n ∆ k log(1 /δ )) with probability at least − δ . Case 5.

In the general case, we will combine the constructions used in the previous cases.For v with | Ω v | ≥ and | Ω v | / ∈ { , } , let R v := arg max R ∈ [1 , | Ω v | ] min (cid:18)

12 log( | Ω v | / ⌈| Ω v | /R ⌉ )log | Ω v | , log( ⌊| Ω v | /R ⌋ )log | Ω v | (cid:19) , and recall that min (cid:18)

12 log( | Ω v | / ⌈| Ω v | /R v ⌉ )log | Ω v | , log( ⌊| Ω v | /R v ⌋ )log | Ω v | (cid:19) ≥ α ∗ = 0 . . For v ∈ V with | Ω v | ≥ and | Ω v | / ∈ { , } , we let Q v = [ R v ] , and deﬁne the projection π v arbitrarilyso that the preimage of each element in Q v has size ⌊| Ω v | /R v ⌋ or ⌈| Ω v | /R v ⌉ . Let V L be the set ofsuch variables v ∈ V .For each C ∈ C , deﬁne p L ( C ) = Q v ∈V L ∩ vbl( C ) 1 | Ω v | . Let V S = V \ V L and let p S ( C ) = Q v ∈V S ∩ vbl( C ) 1 | Ω v | . For each A ∈ { , , , } , as before, vbl A ( C ) denotes those variables v ∈ V for which | Ω v | = A . We denote b L ( C ) = Y v ∈V L ∩ vbl( C ) | π − v ( C π ( v )) | , b S ( C ) = Y v ∈V S ∩ vbl( C ) | π − v ( C π ( v )) | . e also deﬁne t L ( C ) = Y v ∈V L ∩ vbl( C ) | π − v ( C π ( v )) || Ω v | , t S ( C ) = Y v ∈V S ∩ vbl( C ) | π − v ( C π ( v )) || Ω v | , t ( C ) = t L ( C ) t S ( C ) . By the deﬁnition of α ∗ , we have that b L ( C ) ≤ p L ( C ) α ∗ , and t L ( C ) ≤ p L ( C ) α ∗ . For A ∈ { , , , } and for each v ∈ V with | Ω v | = A , we deﬁne the projection π v randomlyusing the same partitions of [ A ] as in previous cases. Note that the choice of the projection dependson certain probability parameters that will be chosen later from an optimization problem. Wedenote the collection of these parameters by p . Then b S ( C ) − is a product of independent randomvariables indexed by v ∈ vbl( C ) ∩ V S . Similarly, t S ( C ) − is a product of independent randomvariables indexed by v ∈ vbl( C ) ∩ V S . Note that t S ( C ) b S ( C ) = p S ( C ) .Let γ > be a parameter to be chosen later from an optimization problem. As in previous cases,we can use Markov’s inequality and obtain Pr[ b ( C ) − < p − γ ] ≤ Pr[ b S ( C ) − < p L ( C ) α ∗ − γ p S ( C ) − γ ]= Pr[ b S ( C ) /p S ( C ) > p L ( C ) γ − α ∗ p S ( C ) γ − ] ≤ min θ> (cid:16)(cid:0) p L ( C ) α ∗ − γ p S ( C ) − γ (cid:1) θ E [( b S ( C ) /p S ( C )) θ ] (cid:17) . Let b A be the random variable / | π − v ( C π ( v )) | for (some) v ∈ vbl A ( C ) ; note that the distributionof this random variable is the same for all v ∈ vbl A ( C ) so that b A is indeed well deﬁned. Let θ b = arg min θ> max A ∈{ , , , } log A (cid:16) A − (1 − γ ) θ E [( A b A ) θ ] (cid:17) , and let m b = − max A ∈{ , , , } log A (cid:16) A − (1 − γ ) θ b E [( A b A ) θ b ] (cid:17) . Then

Pr[ b ( C ) − < p − γ ] ≤ p ( α ∗ − γ ) θ b L p m b S ≤ p min( m b , ( α ∗ − γ ) θ b ) . Similarly, we have

Pr[ t ( C ) − < p − γ ] ≤ Pr[ t S ( C ) − < p L ( C ) α ∗ − γ ) p S ( C ) − γ ]= Pr[ b S ( C ) − > p S ( C ) − γ p L ( C ) γ − α ∗ ) ] ≤ min θ> (cid:18)(cid:16) p L ( C ) α ∗ − γ ) p S ( C ) − γ (cid:17) θ E [( b S ( C ) − θ ] (cid:19) . Let θ t = arg min θ> max A ∈{ , , , } log A (cid:16) A − (1 − γ ) θ E [( b A ) − θ ] (cid:17) , and let m t = − max A ∈{ , , , } log A (cid:16) A − (1 − γ ) θ t E [( b A ) − θ t ] (cid:17) . Then

Pr[ t ( C ) − < p − γ ] ≤ p α ∗ − γ ) θ t L p m t S ≤ p min( m t , α ∗ − γ ) θ t ) . We will maximize γ subject to the constraints that max p (min( m b , ( α ∗ − γ ) θ b )) ≥ γ, max p (min( m t , α ∗ − γ ) θ t )) ≥ γ. Numerical optimization shows that one can take γ = 0 . . or such γ and for suﬃciently small η > , assuming further that ∆ ≤ p − γ − o p (1) , it follows fromthe LLL that there exists a choice of projections π v for v ∈ V S so that for all C ∈ C , b ( C ) ≤ (600∆ /η ) − , and t ( C ) ≤ p γ . (6.1)Furthermore, by Theorem 2.2, such projection maps can be constructed in time O ( n ∆ k log(1 /δ )) with probability at least − δ .For such a projection scheme, the properties (A1), (A3) and (A4) are easily veriﬁed. We nowshow that (A2) is also satisﬁed. Let κ = 12 log(3000(∆ + 100)) . Since ζ ( C ) ≤ , we have | vbl( C ) | κ · ζ ( C ) · Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ (12 log(3000(∆ + 100))) · · | vbl( C ) | Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) . By (A1) and our choice of κ , we have (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ ≤ for all C ∈ C and v ∈ vbl( C ) . We have two cases.(1) If there exists some v ∈ vbl( C ) for which e − κ/ > | vbl( C ) | P π [value( v ) = C π ( v )] , then (12 log(3000(∆ + 100))) · · | vbl( C ) | Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ (12 log(3000(∆ + 100))) · · | vbl( C ) | · (3 / vbl( C ) − · e − κ/ ≤ (60000∆) − , where the last inequality follows by our choice of κ .(2) On the other hand, if for all v ∈ vbl( C ) , e − κ/ ≤ | vbl( C ) | P π [value( v ) = C π ( v )] , and furthersince b ∆ ≤ η/ , (6.2)we have (12 log(3000(∆ + 100))) · · | vbl( C ) | Y v ∈ vbl( C ) (cid:16) (1 − b ) − ∆ P π [value( v ) = C π ( v )] + e − κ/ (cid:17) ≤ (12 log(3000(∆ + 100))) · · | vbl( C ) | Y v ∈ vbl( C ) (cid:0) (1 − b ) − ∆ P π [value( v ) = C π ( v )] (cid:1) ≤ (12 log(3000(∆ + 100))) · ∆ · C η Y v ∈ vbl( C ) P π [value( v ) = C π ( v )] − η . (6.3)Here we have used the inequality x (1 − δ ) x < C δ , which holds for all δ ∈ (0 , , x ≥ , and for asuﬃciently large C δ depending only on δ . Now, since P π [value( v ) = C π ( v )] = | π − v ( C π ( v )) || Ω v | , it followsfrom (6.1) that (6.3) is at most (60000∆) − , which veriﬁes (A2). (cid:3) References [Alo91] Noga Alon. A parallel algorithmic version of the local lemma.

Random Structures & Algorithms , 2(4):367–378, 1991.[Bec91] József Beck. An algorithmic approach to the Lovász local lemma. I.

Random Structures & Algorithms ,2(4):343–365, 1991. BGG +

19] Ivona Bezáková, Andreas Galanis, Leslie Ann Goldberg, Heng Guo, and Daniel Stefankovic. Approxima-tion via correlation decay when strong spatial mixing fails.

SIAM Journal on Computing , 48(2):279–349,2019.[CS00] Artur Czumaj and Christian Scheideler. Coloring nonuniform hypergraphs: A new algorithmic approachto the general Lovász local lemma.

Random Structures & Algorithms , 17(3-4):213–237, 2000.[EL73] Paul Erdős and László Lovász. Problems and results on 3-chromatic hypergraphs and some related ques-tions. In

Colloquia Mathematica Societatis Janos Bolyai 10. Inﬁnite and Finite Sets, Keszthely (Hungary) .Citeseer, 1973.[FGYZ20] Weiming Feng, Heng Guo, Yitong Yin, and Chihao Zhang. Fast sampling and counting k-SAT solutionsin the local lemma regime. In

Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory ofComputing , pages 854–867, 2020.[FHY21] Weiming Feng, Kun He, and Yitong Yin. Sampling constraint satisfaction solutions in the local lemmaregime.

Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing (STOC 2021),to appear , 2021.[GJL19] Heng Guo, Mark Jerrum, and Jingcheng Liu. Uniform sampling through the Lovász local lemma.

Journalof the ACM (JACM) , 66(3):1–31, 2019.[GLLZ19] Heng Guo, Chao Liao, Pinyan Lu, and Chihao Zhang. Counting hypergraph colorings in the local lemmaregime.

SIAM Journal on Computing , 48(4):1397–1424, 2019.[HSS11] Bernhard Haeupler, Barna Saha, and Aravind Srinivasan. New constructive aspects of the Lovász locallemma.

Journal of the ACM (JACM) , 58(6):1–28, 2011.[HSZ19] Jonathan Hermon, Allan Sly, and Yumeng Zhang. Rapid mixing of hypergraph independent sets.

RandomStructures & Algorithms , 54(4):730–767, 2019.[JPV20] Vishesh Jain, Huy Tuan Pham, and Thuy Duong Vuong. Towards the sampling Lovász Local Lemma. arXiv preprint arXiv:2011.12196 , 2020.[LP17] David A Levin and Yuval Peres.

Markov chains and mixing times , volume 107. American MathematicalSoc., 2017.[LS16] Eyal Lubetzky and Allan Sly. Information percolation and cutoﬀ for the stochastic Ising model.

Journalof the American Mathematical Society , 29(3):729–774, 2016.[Moi19] Ankur Moitra. Approximate counting, the Lovász local lemma, and inference in graphical models.

Journalof the ACM (JACM) , 66(2):1–25, 2019.[Mos08] Robin A Moser. Derandomizing the Lovász local lemma more eﬀectively. arXiv preprint arXiv:0807.2120 ,2008.[Mos09] Robin A Moser. A constructive proof of the Lovász local lemma. In

Proceedings of the forty-ﬁrst annualACM symposium on Theory of computing , pages 343–350, 2009.[MR98] Michael Molloy and Bruce Reed. Further algorithmic aspects of the local lemma. In

Proceedings of thethirtieth annual ACM symposium on Theory of computing , pages 524–529, 1998.[MT10] Robin A Moser and Gábor Tardos. A constructive proof of the general Lovász local lemma.

Journal ofthe ACM (JACM) , 57(2):1–15, 2010.[Sri08] Aravind Srinivasan. Improved algorithmic versions of the Lovász local lemma. In

Proceedings of the nine-teenth annual ACM-SIAM symposium on Discrete algorithms , pages 611–620. Citeseer, 2008.

Stanford University, Stanford, CA 94305, USA

Email address : {visheshj, huypham, tdvuong}@stanford.edu{visheshj, huypham, tdvuong}@stanford.edu