[PDF] Error Correcting Codes, finding polynomials of bounded degree agreeing on a dense fraction of a set of points

Abstract

Here we present some revised arguments to a randomized algorithm proposed by Sudan to find the polynomials of bounded degree agreeing on a dense fraction of a set of points in F 2 for some field F .

Full PDF

aa r X i v : . [ c s . S C ] J un Error Correcting Codes

Priyank DeshpandeJuly 2, 2020

Abstract

Here we present some revised arguments to a randomized algorithm proposed by Sudan toﬁnd the polynomials of bounded degree agreeing on a dense fraction of a set of points in F for some ﬁeld F . Here we will discuss some concepts in the ﬁeld of error-correcting codes.

Deﬁnition 1.

Given Σ a collection of symbols, and x, y ∈ Σ n . We deﬁne the hamming distance between x and y denoted as HD ( x, y ) as |{ i ∈ [ n ] : ( x ) i = ( y ) i }| . That is, the number of indices at which x and y diﬀer. Example 1.

Given

Σ = { , , } and x = “201” , and y = “222” . We have that HD ( x, y ) = 1 . Deﬁnition 2.

Let Σ be a collection of symbols and n, k, δ ∈ Z . We say C ⊂ Σ n is a [ n, k, δ ] code if |C| = | Σ | k ,and ∀ x, y ∈ C , HD ( x, y ) ≥ δ . Deﬁnition 3.

Let Σ be a collection of symbols and C a [ n, k, δ ] . If τ ∈ Z : 2 τ + 1 ≤ δ , then we say C is a τ error correcting code. Deﬁnition 4.

Let F be a ﬁnite ﬁeld of cardinailty n . Let C be a [ n, d + 1 , n − d ] code of an alphabet Σ , wesay C is a Reed-Solomon Code if C = { “ p (0) | p ( w ) | . . . | p ( w | F |− )” : p ( x ) ∈ F [ x ] , deg( p ) ≤ d } . Here | denotesstring concatenation, and w ∈ F , is a generator of F ∗ . Deﬁnition 5.

We will refer the maximum-likelihood decoding problem as the following task: Given a [ n, k, δ ] code, a string s ∈ Σ n , a string in c ∈ C such that HD ( s, c ) ≤ HD ( s, x ) , ∀ x ∈ C . We will refer to the listdecoding problem as: Given a string s ∈ Σ n , a [ n, k, δ ] code, and a parameter τ ∈ N , return all c ∈ C suchthat HD ( s, c ) ≤ τ , Remark 1.

We will present a randomized algorithm by Sudan-’96, for the following problem: Given a ﬁeld F , { ( x i , y i ) } ni =1 ⊂ F and parameters t, d ∈ N , ﬁnd all f ( x ) ∈ F [ x ] such that |{ i ∈ [ n ] : f ( x i ) = y i ) | ≥ t , and deg( f ) ≤ d . Remark 2.

We deﬁne concept of weighted degree which will be relevant to the randomized algorithm to bepresented. Given ( w x , w y ) ∈ Z which we will call weights and a bivariate monomial in x, y , c ij x i y j , we saythe weighted degree of such a monomial is i · w x + j · w y . Given a bivariate polynomial Q ( x, y ) ∈ F [ x, y ] , wesay the weighted degree of Q ( x, y ) = P i,j c ij x i y j to be the maximum of the weighted degrees of its monomials. Algorithm 1.

Deﬁne the following randomized algorithm: Let { ( x i , y i ) } i ∈ [ n ] , d, t ∈ N be inputs to thealgorithm, and m, l parameters to be determined to optimize the algorithm. Then: • Find a P ( x, y ) ∈ F [ x, y ] such that P ( x, y ) has weighted degree with weights (1 , d ) at most m + l · d , P ( x, y ) is not identically zero, and P ( x, y ) vanishes on { ( x i , y i ) } i ∈ [ n ] . That is, P ( x i , y i ) = 0 , ∀ i ∈ [ n ] . (1) 1 Factor P ( x, y ) into irreducible polynomials in F [ x, y ] . (2) • Check all functions f ( x ) ∈ F [ x ] of degree at most d , such that ( y − f ( x )) | P ( x, y ) , and f ( x i ) = y i forat least t distinct choices of i ∈ [ n ] . (3) Remark 3.

We will justify that this algorithm runs in polynomial time.

Proposition 1.

The polynomial as described in step (1) can be found in polynomial time, with respect tothe size of the ﬁeld, if such a polynomial exists.Proof.

By the conditions imposed by the weighted degree constraint, we can write P ( x, y ) ∈ F [ x, y ] as P ( x, y ) = P lj =0 P m +( l − j ) di c ij x i y j because j ≤ l , i ≤ m + ( l − j ) d implies that ( i, j ) · (1 , d ) ≤ ( m + ( l − j ) d, j ) · (1 , d ) = m + ld , which is the weighted degree of P ( x, y ). To ﬁnd the polynomial which satiﬁes theconditions in (1), we require that P lj =0 P m +( l − j ) di =0 c ij ( x k ) i ( y k ) j = 0 , ∀ k ∈ [ n ]. Let | F | = N . Using a bruteforce approach to determine the appropriate values of c ij , we can obtain a solution in O ( n · N ( m + ld ) l ), whichis polynomial in N for ﬁxed parameters l, m, d . However, this can be solved in polynomial time with respectto the number of constraints n . Proposition 2.

If the parameters m, l are such that ( m + 1)( l + 1) + d (cid:0) l +12 (cid:1) > n , then a function P ( x, y ) ∈ F [ x, y ] as described in (1) exists.Proof. Let η = ( m +1)( l +1)+ d (cid:0) l +12 (cid:1) Note that if P ( x, y ) ∈ F [ x, y ] is deﬁned as P ( x, y ) = P lj =0 P m +( l − j ) di =0 c ij x i y j ,then there are η many x ij ’s. To ﬁnd the polynomial P ( x, y ), we need to solve the system A~x = 0, where ~x represents the c ij , and A has dimensions n × η . So this amounts to ﬁnding the null space of A . Under theassumption that η > n , we have that dim( N ( A )) ≥

1, where N ( A ) is the null space of A . Hence, we maychoose a y ∈ N ( A ) \ { } to obtain the desired c ij ’s. Proposition 3. If P ( x, y ) ∈ F [ x, y ] satisﬁes (1), and f ( x ) ∈ F [ x ] satisﬁes |{ i ∈ [ n ] : f ( x i ) = y i }| ≥ t , and t > m + ld , then y − f ( x ) divides P ( x, y ) . Remark 4.

Let f ( x ) ∈ F [ x ] . Denote the condition |{ i ∈ [ n ] : f ( x i ) = y i }| ≥ t as (*), and say f ( x ) satiﬁes(*) should it be the caseProof. Let f ( x ) ∈ F [ x ] satisfy (*). We claim that P ( x, f ( x )) is identically zero. Since P ( x, y ) has (1 , d )weighted degree at most m + ld , we have that P ( x, f ( x )) (as a uni-variate polynomial) has degree at most m + ld since f ( x ) has degree at most d . However P ( x, f ( x )) = 0 whenever x = x i for some i ∈ [ n ]. If f ( x ) satisﬁes (*), then there are at least t zeros. Under the assumption that t > m + ld , we have that thenumber of roots of P ( x, f ( x )) is greater than its degree, so P ( x, f ( x )) ≡

0. Consider P ( x, y ) = P x ( y ) = P l − j =0 P j ( x ) y j : P j ( x ) ∈ F [ x ]. Since P x ( f ( x )) = 0, we have that P x ( y ) has a root f ( x ). By the divisionalgorithm, ( y − f ( x )) divides P x ( y ) = P ( x, y ), which is the claim. Remark 5.

It remains to choose the parameters m, l such that t > m + ld and ( m + 1)( l + 1) + d (cid:0) l +12 (cid:1) > n ,We can rephrase the condition to be ( m + 1)( l + 1) + d (cid:0) l +12 (cid:1) ≥ n + 1 Observe that this condition yields m ≥ n +1 − d ( l +12 ) l +1 − . Suppose that we want t ≥ m + ld +1 = ⇒ t ≥ n +1 − d ( l +12 ) l +1 + ld = n +1 l +1 − dl + dl = n +1 l +1 + dl .To ﬁnd the minimum of this function with respect to l , we perform a ﬁrst derivative which yields that − ( n +1)( l +1) + d = 0 = ⇒ l = q n +1) d − . Substituting the expression for l in for the expression on m , weobtain that m ≥ n +1 − d ( l +12 ) l +1 − n +1 l +1 − dl − q d ( n +1)2 − (cid:18)q d ( n +1)2 − d (cid:19) − d − . This yields forthe condition on t that t ≥ m + ld + 1 ≥ d + d · (cid:18)q n +1) d − (cid:19) = d + p n + 1) d − d = p n + 1) d − d .This will allow us to make the following claim, which follows from the previous propositions. Corollary 1.

Given a ﬁeld F and a set of points { ( x i , y i ) } i ∈ [ n ] ⊂ F , and paramters d, t ∈ N such that t ≥ d ·⌈ q n +1) d ⌉−⌊ d ⌋ , then there is a polynomial time algorithm in n which ﬁnds all polynomials f ( x ) ∈ F [ x ] which satisfy (*), and have degree at most d . roof. Setting m = ⌊ d ⌋ − l = ⌈ q n +1) d ⌉ − m + 1)( l + 1) + d (cid:0) l +12 (cid:1) ≥ n + 1. Byproposition 2, a function P ( x, y ) not identically zero satisfying that P ( x i , y i ) = 0 , ∀ i ∈ [ n ] exists. Under theassumption that t ≥ d · ⌈ q n +1) d ⌉ − ⌊ d ⌋ > m + ld , we have that ( y − f ( x )) divides P ( x, y ) should such an f ( x ) ∈ F [ x ] satisfy (*). By step 3 in the algorithm, f ( x ) will be reported as output. Deﬁnition 6.

Denote the tuple of k variables ( x , . . . , x k ) = ~x . Let F be a ﬁeld, H ⊂ F , and g : H k −→ F .Given parameters t, d ∈ N , output all polynomials f of degree at most d such that (cid:12)(cid:12) { ~x ∈ H k : f ( ~x ) = g ( ~x ) } (cid:12)(cid:12) ≥ t . Here deﬁne the degree of f to be the maximum degree of its monomials. Remark 6.

Let f ( x ) ∈ F [ ~x ] . We say f ( x ) satisﬁes the condition (*) if (cid:12)(cid:12) { ~x ∈ H k : f ( ~x ) = g ( ~x ) } (cid:12)(cid:12) ≥ t . Deﬁnition 7.

Generalize the deﬁnition of weight degree to an n -variate polynomial. First, the ( w , . . . , w n ) weighted degree of a monomial Q ni =1 x d i i is deﬁned to be P ni =1 w i d i . Deﬁne the ( w , . . . , w n ) weighted degreeof an n -variate polynomial to be the maximum of the weighted degrees of its monomials (which have non-zerocoeﬃcients). Algorithm 2.

Deﬁne the following algorithm. Let

F, H, k, t, d, g be as in deﬁnition , and m, l ∈ N beparameters to be determined. • Find a P ( x , . . . , x k , y ) ∈ F [ x , . . . , x k , y ] such that P ( ~x, y ) has weighted degree with weights (1 , . . . , , d ) at most m + l · d , P ( ~x, y ) is not identically zero, and P ( ~x, y ) vanishes on { ( ~x, g ( ~x )) : ~x ∈ H k } . Thatis, P ( ~x, g ( ~x )) = 0 , ∀ ~x ∈ H k . • Factor P ( ~x, y ) into irreducible polynomials in F [ ~x, y ] . (2) • Check all functions f ( ~x ) ∈ F [ ~x ] of degree at most d , such that ( y − f ( ~x )) | P ( ~x, y ) , and f ( ~x ) = g ( ~x ) for at least t distinct choices of ~x ∈ H k . (3) Remark 7.

This is more or less a generalization of the uni-variate case. We will state conditions for theexistence of P ( ~x, y ) , and show that a polynomial f ( ~x ) ∈ F [ ~x ] satisfying (*) will be such that y − f ( ~x ) | P ( ~x, y ) .Let | H | = h . Proposition 4. If m + ld ≥ k ( h − , then a non-trivial polynomial P ( ~x, y ) ∈ F [ ~x, y ] vanishing on S = { ( ~x, f ( ~x )) ∈ F k +1 : ~x ∈ H k } exists.Proof. We want to show that the number of monomials of a k + 1 variate weighted degree polynomial isgreater than | H | k . Then we can apply a similar argument for there being a non-trivial solution to the systemof linear equations A~z = ~ P ( ~x, y ). We observe that apolynomial of (1 , . . . , , d ) weighted degree m + ld contains P lj =0 (cid:0) m +( l − j ) d + kk (cid:1) monomials. This is because P ( ~x, y ) = P lj =0 P j ( ~x ) y j , where P j has total degree at most m + ld − jd = m +( l − j ) d . Hence, let M ( Q ) denotethe number of distinct monomials of a polynomial Q . Then M ( P ) = P lj =0 M ( P j ) = P lj =0 (cid:0) m +( l − j ) d + kk (cid:1) ,which implies the claim. Now we would like M ( P ) > h k . To do this, we provide some lower bounds.Observe that M ( P ) = P lj =0 (cid:0) m +( l − j ) d + kk (cid:1) ≥ P lj =0 (cid:16) m +( l − j ) d + kk (cid:17) k ≥ (cid:0) m + ld + kk (cid:1) k + l · (cid:0) m + kk (cid:1) k > (cid:0) m + ld + kk (cid:1) k ≥ (cid:16) k ( h − kk (cid:17) k = h k , which proves the proposition. Proposition 5. If t > ( m + ld ) h k − , where t is the number of agreements of a k -variate polynomial f onthe set S = { ( ~x, g ( ~x )) ∈ F k +1 : ~x ∈ H k } , then y − f ( ~x ) | P ( ~x, y ) .Proof. We observe that θ f ( ~x ) = P ( ~x, f ( ~x )) is a k -variate polynomial of total degree m + ld . Let Z ( Q, S ) = { x ∈ S : Q ( x ) = 0 } . By the Schwartz-Zippel Lemma, if (cid:12)(cid:12) Z ( θ f , H k ) (cid:12)(cid:12) > deg( θ f ) · | H | k − , then θ f = P ( ~x, f ( ~x )) ≡

0. But t = (cid:12)(cid:12) Z ( θ f , H k ) (cid:12)(cid:12) > deg( θ f ) · | H | k − = ( m + ld ) h k − by assumption so P ( ~x, f ( ~x )) ≡ y − f ( ~x ) | P ( ~x, y ), since it is a root of P ( ~x, y ). Lemma 1. (Schwartz-Zippel) Let p ( x , . . . , x n ) ∈ F [ x , . . . , x n ] be a polynomial of total degree d that is notequivalently . Let | S | ⊂ F be an arbitrary ﬁnite subset of the ﬁeld F . Then P ~x ∈ R S k [ p ( ~x ) = 0] ≤ d | S | . roof. We proceed by induction. Considering the uni-variate case yields that P x ∈ R S [ p ( x ) = 0] ≤ d | S | . Thisis true because p has degree d and since p is not identically 0 we have that there are at most d roots in F , and hence there are at most d roots of p in a ﬁnite subset S ⊂ F . Let k = deg x n ( p ). Then we maywrite p ( x ) = x kn q ( x , . . . , x n − ) + r ( x , . . . , x n ), where q ( x , . . . , x n − ) has total degree at most d − k and r ( x , . . . , x n ) has x n degree strictly less than k . For a ~x ∈ R S k , we have that P [ p ( ~x ) = 0] = P [ p ( ~x ) = 0 | q ( ~x ) = 0] P [ q ( ~x ) = 0] + P [ p ( ~x ) = 0 | q ( ~x ) = 0] P [ q ( ~x ) = 0]by Bayes Formula. But P [ p = 0] ≤ P [ q = 0] + P [ p = 0 | q = 0] ≤ d − k | S | + k | S | = d | S | , by the inductive hypothesis.This completes the proof. Remark 8.

There is some subtlety to the fact that P [ p = 0 | q = 0] ≤ k | S | . This is because if we are giventhat q = 0 , then p ( x , x , . . . , x n ) considered in that regard becomes a uni-variate polynomial of degree k in the variable x n that is not identically , as we are ﬁxing that x , . . . , x n − , and we then may apply theinductive hypothesis. The application of the Schwartz-Zippel Lemma as presented here to proposition isthat the contrapositive of the statement of lemma is suﬃcient to deduce that θ f ≡ as it has more than deg( f ) · h k − roots on H k . Theorem 1.

If the parameters d, t, | H | = h, k ∈ N , are such that tdh k − > k ( h − d , and the open interval (cid:16) k ( h − d , tdh k − (cid:17) contains a positive integer, then Algorithm 2 as described above outputs all the desiredpolynomials.Proof. Note that to obtain the non-trivial polynomial P ( ~x, y ) ∈ F [ x , . . . , x k , y ] which vanishes on S , weneed by proposition 4 that m + ld ≥ k ( h − f ( ~x ) ∈ F [ x , . . . , x k ] of degree atmost d having at least t agreements on S satisﬁes that y − f ( x ) | P ( ~x, y ), we require that ( m + ld ) < th k − .Hence, we want m, l ∈ N such that th k − > m + ld > k ( h − m = 0,and ﬁnding the appropriate l , we obtain that such an l exists precisely when I = (cid:16) k ( h − d , tdh k − (cid:17) containsa positive integer. Assuming that the given parameters are such that l exists, by proposition 4, we obtain anon-zero polynomial vanishing on S . and by proposition 5, we have that a polynomial f ∈ F [ ~x ] of degree atmost d having at least t agreements on S will have y − f divide P ( ~x, y ). Hence, Algorithm 2 will return thedesired polynomials. ••