The Truncated & Supplemented Pascal Matrix and Applications


arXiv [math.CO], Feb

M. Hua∗, S. B. Damelin†, J. Sun‡, and M. Yu§
Department of Mathematics, University of Michigan. Mathematical Reviews, The American Mathematical Society. Australian National University.

Abstract

In this paper, we introduce the $k \times n$ (with $k \le n$) truncated, supplemented Pascal matrix, which has the property that any $k$ columns form a linearly independent set. This property is also present in Reed-Solomon codes; however, Reed-Solomon codes are completely dense, whereas the truncated, supplemented Pascal matrix has multiple zeros. If the maximum-distance-separable (MDS) code conjecture is correct, then our matrix has the maximal number of columns (with the aforementioned property) that the conjecture allows. This matrix has applications in coding, network coding, and matroid theory.

Finite field linear algebra is an important branch of linear algebra. Instead of the infinite field $\mathbb{R}$, it works with vectors whose entries come from a finite set of elements, each of which can be represented by a finite number of bits. It has thus motivated many practical coding techniques, such as Reed-Solomon codes [15] and linear network coding [9, 8]. It is also closely related to structural matroid theory [13] through matroid representability [13, 14, 5, 16].

One of the most important problems in finite field linear algebra is finding the size of the largest set of vectors in a $k$-dimensional vector space over a finite field such that every subset of $k$ vectors is linearly independent [1, 2]. From a matrix perspective, the problem is described as:

Problem 1.1.

Consider a finite field $\mathbb{F}_q$, where $q = p^h$, for $p$ a prime and $h$ a positive integer. Given a positive integer $k$, what is the largest integer $n$ such that there exists a $k \times n$ matrix $H$ over $\mathbb{F}_q$ in which every set of $k$ columns is linearly independent?

Such a matrix, when it exists, can serve as the generator matrix of an $[n, k]$ maximum-distance-separable (MDS) code [4], which can correct up to $d = n - k$ erasures or $t = d/2$ errors. We refer to $H$ as an MDS matrix. Its existence also determines the representability of uniform matroids, which we will discuss in detail in Section 4.3. The maximal value of $n$, according to the MDS conjecture, is $q + 1$, unless $q = 2^h$ and $k = 3$ or $k = q - 1$, in which case $n \le q + 2$. This conjecture has recently been proved for any $q = p$ by Ball [1, 2], but a complete proof remains open.

Therefore, it is crucial to understand the construction of $k \times (q + 1)$ MDS matrices. In the coding theory literature, many construction algorithms have been proposed to meet certain coding requirements. However, their computational complexity is not necessarily satisfactory. On the one hand, multiplications and additions over a large finite field are required in the matrix construction. On the other hand, the resultant MDS matrix may have a low sparsity (or high density), where sparsity is measured by the number of zeros in the matrix. For example, Reed-Solomon codes have no zeros in their generator matrices. A low sparsity can be translated into [...] supplemented Pascal matrix. A supplemented Pascal matrix can be generated by additions and, in particular, without multiplications. It also has a guaranteed number of zero entries for high sparsity. We will prove that a supplemented Pascal matrix is an MDS matrix in Section 3. We will then extend our results into a general code construction framework in Section 4.1, and then discuss its applications to network coding theory and matroid theory in Sections 4.2 and 4.3, respectively.

∗ For correspondence regarding this paper, email [email protected]. Support from the National Science Foundation under grants 0901145, 1160720, 1104696 and from the American Mathematical Society is gratefully acknowledged.
† [email protected].
‡ jeffjeff@umich.edu.
§ [email protected].

For clarity we should first label the elements of a finite field. Henceforth, let $p$ be a prime and $h$ be a positive integer. A finite field $\mathbb{F}_q$ contains $q = p^h$ elements, each represented by a polynomial $g(x) = \sum_{i=0}^{h-1} \beta_i x^i$, whose coefficients satisfy $\{\beta_i\}_{i=0}^{h-1} \in [0, p-1]$. Substituting $x = p$ into different polynomials $g(x)$ yields different values between $0$ and $q - 1$, which serve as intuitive indices of the corresponding elements. Specifically, we define an index function $\sigma_q(n)$:

Definition 2.1.

For any integer $n \in [0, q-1]$, $\sigma_q(n)$ is the element of $\mathbb{F}_q$ whose polynomial coefficients satisfy $\sum_{i=0}^{h-1} \beta_i p^i = n$.

For example, given $q = 2^3$, we have $\sigma_q(0) = 0$, $\sigma_q(1) = 1$, and $\sigma_q(5) = x^2 + 1$.

Based on $\sigma_q(n)$, we define a finite field binomial polynomial $f_m(n)$:
\[
f_m(n) =
\begin{cases}
[\sigma_q(n)]^m, & m = 0 \\
\prod_{i=1}^{m} \dfrac{\sigma_q(n) - \sigma_q(i-1)}{\sigma_q(i)}, & m > 0
\end{cases}
\tag{1}
\]
where $\{m, n\} \in [0, q-1]$ are non-negative integers. In particular, $f_0(n) = 1$ for every $n$ (with the convention $0^0 = 1$). Intuitively, $f_m(n)$ is a polynomial in $\sigma_q(n)$ of degree $m$.

Based on $f_m(n)$, we introduce the key matrix in this paper, called the Pascal matrix:

Definition 2.2.

The upper-triangular Pascal matrix $P_q$ over $\mathbb{F}_q$ is a $q \times q$ matrix with elements $P_q(m, n) = f_m(n)$:
\[
P_q =
\begin{bmatrix}
f_0(0) & f_0(1) & \cdots & f_0(q-1) \\
f_1(0) & f_1(1) & \cdots & f_1(q-1) \\
\vdots & \vdots & \ddots & \vdots \\
f_{q-1}(0) & f_{q-1}(1) & \cdots & f_{q-1}(q-1)
\end{bmatrix}
\tag{2}
\]
For brevity, we call the upper-triangular Pascal matrix the Pascal matrix.
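As an illustrative sketch (not part of the original construction), the Python snippet below evaluates $f_m(n)$ directly from its product definition and assembles $P_q$ for the special case where $q = p$ is prime, so that $\sigma_p(n)$ can be identified with the integer $n$ modulo $p$. The function names `pascal_matrix_mod_p` and `matches_binomials` are hypothetical, and computing the inverse of $\sigma_p(i)$ through Fermat's little theorem is an implementation choice assumed here. Prime-power fields would need genuine polynomial arithmetic, which this sketch does not attempt.

```python
from math import comb


def pascal_matrix_mod_p(p):
    """Return the p x p matrix with entries f_m(n) over F_p (p is assumed prime)."""
    rows = []
    for m in range(p):
        row = []
        for n in range(p):
            value = 1  # empty product, so f_0(n) = 1
            for i in range(1, m + 1):
                # sigma_p(i) = i is non-zero mod p for 1 <= i <= p - 1,
                # so its inverse is i^(p-2) mod p (Fermat's little theorem).
                inv_i = pow(i, p - 2, p)
                value = value * ((n - (i - 1)) % p) * inv_i % p
            row.append(value)
        rows.append(row)
    return rows


def matches_binomials(p):
    """Check that every entry equals the binomial coefficient C(n, m) mod p."""
    matrix = pascal_matrix_mod_p(p)
    return all(matrix[m][n] == comb(n, m) % p
               for m in range(p) for n in range(p))


if __name__ == "__main__":
    for row in pascal_matrix_mod_p(5):
        print(row)
```

For a prime $p$, the first $m$ entries of row $m$ vanish because $n = 0, \ldots, m-1$ are exactly the roots of $f_m$, and each entry coincides with the ordinary binomial coefficient $\binom{n}{m}$ reduced modulo $p$.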

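Note that the matrix index starts from 0. For example, when $q = 2^2 = 4$, we have:

Example 2.3.
\[
P_4 =
\begin{bmatrix}
1 & 1 & 1 & 1 \\
0 & 1 & x & x+1 \\
0 & 0 & 1 & x+1 \\
0 & 0 & 0 & 1
\end{bmatrix}
\]

The matrix $P_q$ is named after Pascal because its entries are binomial coefficients, as in the traditional Pascal matrix; the difference is that the entries are taken over $\mathbb{F}_q$ rather than $\mathbb{Z}_{\ge 0}$. Indeed, when $q = p$, $P_p$ is equal to the traditional Pascal matrix modulo $p$. For example, when $q = p = 5$:

Example 2.4.
\[
P_{5,\text{traditional}} =
\begin{bmatrix}
1 & 1 & 1 & 1 & 1 \\
0 & 1 & 2 & 3 & 4 \\
0 & 0 & 1 & 3 & 6 \\
0 & 0 & 0 & 1 & 4 \\
0 & 0 & 0 & 0 & 1
\end{bmatrix}
\quad \text{vs.} \quad
P_5 =
\begin{bmatrix}
1 & 1 & 1 & 1 & 1 \\
0 & 1 & 2 & 3 & 4 \\
0 & 0 & 1 & 3 & 1 \\
0 & 0 & 0 & 1 & 4 \\
0 & 0 & 0 & 0 & 1
\end{bmatrix}
\]

The Pascal matrix over $\mathbb{F}_p$ shares the same additive formula as the traditional Pascal matrix. Explicitly, $P_p(m, n) = P_p(m-1, n-1) + P_p(m, n-1) \pmod{p}$ for every pair $\{m, n\} \in [1, q-1]$. This idea appears in Section 4.2.

Definition 2.5.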

The truncated Pascal matrix $P_{q,k}$ is the Pascal matrix $P_q$ truncated to the first $k$ rows.

Example 2.6. [matrix not recoverable from the source]

Definition 2.7. A supplemented Pascal matrix, denoted by $H_{q,k}$, is a truncated Pascal matrix $P_{q,k}$ appended with a column vector $s_k$, which has a one in the bottom entry and zeros everywhere else:
\[
H_{q,k} =
\left[
\begin{array}{c|c}
 & 0 \\
P_{q,k} & \vdots \\
 & 0 \\
 & 1
\end{array}
\right]
\tag{3}
\]

Example 2.8. [matrix not recoverable from the source]

Our supplemented Pascal matrix has a desirable property, namely:

Theorem 2.9.

Any $k$ columns of $H_{q,k}$ are linearly independent.

We will first prove the following property of $P_{q,k}$, and then prove that $H_{q,k}$ preserves this property.

Lemma 3.1 (Truncation Lemma). Any $k$ columns of $P_{q,k}$ are linearly independent.

Proof. We first note that $P_q$ (and thus $P_{q,k}$) has two important properties (recall that $m$ begins at 0):

1. All the entries in the $m$-th row are defined by the same polynomial $f_m(n)$, which has degree $m$;
2. This polynomial has $m$ roots, which are $\{\sigma_q(n)\}_{n=0}^{m-1}$. Consequently, the first $m$ entries of the $m$-th row are all zeros.

Given a truncated Pascal matrix $P_{q,k}$, our hypothesis (to be disproved) is that there exist $k$ distinct values of $n$, say $\{n_0, n_1, \cdots, n_{k-1}\}$, such that columns $\{n_0, n_1, \cdots, n_{k-1}\}$ of $P_{q,k}$ constitute a linearly dependent set. In other words, if our hypothesis is valid, then there exists a $k \times k$ sub-matrix $M$ of $P_{q,k}$:
\[
M =
\begin{bmatrix}
f_0(n_0) & f_0(n_1) & \cdots & f_0(n_{k-1}) \\
f_1(n_0) & f_1(n_1) & \cdots & f_1(n_{k-1}) \\
\vdots & \vdots & \ddots & \vdots \\
f_{k-1}(n_0) & f_{k-1}(n_1) & \cdots & f_{k-1}(n_{k-1})
\end{bmatrix},
\tag{4}
\]
whose rank is smaller than $k$.

If this is the case, then there must exist a non-zero vector $a \in \mathbb{F}_q^k$ of length $k$ such that $a \times M = z$, where $z$ is an all-zero vector of length $k$:
\[
\underbrace{[\alpha_0, \alpha_1, \cdots, \alpha_{k-1}]}_{a} \times M = \underbrace{[0, 0, \cdots, 0]}_{z}
\]
Recall that the $m$-th row of $P_{q,k}$ (and thus $M$) is defined by $f_m(n)$. Correspondingly, $z$ is defined by:
\[
f'(n) \triangleq \sum_{m=0}^{k-1} \alpha_m f_m(n),
\]
with $z(j) = f'(n_j) = 0$ for all $j \in [0, k-1]$. The degree of $f'(n)$ is at most $k-1$, because the highest degree of its summands is the degree of $f_{k-1}(n)$, which is $k-1$.

In summary, if columns $\{n_0, n_1, \cdots, n_{k-1}\}$ of $P_{q,k}$ constitute a linearly dependent set, then we obtain a polynomial $f'(n)$ such that:

• Its degree is at most $k-1$;
• It has $k$ roots, whose values are $\{\sigma_q(n_0), \sigma_q(n_1), \cdots, \sigma_q(n_{k-1})\}$.

However, with a degree of at most $k-1$, $f'(n)$ can have at most $k-1$ roots unless $f'(n) = 0$. The latter is not the case, because $a$ is non-zero and the polynomials $f_0, \ldots, f_{k-1}$ have distinct degrees and are therefore linearly independent. Hence, $f'(n)$ does not exist, and thus our hypothesis is invalid. Therefore, every $k$ columns of $P_{q,k}$ are linearly independent. This completes the proof of the lemma.

Since $H_{q,k}$ is constructed by appending $s_k$ to $P_{q,k}$, to prove Theorem 2.9 we only need to prove that any $k-1$ columns of $P_{q,k}$ together with $s_k$ never constitute a linearly dependent set. To see this, we can simply use $s_k$ to linearly cancel the first $q$ entries in the last row of $H_{q,k}$. This transforms $H_{q,k}$ from (3) into:
\[
H'_{q,k} =
\begin{bmatrix}
P_{q,k-1} & \mathbf{0} \\
0 \; \cdots \; 0 & 1
\end{bmatrix}
\tag{5}
\]
which indicates that $s_k$ is orthogonal to all the other columns of $H'_{q,k}$. Then, by applying the truncation lemma to $P_{q,k-1}$, we know that every $k-1$ of the first $q$ columns of $H'_{q,k}$ are linearly independent. Adding $s_k$ to them yields a linearly independent set of $k$ columns. Theorem 2.9 is thus proved.

The truncation lemma can be immediately generalized to any appropriately defined $k \times n$ matrix that satisfies: 1) $n \le q$, and 2) the $m$-th row ($m \in [0, k-1]$) is defined by a polynomial of degree $m$. For example, by setting $f_m(n) = \sigma_q(n)^m$, we obtain a $k \times n$ matrix over $\mathbb{F}_q$ such that every set of $k$ columns is linearly independent. Indeed, this matrix is the generator matrix $G$ of an $(n, k)$ Reed-Solomon code:
\[
G =
\begin{bmatrix}
\sigma_q(1)^0 & \sigma_q(2)^0 & \cdots & \sigma_q(n)^0 \\
\sigma_q(1)^1 & \sigma_q(2)^1 & \cdots & \sigma_q(n)^1 \\
\sigma_q(1)^2 & \sigma_q(2)^2 & \cdots & \sigma_q(n)^2 \\
\vdots & \vdots & \ddots & \vdots \\
\sigma_q(1)^{k-1} & \sigma_q(2)^{k-1} & \cdots & \sigma_q(n)^{k-1}
\end{bmatrix}
\]
Then, by appending $s_k$, we can obtain an $[n+1, k]$ Reed-Solomon code. Therefore, our polynomial approach is a general approach to constructing non-trivial $[n, k]$ MDS codes.
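Theorem 2.9 can be sanity-checked numerically for small parameters. The sketch below assumes $q = p$ is prime, so that the entries of $P_{p,k}$ are binomial coefficients modulo $p$; it builds $H_{p,k}$ and tests every choice of $k$ columns by Gaussian elimination over $\mathbb{F}_p$. The helper names (`supplemented_pascal`, `rank_mod_p`, `every_k_columns_independent`) are hypothetical, and the exhaustive search is only practical for small $p$ and $k$. It illustrates the statement and does not replace the proof.

```python
from itertools import combinations
from math import comb


def supplemented_pascal(p, k):
    """H_{p,k}: first k rows of the Pascal matrix mod p, plus the column (0, ..., 0, 1)."""
    rows = [[comb(n, m) % p for n in range(p)] for m in range(k)]
    for m in range(k):
        rows[m].append(1 if m == k - 1 else 0)  # supplemented column s_k
    return rows


def rank_mod_p(matrix, p):
    """Rank of a matrix over F_p (p prime) by Gauss-Jordan elimination."""
    a = [[x % p for x in row] for row in matrix]
    n_rows, n_cols = len(a), len(a[0])
    rank = 0
    for col in range(n_cols):
        pivot = next((r for r in range(rank, n_rows) if a[r][col]), None)
        if pivot is None:
            continue  # no pivot in this column
        a[rank], a[pivot] = a[pivot], a[rank]
        inv = pow(a[rank][col], p - 2, p)  # inverse of the pivot mod p
        a[rank] = [x * inv % p for x in a[rank]]
        for r in range(n_rows):
            if r != rank and a[r][col]:
                factor = a[r][col]
                a[r] = [(x - factor * y) % p for x, y in zip(a[r], a[rank])]
        rank += 1
    return rank


def every_k_columns_independent(matrix, p):
    """True if every set of k columns of a k x n matrix has rank k over F_p."""
    k, n = len(matrix), len(matrix[0])
    for cols in combinations(range(n), k):
        sub = [[matrix[r][c] for c in cols] for r in range(k)]
        if rank_mod_p(sub, p) < k:
            return False
    return True


if __name__ == "__main__":
    print(every_k_columns_independent(supplemented_pascal(5, 3), 5))
```

Calling `every_k_columns_independent(supplemented_pascal(p, k), p)` for several small primes $p$ and all $1 \le k \le p$ is a quick way to exercise the theorem on concrete matrices.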
This construction also indicates that the maximum length of any MDS code is at least $q + 1$ for any $k \le q$. This result agrees with the MDS conjecture [1, 2].

Among all the possible constructions, the supplemented Pascal matrix $H_{q,k}$ enjoys a high sparsity, which is the number of zeros in the matrix. Higher sparsity is advantageous, because it generally leads to easier encoding/decoding. However, the sparsity has an upper bound. In the following lemma, we will prove that $H_{q,k}$ approximates this bound within a factor of $\frac{1}{2}$:

Lemma 4.1 (Matrix Sparsity). The number of zeros in the supplemented Pascal matrix $H_{q,k}$ is $\frac{1}{2}$ of the maximum sparsity of any $(n, k)$ code.

Proof. Let $G$ be the generator matrix of an $(n, k)$ MDS code. Since any $k \times k$ sub-matrix of $G$ has rank $k$, there is no all-zero row in such a sub-matrix. Hence, there are at most $k-1$ zeros in each row of $G$, and at most $k^2 - k$ zeros in total. Recall that in $H_{q,k}$ the $m$-th row ($m \in [0, k-1]$) begins with $m$ zeros. The total number of these zeros is $\frac{k^2 - k}{2}$, which is half of the maximum.

4.2 Network Coding Theory

Network coding (NC) is a class of packet-based coding techniques. Consider a block of $K \ge 1$ data packets $\{x_k\}_{k=0}^{K-1}$, each containing $L$ bits of information. NC treats these data packets as $K$ variables, and sends in the $u$-th ($u \in [0, +\infty)$) transmission a linear combination $y_u$ of all of them:
\[
y_u = \sum_{k=0}^{K-1} \alpha_{k,u} x_k,
\tag{6}
\]
where the coefficients $\{\alpha_{k,u}\}_{k=0}^{K-1}$ are elements of a finite field $\mathbb{F}_q$.

Ideally, NC allows any receiver that has received any $K$ coded packets to decode all $K$ data packets by solving a set of $K$ linear equations. To this end, the associated coefficient matrix $C$, where
\[
C =
\begin{bmatrix}
\alpha_{0,0} & \alpha_{0,1} & \cdots & \cdots \\
\alpha_{1,0} & \alpha_{1,1} & \cdots & \cdots \\
\vdots & \vdots & \ddots & \vdots \\
\alpha_{K-1,0} & \alpha_{K-1,1} & \cdots & \cdots
\end{bmatrix},
\tag{7}
\]
must satisfy that every set of $K$ columns is linearly independent. Once this condition is met, NC is able to achieve the optimal throughput in wireless broadcast scenarios [16].

However, it is highly non-trivial to meet this condition, which hinders the implementation of NC.
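To make equation (6) and the decoding step concrete, here is a minimal sketch over a prime field $\mathbb{F}_p$. It assumes, purely for illustration, that the coefficient of packet $k$ in transmission $u$ is $\alpha_{k,u} = \binom{u}{k} \bmod p$ (column $u$ of the truncated Pascal matrix from Section 2), with $K \le p$ and $u \in [0, p-1]$, and that each packet is a list of symbols in $[0, p-1]$. The function names `encode` and `decode` are hypothetical.

```python
from math import comb


def encode(packets, u, p):
    """Coded packet y_u = sum_k alpha_{k,u} x_k over F_p, with alpha_{k,u} = C(u, k) mod p."""
    coeffs = [comb(u, k) % p for k in range(len(packets))]
    length = len(packets[0])
    return [sum(c * pkt[j] for c, pkt in zip(coeffs, packets)) % p
            for j in range(length)]


def decode(received, K, p):
    """Recover the K data packets from K pairs (u, y_u) with distinct indices u.

    Each pair gives one linear equation sum_k C(u, k) x_k = y_u over F_p;
    the system is solved by Gauss-Jordan elimination on the augmented matrix.
    """
    rows = [[comb(u, k) % p for k in range(K)] + list(y) for u, y in received]
    for col in range(K):
        # A pivot exists because any K distinct columns of the truncated
        # Pascal matrix are linearly independent (truncation lemma).
        pivot = next(r for r in range(col, K) if rows[r][col])
        rows[col], rows[pivot] = rows[pivot], rows[col]
        inv = pow(rows[col][col], p - 2, p)  # inverse of the pivot mod p
        rows[col] = [x * inv % p for x in rows[col]]
        for r in range(K):
            if r != col and rows[r][col]:
                factor = rows[r][col]
                rows[r] = [(x - factor * y) % p
                           for x, y in zip(rows[r], rows[col])]
    return [row[K:] for row in rows]


# Example: K = 3 packets of 3 symbols each over F_7.
p = 7
packets = [[3, 1, 4], [1, 5, 2], [6, 5, 3]]
received = [(u, encode(packets, u, p)) for u in (6, 1, 4)]
recovered = decode(received, len(packets), p)
```

In this sketch a receiver only needs the transmission index $u$ alongside each coded packet in order to rebuild its coefficient row, and any $K$ coded packets with distinct indices suffice to recover the block.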
Two practical obstacles stand out. First, to guarantee linear independence, the sender either chooses coefficients randomly from a sufficiently large $\mathbb{F}_q$ [11, 7] or regularly collects receiver feedback to make online coding decisions [6]. While a large $\mathbb{F}_q$ incurs heavy computational loads, collecting feedback could be expensive or even impossible in certain circumstances, such as time-division-duplex satellite communications [11]. Second, to enable decoding, the coding coefficients must be attached to each coded packet, which constitutes $\lceil K \log_2 q \rceil$ bits of overhead in each transmission. When $q$ is large and $L$ is small, the throughput loss due to the overhead may overwhelm all the other benefits of NC.

These practical shortcomings of NC can be overcome by the proposed supplemented Pascal matrix. By choosing a sufficiently large $p$ and letting $C = H_{p,K}$, we obtain an NC scheme that is both computationally friendly (only mod-$p$ operations) and feedback-free. Moreover, for the receivers to retrieve the coding coefficients, the sender only needs to attach the index $u$ to the $u$-th packet, rather than attaching the complete coefficients. Furthermore, the additive formula for the Pascal matrix may enable efficient progressive coding/decoding algorithms, which could be a future research direction.

A matroid $M = (E, \mathcal{I})$ is a finite collection of elements called the ground set, $E$, paired with its comprehensive set of independent subsets, $\mathcal{I}$. A uniform matroid $U_n^k$ has $|E| = n$ and the property that any size-$k$ subset of $E$ is an element of $\mathcal{I}$ and no size-$(k+1)$ subset is in $\mathcal{I}$. $U_n^k$ is called $q$-representable if there is a $k \times n$ matrix such that every $k$ columns of it are linearly independent over $\mathbb{F}_q$.

Corollary 4.2 (Representability of Uniform Matroid). Any uniform matroid $U_n^k$ that satisfies $n \le q + 1$ is $q$-representable by any $n$ columns of $H_{q,k}$.

The statement:

“Any uniform matroid $U_n^k$ that satisfies $n \le q + 1$ is $q$-representable” is known [13, 3, 15]; one can obtain another construction from Reed-Solomon codes. $H_{q,k}$ is just another, sparse example.

In this paper, we proposed the supplemented Pascal matrix, whose first $k$ rows form an MDS matrix over $\mathbb{F}_q$ for any prime power $q$ and positive integer $k \le q$. Our construction can potentially be generalized to a framework that enables low-complexity MDS code construction as well as encoding/decoding. Our matrix can overcome some practical shortcomings of network coding and thus enables high-performance network-coded wireless packet broadcast. Our matrix agrees with existing results on the representability of uniform matroids, while also providing new insights into this topic. In the future, we intend to study Pascal-based network coding algorithms. We are also interested in applying our results to other fields such as projective geometry and graph theory.

References

[1] S. Ball, “On large subsets of a finite vector space in which every subset of a basis size is a basis”, J. Eur. Math. Soc., no. 2, pp. 733-748, 2012.
[2] S. Ball and J. De Beule, “On sets of vectors of a finite vector space in which every subset of basis size is a basis II”, Des. Codes Cryptogr., vol. 65, no. 1-2, pp. 5-14, 2012.
[3] S. Ball, “Finite Geometry and Combinatorial Applications”, London Mathematical Society Student Texts (82), Cambridge University Press.
[4] D. Costello and S. Lin, Error control coding, Pearson-Prentice Hall Press, 2004.
[5] S. El Rouayheb, A. Sprintson and C. Georghiades, “On the index coding problem and its relation to network coding and matroid theory”, IEEE Trans. Information Theory, vol. 56, no. 7, pp. 3187-3195, 2010.
[6] C. Fragouli, D. Lun, M. Medard and P. Pakzad, “On feedback for network coding”, IEEE Conf. Information Sciences and Systems (CISS), 2007, pp. 248-252.
[7] J. Heide, M. V. Pedersen, F. H. Fitzek and T. Larsen, “Network coding for mobile devices - systematic binary random rateless codes”, IEEE Int. Conf. Communications (ICC) Workshop, 2009, pp. 1-6.
[8] T. Ho, M. Medard, R. Koetter, D. R. Karger, M. Efros, J. Shi and B. Leong, “A random linear network coding approach to multicast”, IEEE Trans. Information Theory, vol. 52, no. 10, pp. 4413-4430, 2006.
[9] S. Y. Li, R. W. Yeung and N. Cai, “Linear network coding”, IEEE Trans. Information Theory, vol. 49, no. 2, pp. 371-381, 2003.
[10] R. Lidl and H. Niederreiter, “Finite Fields”, Encyclopedia of Mathematics and its Applications, Cambridge University Press (1997).
[11] D. E. Lucani, M. Medard and M. Stojanovic, “Random linear network coding for time-division duplexing: field size considerations”, IEEE Global Telecommunication Conf. (GlobalComm), 2009, pp. 1-6.
[12] L. Moura, G. L. Mullen and D. Panario, “Finite field constructions of combinatorial arrays”, Des. Codes Cryptogr. (2016) 78: pp. 197-219.
[13] J. G. Oxley, Matroid theory, Oxford University Press, 2006, vol. 3.
[14] J. Oxley, D. Vertigan and G. Whittle, “On inequivalent representations of matroids over finite fields”, Journal of Combinatorial Theory, Series B, vol. 67, no. 2, pp. 325-343, 1996.
[15] I. S. Reed and G. Solomon, “Polynomial codes over certain finite fields”, Journal of the Society for Industrial and Applied Mathematics, vol. 8, no. 2, pp. 300-304, 1960.
[16] M. Yu, P. Sadeghi and N. Aboutorab, “On deterministic linear network coded broadcast and its relation to matroid theory”,