[PDF] New upper bounds for (b,k)-hashing

Abstract

For fixed integers b\geq k, the problem of perfect (b,k)-hashing asks for the asymptotic growth of largest subsets of \{1,2,\ldots,b\}^n such that for any k distinct elements in the set, there is a coordinate where they all differ. An important asymptotic upper bound for general b, k, was derived by Fredman and Koml\'os in the '80s and improved for certain b\neq k by K\"orner and Marton and by Arikan. Only very recently better bounds were derived for the general b,k case by Guruswami and Riazanov, while stronger results for small values of b=k were obtained by Arikan, by Dalai, Guruswami and Radhakrishnan and by Costa and Dalai. In this paper, we both show how some of the latter results extend to b\neq k and further strengthen the bounds for some specific small values of b and k. The method we use, which depends on the reduction of an optimization problem to a finite number of cases, shows that further results might be obtained by refined arguments at the expense of higher complexity.

Full PDF

NNew upper bounds for ( 𝑏, 𝑘 ) -hashing Stefano Della Fiore, Simone Costa, Marco Dalai,

Department of Information Engineering, University of Brescia{s.dellaﬁore001, simone.costa, marco.dalai}@unibs.it

Abstract —For ﬁxed integers 𝑏 ≥ 𝑘 , the problem of perfect ( 𝑏, 𝑘 ) -hashing asks for the asymptotic growth of largest subsetsof { , , . . . , 𝑏 } 𝑛 such that for any 𝑘 distinct elements in the set,there is a coordinate where they all differ.An important asymptotic upper bound for general 𝑏, 𝑘 , wasderived by Fredman and Komlós in the ’80s and improved forcertain 𝑏 ≠ 𝑘 by Körner and Marton and by Arikan. Only veryrecently better bounds were derived for the general 𝑏, 𝑘 caseby Guruswami and Riazanov, while stronger results for smallvalues of 𝑏 = 𝑘 were obtained by Arikan, by Dalai, Guruswamiand Radhakrishnan and by Costa and Dalai.In this paper, we both show how some of the latter resultsextend to 𝑏 ≠ 𝑘 and further strengthen the bounds for somespeciﬁc small values of 𝑏 and 𝑘 . The method we use, whichdepends on the reduction of an optimization problem to a ﬁnitenumber of cases, shows that further results might be obtainedby reﬁned arguments at the expense of higher complexity. Index Terms —perfect hashing, list decoding, zero-error capac-ity

I. I

NTRODUCTION

Let 𝑏 , 𝑘 and 𝑛 be integers, with 𝑏 ≥ 𝑘 , and let C bea subset of { , , . . . , 𝑏 } 𝑛 with the property that for any 𝑘 distinct elements we can ﬁnd a coordinate where they all differ.Such a set can be interpreted, by looking at it coordinate-wise, as a family of 𝑛 hashing functions on some universe ofsize |C| . The required property then says that the family isa perfect hash family, that is, any 𝑘 elements in the universeare 𝑘 -partitioned by at least one function. Alternatively C canbe interpreted as a code of rate 𝑛 log |C| for communicationover a channel with 𝑏 inputs. Assume that the channels is a 𝑏 /( 𝑘 − ) channel, meaning that any 𝑘 − of the 𝑏 inputsshare one output but no 𝑘 distinct inputs do (see Figure 1).The required property for C is what is needed for the code tobe a zero-error code when list decoding with list-size 𝑘 − isallowed. We refer the reader to [8], [9], [13], [14] and [4] foran overview of the the more general context of this problem. InputOutput

Fig. 1. A / channel. Edges represent positive probabilities. Here, zero-errorcommunication is possible when decoding with list-size equal to . We will call any subset C of { , , . . . , 𝑏 } 𝑛 with the de-scribed property a ( 𝑏, 𝑘 ) -hash code. For the reasons mentionedabove, bounding the size of ( 𝑏, 𝑘 ) -hash codes is a combina-torial problem which has been of interest both in computerscience and information theory. It is known that ( 𝑏, 𝑘 ) -hashcodes of exponential size in 𝑛 can be constructed and thequantity of interest is usually the rate of such codes. We willthus study the quantity 𝑅 ( 𝑏,𝑘 ) = lim sup 𝑛 →∞ 𝑛 log |C 𝑛 | , (1)where the C 𝑛 are ( 𝑏, 𝑘 ) -hash codes of length 𝑛 with maximalrate. Note that, throughout, all logarithms are to base 2. Fewlower bounds on 𝑅 ( 𝑏,𝑘 ) are known. First results in this sensewere given by [9], [8] and a better bound was derived in [12]for ( 𝑏, 𝑘 ) = ( , ) . More recently, new lower bounds werederived in [16] for inﬁnitely many other values of 𝑘 . Theﬁrst, landmark result concerning upper bounds was obtainedby Fredman and Komlós [9], who showed that 𝑅 ( 𝑏,𝑘 ) ≤ 𝑏 𝑘 − 𝑏 𝑘 − log ( 𝑏 − 𝑘 + ) , (2)where 𝑏 𝑘 − = 𝑏 ( 𝑏 − ) · · · ( 𝑏 − 𝑘 + ) . Progresses have sincebeen rare. A generalization of the bound given in equation (2)was derived by Körner and Marton [12] in the form 𝑅 ( 𝑏,𝑘 ) ≤ min ≤ 𝑗 ≤ 𝑘 − 𝑏 𝑗 + 𝑏 𝑗 + log 𝑏 − 𝑗𝑘 − 𝑗 − . (3)This was further improved for different values of 𝑏 and 𝑘 by Arikan [3]. In the case 𝑏 = 𝑘 , an improvement was ﬁrstobtained for 𝑘 = in [2] and then in [6], [7]. It was provedonly recently in [10] that the Fredman-Komlós bound is nottight for any 𝑘 > ; explicit better values were given therefor 𝑘 = , , and for larger 𝑘 modulo a conjecture which isproved in [5], where further improvements are also obtainedfor 𝑘 = , .In this paper, we develop a new strategy to attack someof the cases which appear not to be optimally handled bythose methods, obtaining new bounds for 𝑏 = 𝑘 = , . . . , .Furthermore, we also show that our procedure improves onthe existing literature for some 𝑏 ≠ 𝑘 cases, among whichfor example ( 𝑏, 𝑘 ) = ( , ) , ( , ) , ( , ) , ( , ) . In orderto evaluate in a fair way these 𝑏 ≠ 𝑘 cases, we ﬁrst analyzethe results (not derived in the referenced papers) which areobtained when the methods of [6] and [5] are extended to 𝑏 ≠ 𝑘 , and compare them with the ones of [12], [3] and [10]. a r X i v : . [ c s . I T ] J a n he generalization of the procedure used in [6] is rathereasy and it provides us the following bound 𝑅 ( 𝑏,𝑘 ) ≤ (cid:32) 𝑏 + 𝑏 ( 𝑏 − 𝑏 + ) log 𝑏 − 𝑘 − (cid:33) − . (4)In Table I we give a comparison between the bounds (4) and(3), the bounds from [3] and [10] and the generalized boundfrom [5] for different values of 𝑏 and 𝑘 . The integers in theparentheses for the bound (3) represent the minimizing 𝑗 ; aparameter 𝑗 with the same role is involved in the other boundsand it will be discussed later. For the bounds of [5], [3] and[10] it is equal to 𝑘 − , while for the bound of [6] it is equalto .In Table II we compare our new bounds with the best knownbounds for 𝑏 = 𝑘 = , . . . , and for ( 𝑏, 𝑘 ) = ( , ) , ( , ) , ( , ) , ( , ) . TABLE IU

PPER BOUNDS ON 𝑅 ( 𝑏,𝑘 ) . A LL NUMBERS ARE ROUNDED UPWARDS . ( 𝑏, 𝑘 ) [5]* [6]* [3] [10] [12] ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ∗ The generalized bound for the ( 𝑏, 𝑘 ) caseTABLE IIU PPER BOUNDS ON 𝑅 ( 𝑏,𝑘 ) . A LL NUMBERS ARE ROUNDED UPWARDS . ( 𝑏, 𝑘 ) This work [5] [6] [3] [10] ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) ( , ) The paper is structured as follows. In the Section II wegive the general structure of the method used in the men-tioned recent series of works to ﬁnd upper bounds using the The interested reader will ﬁnd, upon inspection of the proof of Theorem3 in [6], that modulo using a hypergraph version of the Hansel Lemma, theonly new condition to check is that the upper bound given in (4) is greaterthan log 𝑏 − 𝑏 − for every 𝑏 ≥ 𝑘 ≥ . hypergraph version of the Hansel’s lemma. In Section III wepresent the main new ingredient of this paper, which is a wayto improve the bounds derived in [5] by means of a morecareful analysis of a quadratic form that was also objectiveof that study. In Section IV, we show how this idea can beeffectively implemented after an appropriate reduction of theproblem to a list of cases that can be studied exhaustively.II. S TRUCTURE OF THE G ENERAL M ETHOD

The best upper bounds on 𝑅 ( 𝑏,𝑘 ) available in the literaturecan all be seen as different applications of a central idea,which is the study of ( 𝑏, 𝑘 ) -hashing by comparison with acombinations of binary partitions. This main line of approachto the problem comes from the original work of Fredman andKómlos [9]. A clear and productive formulation of the ideawas given by Radhakrishnan in terms of Hansel’s lemma [15],which remained the main tool used in all recent results [7],[10] and [5]. We state the Lemma here and brieﬂy revise forthe reader convenience how this was applied in those works. Lemma 1 (Hansel for Hypergraphs [11], [14]):

Let 𝐾 𝑑𝑟 be a complete 𝑑 -uniform hypergraph on 𝑟 vertices and let 𝐺 , . . . , 𝐺 𝑚 be 𝑐 -partite 𝑑 -uniform hypergraphs on those samevertices such that ∪ 𝑖 𝐺 𝑖 = 𝐾 𝑑𝑟 . Let 𝜏 ( 𝐺 𝑖 ) be the number ofnon-isolated vertices in 𝐺 𝑖 . Then log 𝑐𝑑 − 𝑚 ∑︁ 𝑖 = 𝜏 ( 𝐺 𝑖 ) ≥ log 𝑟𝑑 − . (5)The application to ( 𝑏, 𝑘 ) -hashing relies on the followingobservation. Given a ( 𝑏, 𝑘 ) -hash code 𝐶 , ﬁx any 𝑗 elements 𝑥 , 𝑥 , . . . , 𝑥 𝑗 in 𝐶 , with 𝑗 = , . . . , 𝑘 − . For any coordinate 𝑖 let 𝐺 𝑥 ,...,𝑥 𝑗 𝑖 be the ( 𝑏 − 𝑗 ) -partite ( 𝑘 − 𝑗 ) -uniform hypergraphwith vertex set 𝐺 \ { 𝑥 , 𝑥 , . . . , 𝑥 𝑗 } and edge set 𝐸 = (cid:8) ( 𝑦 , . . . , 𝑦 𝑘 − 𝑗 ) : 𝑥 ,𝑖 , . . . , 𝑥 𝑗,𝑖 , 𝑦 ,𝑖 , . . . , 𝑦 𝑘 − 𝑗,𝑖 are all distinct (cid:9) . (6)Since 𝐶 is a ( 𝑏, 𝑘 ) -hash code, then (cid:208) 𝑖 𝐺 𝑥 ,...,𝑥 𝑗 𝑖 is the complete ( 𝑘 − 𝑗 ) -uniform hypergraph on 𝐺 \ { 𝑥 , 𝑥 , . . . , 𝑥 𝑗 } and so log 𝑏 − 𝑗𝑘 − 𝑗 − 𝑛 ∑︁ 𝑖 = 𝜏 ( 𝐺 𝑥 ,...,𝑥 𝑗 𝑖 ) ≥ log | 𝐶 | − 𝑗𝑘 − 𝑗 − . (7)This inequality allows one to upper bound | 𝐶 | by upperbounding the left hand side. Inequality (7) holds for any choiceof 𝑥 , 𝑥 , . . . , 𝑥 𝑗 , so the main goal is proving that the left handside is not too large for all possible choices of 𝑥 , 𝑥 , . . . , 𝑥 𝑗 .The choice can be deterministic or we can take the expectationover any random selection.Note that if the 𝑥 ,𝑖 , 𝑥 ,𝑖 , . . . , 𝑥 𝑗,𝑖 are not all distinct (let ussay that they “collide”) then the hypergraph in (6) is empty,that is the corresponding 𝜏 in the left hand side of (7) iszero. So, using codewords 𝑥 , 𝑥 , . . . , 𝑥 𝑗 which collide in manycoordinates helps in upper bounding |C| . On the other hand, ina coordinate 𝑖 where the codewords do not collide, 𝜏 ( 𝐺 𝑥 ,...,𝑥 𝑗 𝑖 ) depends on what a fraction of the code uses the remaining 𝑏 − 𝑗 symbols in the alphabet. This can be made small “onaverage” if 𝑥 , . . . , 𝑥 𝑗 are picked randomly. More precisely, let 𝑖 be probability distribution of the 𝑖 -th coordinate of 𝐶 , thatis, 𝑓 𝑖,𝑎 is the fraction of elements of 𝐶 whose 𝑖 -th coordinateis 𝑎 . Then, we have 𝜏 ( 𝐺 𝑥 ,...,𝑥 𝑗 𝑖 ) = (cid:40) 𝑥 , . . . , 𝑥 𝑗 collide in coordinate 𝑖 (cid:16) | 𝐶 || 𝐶 |− 𝑗 (cid:17) (cid:16) − (cid:205) 𝑗ℎ = 𝑓 𝑖,𝑥 ℎ𝑖 (cid:17) otherwise . (8)So, one can make the left hand side in (7) small by using 𝑥 , . . . , 𝑥 𝑗 which collide in many coordinates and at the sametime have in the remaining coordinates symbols 𝑥 ℎ𝑖 for whichthe 𝑓 𝑖,𝑥 ℎ𝑖 are not too small. This can be obtained “on average”if 𝑥 , . . . , 𝑥 𝑗 are picked in some random way over the code,since this will force values with large 𝑓 𝑖,𝑥 ℎ𝑖 to a appearfrequently as the 𝑖 -th coordinate in some of the 𝑥 , . . . , 𝑥 𝑗 .There are different ways to turn this into a precise agrumentto bound the right hand side of (7). We refer the reader to[5] for a detailed discussion, and we only discuss here theprocedure as used there, since it is the base for our currentcontribution.The idea is to partition the code C in subcodes C 𝜔 , 𝜔 ∈ Ω .The only requirement is that each subcode has size whichgrows unbounded with 𝑛 and uses in any of its ﬁrst ℓ coordinates only ( 𝑗 − ) symbols. It can be show, by an easyextension of the method used for the case 𝑏 = 𝑘 and 𝑗 = 𝑘 − in [5], that if the original code has rate 𝑅 , then for any 𝜖 > one can do this with a choice of ℓ = 𝑛 ( 𝑅 − 𝜖 )/ log (cid:16) 𝑏𝑗 − (cid:17) for 𝑛 large enough. Given such a partition of our code, if weselect codewords 𝑥 , . . . , 𝑥 𝑗 within the same subcode C 𝜔 , theywill collide in the ﬁrst ℓ coordinates and the correspondingcontribution to the l.h.s. of (7) will be zero. We then add therandomization. We pick randomly one of the subcodes C 𝜔 andrandomly select the codewords 𝑥 , . . . , 𝑥 𝑗 within C 𝜔 . We thenupper bound the expected value of the left hand side of (7)under this random selection to obtain an upper bound on |C| ,that is log | 𝐶 | − 𝑗𝑘 − 𝑗 − ≤ log 𝑏 − 𝑗𝑘 − 𝑗 − E 𝜔 ( E [ ∑︁ 𝑖 ∈[ ℓ + ,𝑛 ] 𝜏 ( 𝐺 𝑥 ,𝑥 ,...,𝑥 𝑗 𝑖 )| 𝜔 ]) = log 𝑏 − 𝑗𝑘 − 𝑗 − ∑︁ 𝑖 ∈[ ℓ + ,𝑛 ] E 𝜔 ( E [ 𝜏 ( 𝐺 𝑥 ,𝑥 ,...,𝑥 𝑗 𝑖 )| 𝜔 ]) . (9)Here, each subcode C 𝜔 is taken with probability 𝜆 𝜔 = |C 𝜔 |/|C| , and 𝑥 , . . . , 𝑥 𝑗 are taken uniformly at random (with-out repetitions) from C 𝜔 .As mentioned before, let 𝑓 𝑖 be the probability distribution ofthe 𝑖 -th coordinate of 𝐶 , and let instead 𝑓 𝑖 | 𝜔 be the distributionof the 𝑖 -th coordinate of the subcode 𝐶 𝜔 (with components,say, 𝑓 𝑖,𝑎 | 𝜔 ) . Then, for 𝑖 > ℓ , we can write E [ 𝜏 ( 𝐺 𝑥 ,...,𝑥 𝑗 𝑖 )| 𝜔 ] = ( + 𝑜 ( )) ∑︁ distinct 𝑎 ,...,𝑎 𝑗 𝑓 𝑖,𝑎 | 𝜔 𝑓 𝑖,𝑎 | 𝜔 · · · 𝑓 𝑖,𝑎 𝑗 | 𝜔 ( − 𝑓 𝑖,𝑎 − · · · − 𝑓 𝑖,𝑎 𝑗 ) (10) where the 𝑜 ( ) is meant as 𝑛 → ∞ and is due, under theassumption that 𝐶 𝜔 grows unbounded with 𝑛 , to samplingwithout replacement within 𝐶 𝜔 . Now, since 𝜆 𝜔 = |C 𝜔 |/|C| , 𝑓 𝑖 is actually the expectation of 𝑓 𝑖 | 𝜔 over the random 𝜔 , thatis, using a different dummy variable 𝜇 to index the subcodesfor convenience, 𝑓 𝑖 = ∑︁ 𝜇 𝜆 𝜇 𝑓 𝑖 | 𝜇 . Using this in (10), one notices that when taking furtherexpectation over 𝜔 it is possible to operate a symmetrizationin 𝜔 and 𝜇 . If we denote with Ψ for the polynomial functiondeﬁned for two probability distribution 𝑝 = ( 𝑝 , 𝑝 , . . . , 𝑝 𝑏 ) and 𝑞 = ( 𝑞 , 𝑞 , . . . , 𝑞 𝑏 ) as Ψ ( 𝑝, 𝑞 ) = ( 𝑏 − 𝑗 − ) ! (11) ∑︁ 𝜎 ∈ 𝑆 𝑏 𝑝 𝜎 ( ) 𝑝 𝜎 ( ) . . . 𝑝 𝜎 ( 𝑗 ) 𝑞 𝜎 ( 𝑗 + ) + 𝑞 𝜎 ( ) 𝑞 𝜎 ( ) . . . 𝑞 𝜎 ( 𝑗 ) 𝑝 𝜎 ( 𝑗 + ) . (12)Then the expectation of (10) over 𝜔 can be written as E [ 𝜏 ( 𝐺 𝑥 ,𝑥 ,...,𝑥 𝑗 𝑖 )] = ( + 𝑜 ( )) ∑︁ 𝜔,𝜇 ∈ Ω 𝜆 𝜔 𝜆 𝜇 Ψ ( 𝑓 𝑖 | 𝜔 , 𝑓 𝑖 | 𝜇 ) . (13)In [5], the global maximum of the function Ψ ( 𝑝, 𝑞 ) , overarbitrary distributions 𝑝 and 𝑞 , say Ψ max = max 𝑝,𝑞 Ψ ( 𝑝, 𝑞 ) , (14)was used to deduce the inequality, valid for any 𝑖 > ℓ , E [ 𝜏 ( 𝐺 𝑥 ,𝑥 ,...,𝑥 𝑗 𝑖 )] ≤ ( + 𝑜 ( )) Ψ max . (15)Then log | 𝐶 | ≤ ( + 𝑜 ( )) ( 𝑛 − ℓ ) Ψ max log 𝑏 − 𝑗𝑘 − 𝑗 − , (16)from which, using the value of ℓ described above, one deduces 𝑅 ≤ ( + 𝑜 ( ))  − 𝑅 log (cid:16) 𝑏𝑗 − (cid:17)  Ψ max log 𝑏 − 𝑗𝑘 − 𝑗 − . This gives the explicit bound 𝑅 ( 𝑏,𝑘 ) ≤ Ψ max log 𝑏 − 𝑗𝑘 − 𝑗 − + (cid:16) 𝑏𝑗 − (cid:17) . (17)A weakness in this bound comes from the fact that distri-butions 𝑝 and 𝑞 that maximize Ψ ( 𝑝, 𝑞 ) could exhibit someopposing asymmetries, in the sense that they give higherprobabilities to different symbols. When used as a replacementfor each of the pairs of 𝑓 𝑖 | 𝜔 and 𝑓 𝑖 | 𝜇 in (13), we have arather conservative bound, because pairs ( 𝑝, 𝑞 ) which givehigh values for Ψ ( 𝑝, 𝑞 ) will give low values for Ψ ( 𝑝 ; 𝑝 ) and Ψ ( 𝑞 ; 𝑞 ) , and equation (13) contains a weighted contributionfrom all pairings of 𝑓 𝑖 | 𝜔 and 𝑓 𝑖 | 𝜇 . In other words, observedthat (13) is a quadratic form in the distribution 𝜆 with kernel ( 𝑝, 𝑞 ) , if the kernel has maximum value Ψ max in some off-diagonal ( 𝑝, 𝑞 ) -positions to which there correspond small “in-diagonal” values at ( 𝑝, 𝑝 ) and ( 𝑞, 𝑞 ) , then using Ψ max as abound for the whole quadratic form can be quite a conservativeapproach.In this paper, we approach (13) more carefully by clusteringthe possible distributions 𝑓 𝑖 | 𝜔 in different groups dependingon how balanced or unbalanced they are, and bounding Ψ ( 𝑓 𝑖 | 𝜔 , 𝑓 𝑖 | 𝜇 ) for 𝑓 𝑖 | 𝜔 and 𝑓 𝑖 | 𝜇 in those different groups. Fromthis, we deduce a bound on the quadratic form. Note thatsince in the problem under consideration (that is, as 𝑛 → ∞ )we have no limit in the granularity of the distributions 𝑓 𝑖,𝜔 ,the quadratic form that we have to bound might in principlehave a limiting value which is only achieved with a continuousdistribution 𝜆 over the simplex of 𝑏 -dimensional distributions P 𝑏 . Still, once we consider a ﬁnite number of clusters 𝑟 forthe distributions 𝑓 𝑖 | 𝜔 , our quadratic form is upper boundedby a corresponding 𝑟 -dimensional one. In our derivation, wewill use 𝑏 + clusters with some symmetric structure whichallows us to further reduce the complexity to an equivalentfour dimensional form and then to a quadratics in one singlevariable. III. B OUNDING THE QUADRATIC FORM

Based on the discussion in the previous Section, we nowenter the problem of determining better upper bounds on theright hand side of (13). We simplify here the notation andconsider the quadratic form ∑︁ 𝑝,𝑞 𝜆 𝑝 𝜆 𝑞 Ψ ( 𝑝, 𝑞 ) (18)where 𝑝 and 𝑞 run over an arbitrary ﬁnite set of points in thesimplex P 𝑏 of 𝑏 -dimensional probability distribution and 𝜆 isa probability distribution over such set. We consider partitionsof P 𝑏 in disjoint subsets to ﬁnd upper bounds on the quadraticform (18) in terms of simpler ones. If we have a partition {P 𝑏 , P 𝑏 , . . . , P 𝑟𝑏 } of P 𝑏 and we deﬁne 𝑚 𝑖,ℎ = sup 𝑝 ∈P 𝑖𝑏 ,𝑞 ∈P ℎ𝑏 Ψ ( 𝑝, 𝑞 ) , 𝜂 𝑖 = ∑︁ 𝑝 ∈P 𝑖𝑏 𝜆 𝑝 , then clearly ∑︁ 𝑝,𝑞 𝜆 𝑝 𝜆 𝑞 Ψ ( 𝑝, 𝑞 ) ≤ ∑︁ 𝑖,ℎ ∑︁ 𝑝 ∈P 𝑖𝑏 ∑︁ 𝑞 ∈P ℎ𝑏 𝜆 𝑝 𝜆 𝑞 𝑚 𝑖,ℎ ≤ ∑︁ 𝑖,ℎ 𝜂 𝑖 𝜂 ℎ 𝑚 𝑖,ℎ . (19)This is a convenient simpliﬁcation since we have now an 𝑟 -dimensional problem which we might be able to deal with insome computationally feasible way. We will use this procedurewith two different partitions in terms of how balanced or un-balanced the distributions are. We take 𝑏 + subsets with somesymmetry which allows us to further reduce the complexity. Partition based on maximum value.

We ﬁrst consider apartition of P 𝑏 in terms of the largest probability value whichappears in a distribution. We use a parameter 𝜖 < /( 𝑏 − ) ; allquantities will depend on 𝜖 but we do not write this in order to avoid cluttering the notation. We deﬁne 𝑏 sets of unbalanceddistributions q P 𝑖𝑏 = { 𝑝 ∈ P 𝑏 : 𝑝 𝑖 > − 𝜖 } for every ≤ 𝑖 ≤ 𝑏 , and correspondingly a set of balanceddistributions q P 𝑏 = { 𝑝 ∈ P 𝑏 : 𝑝 𝑖 ≤ − 𝜖 ∀ 𝑖 } . Note that these are all disjoint sets since 𝜖 < /( 𝑏 − ) .Following the scheme mentioned above, we can consider thevalues 𝑚 𝑖,ℎ and 𝜂 𝑖 for this speciﬁc partition. However, due tosymmetry, the values 𝑚 𝑖,ℎ can be reduced to only four cases,depending on whether 𝑝 and 𝑞 are both balanced, one balancedand one unbalanced, or both unbalanced, either on the samecoordinate or on different coordinates.Assuming ≤ 𝑖, ℎ ≤ 𝑏 with 𝑖 ≠ ℎ , the following quantitiesare then well deﬁned and independent of the speciﬁc valueschosen for 𝑖 and ℎ q 𝑀 = sup 𝑝,𝑞 ∈ q P 𝑏 Ψ ( 𝑝, 𝑞 ) q 𝑀 = sup 𝑝 ∈ q P 𝑏 ,𝑞 ∈ q P 𝑖𝑏 Ψ ( 𝑝, 𝑞 ) q 𝑀 = sup 𝑝,𝑞 ∈ q P 𝑖𝑏 Ψ ( 𝑝, 𝑞 ) q 𝑀 = sup 𝑝 ∈ q P 𝑖𝑏 ,𝑞 ∈ q P ℎ𝑏 Ψ ( 𝑝, 𝑞 ) (20)These values can then be used in (19) in place of the values 𝑚 𝑖,ℎ . Partition based on the minimum value.

We also considera partition of P 𝑏 using constraints from below. Again we usea parameter 𝜖 which will be then tuned. We assume here 𝜖 < / 𝑏 . Consider now the following disjoint sets of unbalanceddistributions (cid:98) P 𝑖𝑏 = { 𝑝 ∈ P 𝑏 : 𝑝 𝑖 < 𝜖 , 𝑝 ℎ ≥ 𝑝 𝑖 ∀ ℎ , 𝑝 ℎ > 𝑝 𝑖 ∀ ℎ < 𝑖 } for ≤ 𝑖 ≤ 𝑏 , that is, distributions in (cid:98) P 𝑖𝑏 have a minimumcomponent in the 𝑖 -th coordinate, which is smaller than 𝜖 , andstrictly smaller than any of the preceding components (unlessof course 𝑖 = ). Correspondingly, deﬁne a set of balanceddistributions as (cid:98) P 𝑏 = { 𝑝 ∈ P 𝑏 : 𝑝 𝑖 ≥ 𝜖 ∀ 𝑖 } . The symmetry argument mentioned before also applies in thiscase and we can continue in analogy replacing the 𝑚 𝑖,ℎ of(19) with the following quantities (cid:98) 𝑀 = sup 𝑝,𝑞 ∈ (cid:98) P 𝑏 Ψ ( 𝑝, 𝑞 ) (cid:98) 𝑀 = sup 𝑝 ∈ (cid:98) P 𝑏 ,𝑞 ∈ (cid:98) P 𝑖𝑏 Ψ ( 𝑝, 𝑞 ) (cid:98) 𝑀 = sup 𝑝,𝑞 ∈ (cid:98) P 𝑖𝑏 Ψ ( 𝑝, 𝑞 ) (cid:98) 𝑀 = sup 𝑝 ∈ (cid:98) P 𝑖𝑏 ,𝑞 ∈ (cid:98) P ℎ𝑏 Ψ ( 𝑝, 𝑞 ) (21)where again ≤ 𝑖, ℎ ≤ 𝑏 with 𝑖 ≠ ℎ .Applying the above scheme with the symmetric partitionswe just deﬁned, we can now rewrite the upper bound ofequation (19) in the form ∑︁ 𝑝,𝑞 𝜆 𝑝 𝜆 𝑞 Ψ ( 𝑝, 𝑞 )≤ 𝜂 𝑀 + 𝜂 ∑︁ 𝑖> 𝜂 𝑖 𝑀 + ∑︁ 𝑖> 𝜂 𝑖 𝑀 + ∑︁ <𝑖<ℎ 𝜂 𝑖 𝜂 ℎ 𝑀 . (22)all 𝑀 be the maximum value achieved by the right handside of (22) over all possible probability distributions 𝜂 = 𝜂 , 𝜂 , . . . , 𝜂 𝑏 (which will of course depend on whether we usethe (cid:98) 𝑀 𝑖 ’s or q 𝑀 𝑖 ’s values in place of the 𝑀 𝑖 ’s). The optimizationof (22), once known the 𝑀 𝑖 ’s values, is easy using the standardlagrange multipliers method (or see Lemma 2 of [17]). Thenwe can then replace Ψ max in (17) with 𝑀 to derive the bound 𝑅 ( 𝑏,𝑘 ) ≤ 𝑀 log 𝑏 − 𝑗𝑘 − 𝑗 − + (cid:16) 𝑏𝑗 − (cid:17) . We will describe in the next Section our procedure to deter-mine, or upper bound the values (cid:98) 𝑀 𝑖 , q 𝑀 𝑖 and the corresponding 𝑀 . Here we only state the obtained results.Using the partition based on the maximum value { q P 𝑖𝑏 } 𝑖 = ,...,𝑏 we obtain the following theorem. Theorem 1:

We have 𝑅 ( , ) ≤ . , 𝑅 ( , ) ≤ . , 𝑅 ( , ) ≤ . ,𝑅 ( , ) ≤ . , 𝑅 ( , ) ≤ . . Using the partition based on the minimum value { (cid:98) P 𝑖𝑏 } 𝑖 = ,...,𝑏 we obtain the following theorem. Theorem 2:

We have 𝑅 ( , ) ≤ . , 𝑅 ( , ) ≤ . ,𝑅 ( , ) ≤ ≈ . . Based on the results in [7], on its generalization given inequation (4) and on Theorem 2 when ( 𝑏, 𝑘 ) = ( , ) , we areled to formulate the following conjecture. Conjecture 1:

For 𝑏 ≥ 𝑘 > , 𝑅 ( 𝑏,𝑘 ) ≤ min ≤ 𝑗 ≤ 𝑘 − (cid:169)(cid:173)(cid:171) 𝑏𝑗 − + 𝑏 𝑗 + 𝑏 𝑗 + log 𝑏 − 𝑗𝑘 − 𝑗 − (cid:170)(cid:174)(cid:172) − . Note that the conjectured expression can be seen as a modi-ﬁcation of the Körner-Marton bound in (3) which takes intoaccount the effects of preﬁx-based partitions.IV. C

OMPUTATION OF 𝑀 Thanks to a straightforward generalization of some lemmasdeﬁned and proved in [17], we have determined and inspectedusing Mathematica all the possible maximum points (see theAppendices in [17]) in which each q 𝑀 𝑖 (or (cid:98) 𝑀 𝑖 ) can be attained,obtaining the following propositions. Proposition 1:

For 𝑗 = 𝑘 − , we have that ( 𝑏, 𝑘 ) 𝜖 | 𝑀 | 𝑀 | 𝑀 | 𝑀 ( , ) / ( , ) / ( , ) / ( , ) / . · − . · − ( , ) / . · − . · − | 𝑀 attained at ( 𝑏 , . . . , 𝑏 ; 𝑏 , . . . , 𝑏 ) | 𝑀 attained at ( , , . . . ,

0; 0 , 𝑏 − , . . . , 𝑏 − ) | 𝑀 attained at ( − 𝜖 , 𝜖𝑏 − , . . . , 𝜖𝑏 − ; 1 − 𝜖 , 𝜖𝑏 − , . . . , 𝜖𝑏 − ) | 𝑀 attained at ( − 𝜖 , 𝜖𝑏 − , . . . , 𝜖𝑏 − ,

0; 0 , 𝜖𝑏 − , . . . , 𝜖𝑏 − , − 𝜖 ) Proposition 2:

For 𝑗 = , ( 𝑏, 𝑘 ) = ( , ) and 𝜖 = ( + √ ) we have that (cid:99) 𝑀 𝑖 Attained at point ( 𝑝 ; 𝑞 ) Values ≈ (cid:99) 𝑀 ( 𝜖 , − 𝜖𝑏 − , . . . , − 𝜖𝑏 − ; 𝛾, 𝛿, . . . , 𝛿 ) , 𝛿 ≈ . (cid:99) 𝑀 ( , 𝑏 − , . . . , 𝑏 − ; 𝛾, 𝛿, . . . , 𝛿 ) , 𝛿 = 𝜖 (cid:99) 𝑀 ( 𝜖 , − 𝜖𝑏 − , . . . , − 𝜖𝑏 − , 𝜖 , 𝛼, . . . , 𝛼, 𝛽 ) , 𝛽 ≈ . (cid:99) 𝑀 ( , 𝑏 − , . . . , 𝑏 − ; 𝛾, 𝛿, . . . , 𝛿 ) , 𝛿 = 𝜖 For 𝑗 = , ( 𝑏, 𝑘 ) = ( , ) and 𝜖 = we have that (cid:99) 𝑀 𝑖 Attained at point ( 𝑝 ; 𝑞 ) Values ≈ (cid:99) 𝑀 ( 𝜖 , − 𝜖𝑏 − , . . . , − 𝜖𝑏 − ; 𝛾, 𝛿, . . . , 𝛿 ) , 𝛿 ≈ . (cid:99) 𝑀 ( , 𝑏 − , . . . , 𝑏 − ; 𝛾, 𝛿, . . . , 𝛿 ) , 𝛿 ≈ . (cid:99) 𝑀 ( 𝜖 , − 𝜖𝑏 − , . . . , − 𝜖𝑏 − , 𝜖 , 𝛼, . . . , 𝛼, 𝛽 ) , 𝛽 ≈ . (cid:99) 𝑀 ( , 𝑏 − , . . . , 𝑏 − ; 𝛾, 𝛿, . . . , 𝛿 ) , 𝛿 ≈ . For 𝑗 = , ( 𝑏, 𝑘 ) = ( , ) and 𝜖 = we have that (cid:99) 𝑀 𝑖 Attained at point ( 𝑝 ; 𝑞 ) Values ≈ (cid:99) 𝑀 ( 𝑏 , . . . , 𝑏 ; 𝑏 , . . . , 𝑏 ) (cid:99) 𝑀 ( 𝜖 , − 𝜖𝑏 − , . . . , − 𝜖𝑏 − ; 𝛾, 𝛿, . . . , 𝛿 ) , 𝛿 ≈ . (cid:99) 𝑀 ( 𝜖 , , − 𝜖𝑏 − , . . . , − 𝜖𝑏 − ; 0 , , , . . . , ) (cid:99) 𝑀 ( , , . . . ,

0; 0 , 𝑏 − , . . . , 𝑏 − ) . The values reported for (cid:98) 𝑀 are not approximate values of theexact values of (cid:98) 𝑀 but, instead, they are upper bounds. Remark 1:

We point out that the value (cid:98) 𝑀 for ( 𝑏, 𝑘 ) = ( , ) is only attained for uniform distributions.As a consequence of Propositions 1, 2 and equation (22)we are able to evaluate the values of 𝑀 for both the partitions { q 𝑃 𝑖𝑏 } 𝑖 = ,...,𝑏 and { (cid:98) 𝑃 𝑖𝑏 } 𝑖 = ,...,𝑏 . Then we state the followingtheorem Theorem 3:

Using the partition { q 𝑃 𝑖𝑏 } 𝑖 = ,...,𝑏 we get • for ( 𝑏, 𝑘 ) = ( , ) we have that 𝑀 ≈ . ; • for ( 𝑏, 𝑘 ) = ( , ) we have that 𝑀 ≈ . ; • for ( 𝑏, 𝑘 ) = ( , ) we have that 𝑀 ≈ . . • for ( 𝑏, 𝑘 ) = ( , ) we have that 𝑀 ≈ . . • for ( 𝑏, 𝑘 ) = ( , ) we have that 𝑀 ≈ . .Using the partition { (cid:98) 𝑃 𝑖𝑏 } 𝑖 = ,...,𝑏 we get • for ( 𝑏, 𝑘 ) = ( , ) we have that 𝑀 ≈ . ; • for ( 𝑏, 𝑘 ) = ( , ) we have that 𝑀 ≈ . ; • for ( 𝑏, 𝑘 ) = ( , ) we have that 𝑀 = ≈ . .For the values of ( 𝑏, 𝑘 ) reported in Table I except thecases in which 𝑘 = , 𝑏 = 𝑘 = , , and ( 𝑏, 𝑘 ) = ( , ) , ( , ) , ( , ) , it is interesting to note that the bounds in bold(the generalized bounds [5] or [6]) are achieved for uniformdistributions. This means that, for these particular cases, anynew upper bounds that can be found on the quadratic form inequation (13) cannot further improve those bounds. However,for such globally balanced codes, one can use a differentargument based on the minimum distance of the code to geteven stronger upper bounds. A proof that 𝑅 ( , ) < / , basedon the Aaltonen bound [1], can be found in [17]. EFERENCES[1] M. Aaltonen.

A new upper bound on nonbinary block codes , DiscreteMath. vol 83, 139-160, 1990.[2] E. Arikan, An upper bound on the zero-error list-coding capacity,

IEEETransactions on Information Theory (1994), 1237–1240.[3] E. Arikan, An improved graph-entropy bound for perfect hashing, IEEEInternational Symposium on Information Theory (1994).[4] S. Bhandari and J. Radhakrishnan, Bounds on the Zero-Error List-Decoding Capacity of the q/(q-1) Channel, .[5] S. Costa, M. Dalai.

New bounds for perfect 𝑘 -hashing, in press onDiscrete Applied Mathematics, 2020 .[6] M. Dalai, V. Guruswami, and J. Radhakrishnan, An improved bound onthe zero-error listdecoding capacity of the 4/3 channel, IEEE InternationalSymposium on Information Theory (ISIT) (2017), 1658–1662.[7] M. Dalai, V. Guruswami, and J. Radhakrishnan, An improved boundon the zero-error listdecoding capacity of the 4/3 channel, in

IEEETransactions on Information Theory , vol. 66, no. 2, pp. 749-756, Feb.2020[8] P. Elias, Zero error capacity under list decoding,

IEEE Transactions onInformation Theory (1988), 1070–1074. [9] Michael L. Fredman and János Komlós, On the Size of SeparatingSystems and Families of Perfect Hash Functions, SIAM Journal onAlgebraic Discrete Methods (1984), 61–68.[10] V. Guruswami, A. Riazanov, Beating Fredman-Komlos for perfect 𝑘 -hashing, Leibniz International Proceedings in Informatics (2019).[11] G. Hansel, Nombre minimal de contacts de fermature nécessaires pourréaliser une fonction booléenne symétrique de 𝑛 variables, C. R. Acad.Sci. Paris , pp. 6037–6040, 1964.[12] J. Korner and K. Marton, New Bounds for Perfect Hashing via Infor-mation Theory,

European Journal of Combinatorics (1988), 523–530.[13] J. Korner, Fredman–Komlós bounds and information theory, SIAMJournal on Algebraic Discrete Methods (1986), 560–570.[14] A. Nilli, “Perfect hashing and probability,” Combinatorics, Probabilityand Computing arXiv preprint arXiv:1908.08792 , (2019).[17] S. Della Fiore, S. Costa and M. Dalai, Further strengthening of upperbounds for perfect 𝑘 -Hashing, arXiv preprint arXiv:2012.00620arXiv preprint arXiv:2012.00620