Ariel Shiftan
Bar-Ilan University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Ariel Shiftan.
Information & Computation | 2011
Guy Feigenblat; Ely Porat; Ariel Shiftan
In this paper we extend the notion of min-wise independent family of hash functions by defining a k-min-wise independent family of hash functions. Informally, under this definition, all subsets of size k of any fixed set X have an equal chance to have the minimal hash values among all the elements in X, when the probability is over the random choice of hash function from the family. This property measures the randomness of the family as choosing a truly random function, obviously, satisfies the definition for k = |X|. We define and give an efficient time and space construction of approximately k-min-wise independent family of hash functions by extending Indyks construction of approximately min-wise independent [1]. The number of words needed to represent each function is O(k log log(1/ε) + log(1/ε)), which is only suboptimal by a factor of O(log log(1/ε)), where ε ∈ (0, 1) is the desired error bound. This construction is the first applicable for sampling bottom-k sketches [2, 3] out of the universe. In addition, we introduce a general and novel technique that utilizes our construction, and can be used to improve many min-wise based algorithms, such as [4, 5, 6, 7, 3, 2, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19]. As an example we show how to apply it for similarity estimation over data streams, and reduce exponentially the run time of the current known result [5]. In addition, we also discuss improvements of known algorithms for estimating rarity and entropy of random walk over graphs (from SODA07 [20]).
combinatorial pattern matching | 2014
Guy Feigenblat; Ely Porat; Ariel Shiftan
In this work, we focus on building an efficient succinct dynamic dictionary that significantly improves the query time of the current best known results. The algorithm that we propose suffers from only a O((loglogn)2 ) multiplicative slowdown in its query time and a \(O(\frac{1}{\epsilon} \log n)\) slowdown for insertion and deletion operations, where n is the sum of all of the patterns’ lengths, the size of the alphabet is polylog(n) and e ∈ (0,1). For general alphabet the query time is O((loglogn) logσ), where σ is the size of the alphabet.
foundations of software technology and theoretical computer science | 2012
Guy Feigenblat; Ely Porat; Ariel Shiftan
In this paper we introduce a general framework that exponentially improves the space, the degree of independence, and the time needed by min-wise based algorithms. The authors, in SODA 2011, we introduced an exponential time improvement for min-wise based algorithms by defining and constructing an almost k-min-wise independent family of hash functions. Here we develop an alternative approach that achieves both exponential time and exponential space improvement. The new approach relaxes the need for approximately min-wise hash functions, hence gets around the Omega(log(1/epsilon)) independence lower bound in [Patrascu 2010]. This is done by defining and constructing a d-k-min-wise independent family of hash functions. Surprisingly, for most cases only 8-wise independence is needed for the additional improvement. Moreover, as the degree of independence is a small constant, our function can be implemented efficiently. Informally, under this definition, all subsets of size d of any fixed set X have an equal probability to have hash values among the minimal k values in X, where the probability is over the random choice of hash function from the family. This property measures the randomness of the family, as choosing a truly random function, obviously, satisfies the definition for d=k=|X|. We define and give an efficient time and space construction of approximately d-k-min-wise independent family of hash functions for the case where d=2, as this is sufficient for the additional exponential improvement. We discuss how this construction can be used to improve many min-wise based algorithms. To our knowledge such definitions, for hash functions, were never studied and no construction was given before. As an example we show how to apply it for similarity and rarity estimation over data streams. Other min-wise based algorithms, can be adjusted in the same way.
Algorithmica | 2017
Guy Feigenblat; Ely Porat; Ariel Shiftan
In this work, we focus on building an efficient succinct dynamic dictionary that significantly improves the query time of the current best known results. The algorithm that we propose suffers from only a
Theoretical Computer Science | 2016
Ariel Shiftan; Ely Porat
arXiv: Data Structures and Algorithms | 2011
Guy Feigenblat; Ely Porat; Ariel Shiftan
O((\log \log n)^2 )
Archive | 2011
Guy Feigenblat; Ely Porat; Ariel Shiftan
string processing and information retrieval | 2009
Ariel Shiftan; Ely Porat
O((loglogn)2) multiplicative slowdown in its query time and a
data compression conference | 2016
Guy Feigenblat; Ely Porat; Ariel Shiftan
Journal of Computer and System Sciences | 2017
Guy Feigenblat; Ely Porat; Ariel Shiftan
O(\frac{1}{\epsilon } \log n)