Søren Juhl Vind
Technical University of Denmark
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Søren Juhl Vind.
workshop on algorithms and data structures | 2013
Philip Bille; Patrick Hagge Cording; Inge Li Gørtz; Benjamin Sach; Hjalte Wedel Vildhøj; Søren Juhl Vind
The Karp-Rabin fingerprint of a string is a type of hash value that due to its strong properties has been used in many string algorithms. In this paper we show how to construct a data structure for a string S of size N compressed by a context-free grammar of size n that answers fingerprint queries. That is, given indices i and j, the answer to a query is the fingerprint of the substring S[i,j]. We present the first O(n) space data structures that answer fingerprint queries without decompressing any characters. For Straight Line Programs (SLP) we get O(logN) query time, and for Linear SLPs (an SLP derivative that captures LZ78 compression and its variations) we get O(loglogN) query time. Hence, our data structures has the same time and space complexity as for random access in SLPs. We utilize the fingerprint data structures to solve the longest common extension problem in query time O(logNlogl) and O(loglloglogl+loglogN) for SLPs and Linear SLPs, respectively. Here, l denotes the length of the LCE.
international symposium on algorithms and computation | 2016
Philip Bille; Patrick Hagge Cording; Inge Li Gørtz; Frederik Rye Skjoldjensen; Hjalte Wedel Vildhøj; Søren Juhl Vind
Given a static reference string R and a source string S, a relative compression of S with respect to R is an encoding of S as a sequence of references to substrings of R. Relative compression schemes are a classic model of compression and have recently proved very successful for compressing highly-repetitive massive data sets such as genomes and web-data. We initiate the study of relative compression in a dynamic setting where the compressed source string S is subject to edit operations. The goal is to maintain the compressed representation compactly, while supporting edits and allowing efficient random access to the (uncompressed) source string. We present new data structures that achieve optimal time for updates and queries while using space linear in the size of the optimal relative compression, for nearly all combinations of parameters. We also present solutions for restricted and extended sets of updates. To achieve these results, we revisit the dynamic partial sums problem and the substring concatenation problem. We present new optimal or near optimal bounds for these problems. Plugging in our new results we also immediately obtain new bounds for the string indexing for patterns with wildcards problem and the dynamic text and static pattern matching problem.
language and automata theory and applications | 2015
Philip Bille; Inge Li Gørtz; Søren Juhl Vind
We study the orthogonal range searching problem on points that have a significant number of geometric repetitions, that is, subsets of points that are identical under translation. Such repetitions occur in scenarios such as image compression, GIS applications and in compactly representing sparse matrices and web graphs. Our contribution is twofold. First, we show how to compress geometric repetitions that may appear in standard range searching data structures (such as K-D trees, Quad trees, Range trees, R-trees, Priority R-trees, and K-D-B trees), and how to implement subsequent range queries on the compressed representation with only a constant factor overhead. Secondly, we present a compression scheme that efficiently identifies geometric repetitions in point sets, and produces a hierarchical clustering of the point sets, which combined with the first result leads to a compressed representation that supports range searching.
scandinavian workshop on algorithm theory | 2014
Roberto Grossi; Søren Juhl Vind
In colored range searching, we are given a set of n colored points in d ≥ 2 dimensions to store, and want to support orthogonal range queries taking colors into account. In the colored range counting problem, a query must report the number of distinct colors found in the query range, while an answer to the colored range reporting problem must report the distinct colors in the query range.
international symposium on multimedia | 2014
Søren Juhl Vind; Philip Bille; Inge Li Gørtz
We show how to compactly index video data to support fast motion detection queries. A query specifies a time interval T, a area A in the video and two thresholds v and p. The answer to a query is a list of timestamps in T where = p% of A has changed by = v values. Our results show that by building a small index, we can support queries with a speedup of two to three orders of magnitude compared to motion detection without an index. For high resolution video, the index size is about 20% of the compressed video size.
scandinavian workshop on algorithm theory | 2012
Philip Bille; Inge Li Gørtz; Hjalte Wedel Vildhøj; Søren Juhl Vind
Theory of Computing Systems \/ Mathematical Systems Theory | 2014
Philip Bille; Inge Li Gørtz; Hjalte Wedel Vildhøj; Søren Juhl Vind
Theoretical Computer Science | 2017
Roberto Grossi; Giulia Menconi; Nadia Pisanti; Roberto Trani; Søren Juhl Vind
foundations of software technology and theoretical computer science | 2014
Roberto Grossi; Giulia Menconi; Nadia Pisanti; Roberto Trani; Søren Juhl Vind
IMM-B.Sc.-2009-21 | 2009
Søren Juhl Vind; Hjalte Wedel Vildhøj