Roman Dementiev
Max Planck Society
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Roman Dementiev.
ACM Journal of Experimental Algorithms | 2008
Roman Dementiev; Juha Kärkkäinen; Jens Mehnert; Peter Sanders
Suffix arrays are a simple and powerful data structure for text processing that can be used for full text indexes, data compression, and many other applications, in particular, in bioinformatics. However, so far, it has appeared prohibitive to build suffix arrays for huge inputs that do not fit into main memory. This paper presents design, analysis, implementation, and experimental evaluation of several new and improved algorithms for suffix array construction. The algorithms are asymptotically optimal in the worst case or on average. Our implementation can construct suffix arrays for inputs of up to 4-GB in hours on a low-cost machine. As a tool of possible independent interest, we present a systematic way to design, analyze, and implement pipelined algorithms.
acm symposium on parallel algorithms and architectures | 2003
Roman Dementiev; Peter Sanders
We develop an algorithm for parallel disk sorting, whose I/O cost approaches the lower bound and that guarantees almost perfect overlap between I/O and computation. Previous algorithms have either suboptimal I/O volume or cannot guarantee that I/O and computations can always be overlapped. We give an efficient implementation that can (at least) compete with the best practical implementations but gives additional performance guarantees. For the experiments we have configured a state of the art machine that can sustain full bandwidth I/O with eight disks and is very cost effective.
symposium on discrete algorithms | 2006
Deepak Ajwani; Roman Dementiev; Ulrich Meyer
Breadth First Search (BFS) traversal is an archetype for many important graph problems. However, computing a BFS level decomposition for massive graphs was considered nonviable so far, because of the large number of I/Os it incurs. This paper presents the first experimental evaluation of recent external-memory BFS algorithms for general graphs. With our STXXL based implementations exploiting pipelining and disk-parallelism, we were able to compute the BFS level decomposition of a web-crawl based graph of around 130 million nodes and 1.4 billion edges in less than 4 hours using single disk and 2.3 hours using 4 disks. We demonstrate that some rather simple external-memory algorithms perform significantly better (minutes as compared to hours) than internal-memory BFS, even if more than half of the input resides internally.
Untitled Event | 2004
Roman Dementiev; Peter Sanders; Dominik Schultes; Jop F. Sibeyn
We develop an external memory algorithm for computing minimum spanning trees. The algorithm is considerably simpler than previously known external memory algorithms for this problem and needs a factor of at least four less I/Os for realistic inputs.
european symposium on algorithms | 2005
Roman Dementiev; Lutz Kettner; Peter Sanders
We present a software library Stxxl, that enables practice-oriented experimentation with huge data sets. Stxxl is an implementation of the C++ standard template library STL for external memory computations. It supports parallel disks, overlapping between I/O and computation, and pipelining technique that can save more than half of the I/Os. Stxxl has already been used for computing minimum spanning trees, connected components, breadth-first search decompositions, constructing suffix arrays, and computing social network analysis metrics.
international parallel and distributed processing symposium | 2009
Andreas Beckmann; Roman Dementiev; Johannes Singlery
Large and fast hard disks for little money have enabled the processing of huge amounts of data on a single machine. For this purpose, the well-established STXXL library provides a framework for external memory algorithms with an easy-to-use interface. However, the clock speed of processors cannot keep up with the increasing bandwidth of parallel disks, making many algorithms actually compute-bound.
Untitled Event | 2005
Roman Dementiev; Lutz Kettner; Peter Sanders; Gerth Stølting Brodal; Stefano Leonardi
Untitled Event | 2006
Deepak Ajwani; Roman Dementiev; Ulrich Meyer
Untitled Event | 2005
Roman Dementiev; Juha Kärkkäinen; Jens Mehnert; Peter Sanders; Camil Demetrescu; Robert Sedgewick; Roberto Tamassia
IFIP TCS | 2004
Roman Dementiev; Peter Sanders; Dominik Schultes; Jop F. Sibeyn