Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Nicola Prezza is active.

Publication


Featured researches published by Nicola Prezza.


combinatorial pattern matching | 2015

Composite Repetition-Aware Data Structures

Djamal Belazzougui; Fabio Cunial; Travis Gagie; Nicola Prezza; Mathieu Raffinot

In highly repetitive strings, like collections of genomes from the same species, distinct measures of repetition all grow sublinearly in the length of the text, and indexes targeted to such strings typically depend only on one of these measures. We describe two data structures whose size depends on multiple measures of repetition at once, and that provide competitive tradeoffs between the time for counting and reporting all the exact occurrences of a pattern, and the space taken by the structure. The key component of our constructions is the run-length encoded BWT (RLBWT), which takes space proportional to the number of BWT runs: rather than augmenting RLBWT with suffix array samples, we combine it with data structures from LZ77 indexes, which take space proportional to the number of LZ77 factors, and with the compact directed acyclic word graph (CDAWG), which takes space proportional to the number of extensions of maximal repeats. The combination of CDAWG and RLBWT enables also a new representation of the suffix tree, whose size depends again on the number of extensions of maximal repeats, and that is powerful enough to support matching statistics and constant-space traversal.


Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine | 2012

ERNE-BS5: aligning BS-treated sequences by multiple hits on a 5-letters alphabet

Nicola Prezza; Cristian Del Fabbro; Francesco Vezzi; Emanuale De Paoli; Alberto Policriti

Cytosine methylation is a DNA modification that has great impact on the regulation of gene expression and important implications for the biology and health of several living beings, including humans. Bisulfite conversion followed by next generation sequencing (BS-seq) of DNA is the gold standard technique used to detect DNA methylation at single-base resolution on a genome scale through the identification of 5-methylcytosine (5-mC). However, by converting unmethylated cytosines into thymines, BS-seq poses computational challenges to read alignment and aggravates the issue of multiple hits due to the ambiguity raised by the reduced sequence complexity. Here we present ERNE-BS5 (Extended Randomized Numerical alignEr - BiSulfite 5), an aligning program developed to efficiently map BS-treated reads against large genomes (e.g., human). To achieve this goal we have implemented three different ideas: (i) we use a 5-letters alphabet for storing methylation information, (ii) we use a weighted context-aware Hamming distance to identify a T coming from an unmethylated C context, and (iii) we use an iterative process to position multiple-hit reads starting from a preliminary map built using single-hit alignments. The map is corrected and extended at each cycle using the alignments added in the previous iteration. ERNE-BS5 is based on a new improved version of the rNA [20] aligning software with a more efficient core. ERNE (Extended Randomized Numerical alignEr) is a short string alignment package whose goal is to provide an all-inclusive set of tools to handle short reads. ERNE comprises: ERNE-MAP, ERNE-DMAP, ERNE-FILTER, ERNE-VISUAL, and, from now on, ERNE-BS5. ERNE is free software and distributed with an Open Source License (GPL V3) and can be downloaded at: http://erne.sourceforge.net


string processing and information retrieval | 2015

Fast Online Lempel-Ziv Factorization in Compressed Space

Alberto Policriti; Nicola Prezza

Let T be a text of length n on an alphabet


data compression conference | 2016

Computing LZ77 in Run-Compressed Space

Alberto Policriti; Nicola Prezza


international symposium on algorithms and computation | 2014

Hashing and Indexing: Succinct DataStructures and Smoothed Analysis

Alberto Policriti; Nicola Prezza

\Sigma


language and automata theory and applications | 2015

Average Linear Time and Compressed Space Construction of the Burrows-Wheeler Transform

Alberto Policriti; Nicola Gigante; Nicola Prezza


BMC Bioinformatics | 2015

Fast randomized approximate string matching with succinct hash data structures

Alberto Policriti; Nicola Prezza

of size


latin american symposium on theoretical informatics | 2018

On the Approximation Ratio of Lempel-Ziv Parsing.

Travis Gagie; Gonzalo Navarro; Nicola Prezza


symposium on the theory of computing | 2018

At the roots of dictionary compression: string attractors

Dominik Kempa; Nicola Prezza

\sigma


BMC Bioinformatics | 2016

Fast, accurate, and lightweight analysis of BS-treated reads with ERNE 2

Nicola Prezza; Francesco Vezzi; Max Käller; Alberto Policriti

Collaboration


Dive into the Nicola Prezza's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Travis Gagie

Diego Portales University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Philip Bille

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar

Inge Li Gørtz

Technical University of Denmark

View shared research outputs
Top Co-Authors

Avatar

Djamal Belazzougui

Helsinki Institute for Information Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Fabio Cunial

Helsinki Institute for Information Technology

View shared research outputs
Top Co-Authors

Avatar

Mathieu Raffinot

University of Marne-la-Vallée

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge