Lucian Ilie | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Lucian Ilie is active.

Explore More

Publication

Featured researches published by Lucian Ilie.

Bioinformatics | 2011

SHRiMP2: Sensitive yet Practical Short Read Mapping

Matei David; Misko Dzamba; Dan Lister; Lucian Ilie; Michael Brudno

UNLABELLED We report on a major update (version 2) of the original SHort Read Mapping Program (SHRiMP). SHRiMP2 primarily targets mapping sensitivity, and is able to achieve high accuracy at a very reasonable speed. SHRiMP2 supports both letter space and color space (AB/SOLiD) reads, enables for direct alignment of paired reads and uses parallel computation to fully utilize multi-core architectures. AVAILABILITY SHRiMP2 executables and source code are freely available at: http://compbio.cs.toronto.edu/shrimp/.

Bioinformatics | 2011

HiTEC: accurate error correction in high-throughput sequencing data

Lucian Ilie; Farideh Fazayeli; Silvana Ilie

MOTIVATION High-throughput sequencing technologies produce very large amounts of data and sequencing errors constitute one of the major problems in analyzing such data. Current algorithms for correcting these errors are not very accurate and do not automatically adapt to the given data. RESULTS We present HiTEC, an algorithm that provides a highly accurate, robust and fully automated method to correct reads produced by high-throughput sequencing methods. Our approach provides significantly higher accuracy than previous methods. It is time and space efficient and works very well for all read lengths, genome sizes and coverage levels. AVAILABILITY The source code of HiTEC is freely available at www.csd.uwo.ca/~ilie/HiTEC/.

Information and Computation archive | 2003

Follow automata

Lucian Ilie; Sheng Yu

We give two new algorithms for constructing small nondeterministic finite automata (NFA) from regular expressions. The first constructs NFAs with e-transitions (eNFA) which are smaller than all the other eNFAs obtained by similar constructions. Their size is at most 3/2 |α| + 5/2, where α is the regular expression. This is very close to optimal since we prove also the lower bound 4/3; |α| + 5/2. The second constructs NFAs. It uses e-elimination in the eNFAs we just introduced and builds a quotient of the well-known position automaton w.r.t. the equivalence given by the follow relation; therefore giving the name of follow automaton. The new automaton uses optimally the information from the positions of a regular expression. We compare the follow automaton with the best constructions to date and show that it has important advantages over those.

Information Processing Letters | 2008

Computing Longest Previous Factor in linear time and applications

Maxime Crochemore; Lucian Ilie

We give two optimal linear-time algorithms for computing the Longest Previous Factor (LPF) array corresponding to a string w. For any position i in w, LPF[i] gives the length of the longest factor of w starting at position i that occurs previously in w. Several properties and applications of LPF are investigated. They include computing the Lempel-Ziv factorization of a string and detecting all repetitions (runs) in a string in linear time independently of the integer alphabet size.

data compression conference | 2008

A Simple Algorithm for Computing the Lempel Ziv Factorization

Maxime Crochemore; Lucian Ilie; William F. Smyth

We give a space-efficient simple algorithm for computing the Lempel-Ziv factorization of a string. For a string of length n over an integer alphabet, it runs in O(n) time independently of alphabet size and uses o(n) additional space.

Theoretical Computer Science | 2007

A note on the number of squares in a word

Lucian Ilie

Fraenkel and Simpson [A.S. Fraenkel, J. Simpson, How many squares can a string contain? J. Combin. Theory Ser. A 82 (1998) 112-120] proved that the number of squares in a word of length n is bounded by 2n. In this note we improve this bound to 2n-@Q(logn). Based on the numerical evidence, the conjectured bound is n.

Journal of Combinatorial Theory | 2005

A simple proof that a word of length n has at most 2 n distinct squares

Lucian Ilie

We give a very short proof of a result by Fraenkel and Simpson (J. combin. Theory. Ser. A 82 (1998) 112) which states that the number of distinct squares in a word of lengh n is at most 2n.

Lecture Notes in Computer Science | 2004

On NFA Reductions

Lucian Ilie; Gonzalo Navarro; Sheng Yu

We give faster algorithms for two methods of reducing the number of states in nondeterministic finite automata. The first uses equivalences and the second uses preorders. We develop restricted reduction algorithms that operate on position automata while preserving some of its properties. We show empirically that these reductions are effective in largely reducing the memory requirements of regular expression search algorithms, and compare the effectiveness of different reductions.

Bioinformatics | 2013

RACER: Rapid and accurate correction of errors in reads

Lucian Ilie; Michael Molnar

MOTIVATION High-throughput next-generation sequencing technologies enable increasingly fast and affordable sequencing of genomes and transcriptomes, with a broad range of applications. The quality of the sequencing data is crucial for all applications. A significant portion of the data produced contains errors, and ever more efficient error correction programs are needed. RESULTS We propose RACER (Rapid and Accurate Correction of Errors in Reads), a new software program for correcting errors in sequencing data. RACER has better error-correcting performance than existing programs, is faster and requires less memory. To support our claims, we performed extensive comparison with the existing leading programs on a variety of real datasets. AVAILABILITY RACER is freely available for non-commercial use at www.csd.uwo.ca/∼ilie/RACER/.

Theoretical Computer Science | 2003

Reducing NFAs by invariant equivalences

Lucian Ilie; Sheng Yu

We give new general methods for constructing small non-deterministic finite automata (NFA) from arbitrary ones. Given an NFA, we compute the largest right-invariant equivalence on the set of states and then merge the equivalent states to obtain a smaller automaton. When applying this method to position automata, we get a way to convert regular expressions into NFAs which are always smaller than or equal to the position, partial derivative, and follow automata; it can be arbitrarily smaller. The construction can be dually made for left-invariant equivalences and then the two can be combined for even better results.

Explore More