Is this you? Create Your Porfile

Sérgio Lifschitz

Pontifical Catholic University of Rio de Janeiro

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Sérgio Lifschitz is active.

Explore More

Publication

Featured researches published by Sérgio Lifschitz.

Distributed and Parallel Databases | 2003

Database Allocation Strategies for Parallel BLAST Evaluation on Clusters

Rogério Luís de Carvalho Costa; Sérgio Lifschitz

In this work we investigate the parallel evaluation of BLAST, the most popular tool for comparing biological sequences. Our goal is to study database distribution issues that, besides workload balancing, could improve the performance of a set of BLAST processes running in a workstation cluster. We consider different partitioning strategies within actual BLAST executions against a few relevant molecular databases. We have implemented multiple databases and input sequence configurations and show that there are many important parameters, such as the fragment generation method and sequence similarities, that must be taken into account in order to make full use of such parallel environment.

acm symposium on applied computing | 2007

A mechanism for replicated data consistency in mobile computing environments

José Maria Monteiro; Ângelo Brayner; Sérgio Lifschitz

Mobile computing allows for the development of new and sophisticated database applications. Such applications require the reading of current and consistent data. In order to improve data availability, increase performance and maximize throughput, data replication is used. However, due to inherent limitations in mobile and other loosely-coupled environments, the concurrency control and replica control mechanisms must be revisited. This paper proposes a new protocol that guarantees the consistency of replicated data in a mobile computing environment, while provide high data availability and ensure an eventual replica convergence towards a strongly consistent state.

Pattern Recognition | 1991

Computing some distance functions between polygons

Mikhail J. Atallah; Celso C. Ribeiro; Sérgio Lifschitz

Abstract We present algorithms for computing some distance functions between two (possibly intersecting) polygons, in both the convex and nonconvex cases. The interest for such distance functions comes from applications in robot vision, pattern recognition and contour fitting. We present an optimal EREW-PRAM parallel algorithm for the case when the input polygons are convex, and an essentially quadratic sequential algorithm for the case of arbitrary polygons (possibly with holes).

BMC Bioinformatics | 2013

Provenance in bioinformatics workflows.

Renato de Paula; Maristela Holanda; Luciana S. A. Gomes; Sérgio Lifschitz; Maria Emilia Telles Walter

In this work, we used the PROV-DM model to manage data provenance in workflows of genome projects. This provenance model allows the storage of details of one workflow execution, e.g., raw and produced data and computational tools, their versions and parameters. Using this model, biologists can access details of one particular execution of a workflow, compare results produced by different executions, and plan new experiments more efficiently. In addition to this, a provenance simulator was created, which facilitates the inclusion of provenance data of one genome project workflow execution. Finally, we discuss one case study, which aims to identify genes involved in specific metabolic pathways of Bacillus cereus, as well as to compare this isolate with other phylogenetic related bacteria from the Bacillus group. B. cereus is an extremophilic bacteria, collected in warm water in the Midwestern Region of Brazil, its DNA samples having been sequenced with an NGS machine.

database and expert systems applications | 2001

A Genome Databases Framework

Luiz Fernando Bessa Seibel; Sérgio Lifschitz

There are many Molecular Biology Databases, also known as Genome Databases, and there is a need for integrating all this data sources and related applications. This work proposes the use of an object-oriented framework for genome data access and manipulations. The framework approach is an interesting solution due to the flexibility, reusability and extensibility requirements of this application domain. We give a formal definition of our Genome Databases Framework using UML class diagrams, that explore the structural part of the architecture. A brief discussion on the Framework functionalities is also presented.

international conference on data engineering | 2005

Implementation of an Agent Architecture for Automated Index Tuning

R.L. de Carvalho Costa; Sérgio Lifschitz; M.F. de Noronha; M.A. Vaz Salles

It has been important to extend database management systems to support new requirements of applications and administration. We focus in this paper on the automatic tuning feature, particularly manipulation of indexes. The architectural approach considered here is based on software agents. We briefly present the heuristics involved in the decision upon creation, dropping and the exact moment to proceed with either action. Then we discuss our agentbased database architecture, with its layers and coupling within an actual DBMS. Finally, we explain the way our hypothetical indexes are integrated in PostgreSQL and how they can be used in practice for automated index tuning.

Clei Electronic Journal | 2018

Index Self-tunning with Agent-based Databases.

Rogério Luís de Carvalho Costa; Sérgio Lifschitz; Marcos Antonio Vaz Salles

The use of software agents as Database Management System components lead to database systems that may be configured and extended to support new requirements. We focus here with the self-tuning feature, which demands a somewhat intelligent behavior that agents could add to traditional DBMS modules. We propose in this paper an agent-based database architecture to deal with automatic index creation. Implementation issues are also discussed, for a built-in agents and DBMS integration architecture.

Theoretical Computer Science | 1998

A probabilistic view of Datalog parallelization

Sérgio Lifschitz; Victor Vianu

We explore an approach to developing Datalog parallelization strategies that aims at good expected rather than worst-case performance. To illustrate, we consider a very simple parallelization strategy that applies to all Datalog programs. We prove that this has very good expected performance under equal distribution of inputs. This is done using an extension of 0–1 laws adapted to this context. The analysis is confirmed by experimental results on randomly generated data.

european conference on parallel processing | 1997

Exploring Load Balancing in Parallel Processing of Recursive Queries

Sérgio Lifschitz; Alexandre Plastino; Celso C. Ribeiro

We study a dose-driven dynamic load balancing strategy to evaluate database recursive queries. This proposal aims at obtaining a good workload balance with full use of the available resources, with different kinds of skew considered. The strategy is intended for general recursive queries and preliminary computational results illustrate its efficiency when applied to the particular case of linear transitive closure.

Comparative and Functional Genomics | 2015

Evaluating the Cassandra NoSQL Database Approach for Genomic Data Persistency.

Rodrigo Aniceto; Rene Xavier; Valéria Monteze Guimarães; Fernanda Hondo; Maristela Holanda; Maria Emilia Telles Walter; Sérgio Lifschitz

Rapid advances in high-throughput sequencing techniques have created interesting computational challenges in bioinformatics. One of them refers to management of massive amounts of data generated by automatic sequencers. We need to deal with the persistency of genomic data, particularly storing and analyzing these large-scale processed data. To find an alternative to the frequently considered relational database model becomes a compelling task. Other data models may be more effective when dealing with a very large amount of nonconventional data, especially for writing and retrieving operations. In this paper, we discuss the Cassandra NoSQL database approach for storing genomic data. We perform an analysis of persistency and I/O operations with real data, using the Cassandra database system. We also compare the results obtained with a classical relational database system and another NoSQL database approach, MongoDB.

Explore More