Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Lukasz Lacinski is active.

Publication


Featured researches published by Lukasz Lacinski.


Journal of Biomedical Informatics | 2014

Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses

Bo Liu; Ravi K. Madduri; Borja Sotomayor; Kyle Chard; Lukasz Lacinski; Utpal J. Dave; Jianqiang Li; Chunchen Liu; Ian T. Foster

Due to the upcoming data deluge of genome data, the need for storing and processing large-scale genome data, easy access to biomedical analyses tools, efficient data sharing and retrieval has presented significant challenges. The variability in data volume results in variable computing and storage requirements, therefore biomedical researchers are pursuing more reliable, dynamic and convenient methods for conducting sequencing analyses. This paper proposes a Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses, which enables reliable and highly scalable execution of sequencing analyses workflows in a fully automated manner. Our platform extends the existing Galaxy workflow system by adding data management capabilities for transferring large quantities of data efficiently and reliably (via Globus Transfer), domain-specific analyses tools preconfigured for immediate use by researchers (via user-specific tools integration), automatic deployment on Cloud for on-demand resource allocation and pay-as-you-go pricing (via Globus Provision), a Cloud provisioning tool for auto-scaling (via HTCondor scheduler), and the support for validating the correctness of workflows (via semantic verification tools). Two bioinformatics workflow use cases as well as performance evaluation are presented to validate the feasibility of the proposed approach.


Concurrency and Computation: Practice and Experience | 2014

Experiences building Globus Genomics: a next-generation sequencing analysis service using Galaxy, Globus, and Amazon Web Services

Ravi K. Madduri; Dinanath Sulakhe; Lukasz Lacinski; Bo Liu; Alex Rodriguez; Kyle Chard; Utpal J. Dave; Ian T. Foster

We describe Globus Genomics, a system that we have developed for rapid analysis of large quantities of next‐generation sequencing genomic data. This system achieves a high degree of end‐to‐end automation that encompasses every stage of data analysis including initial data retrieval from remote sequencing centers or storage (via the Globus file transfer system); specification, configuration, and reuse of multistep processing pipelines (via the Galaxy workflow system); creation of custom Amazon Machine Images and on‐demand resource acquisition via a specialized elastic provisioner (on Amazon EC2); and efficient scheduling of these pipelines over many processors (via the HTCondor scheduler). The system allows biomedical researchers to perform rapid analysis of large next‐generation sequencing datasets in a fully automated manner, without software installation or a need for any local computing infrastructure. We report performance and cost results for some representative workloads. Copyright


Concurrency and Computation: Practice and Experience | 2015

The Globus Galaxies platform: delivering science gateways as a service

Ravi K. Madduri; Kyle Chard; Ryan Chard; Lukasz Lacinski; Alex Rodriguez; Dinanath Sulakhe; David Kelly; Utpal J. Dave; Ian T. Foster

The use of public cloud computers to host sophisticated scientific data and software is transforming scientific practice by enabling broad access to capabilities previously available only to the few. The primary obstacle to more widespread use of public clouds to host scientific software (‘cloud‐based science gateways’) has thus far been the considerable gap between the specialized needs of science applications and the capabilities provided by cloud infrastructures. We describe here a domain‐independent, cloud‐based science gateway platform, the Globus Galaxies platform, which overcomes this gap by providing a set of hosted services that directly address the needs of science gateway developers. The design and implementation of this platform leverages our several years of experience with Globus Genomics, a cloud‐based science gateway that has served more than 200 genomics researchers across 30 institutions. Building on that foundation, we have implemented a platform that leverages the popular Galaxy system for application hosting and workflow execution; Globus services for data transfer, user and group management, and authentication; and a cost‐aware elastic provisioning model specialized for public cloud resources. We describe here the capabilities and architecture of this platform, present six scientific domains in which we have successfully applied it, report on user experiences, and analyze the economics of our deployments. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.


international conference on e-science | 2015

Cost-Aware Cloud Provisioning

Ryan Chard; Kyle Chard; Kris Bubendorfer; Lukasz Lacinski; Ravi K. Madduri; Ian T. Foster

Cloud computing is often suggested as a low-cost and scalable model for executing and scaling scientific analyses. However, while the benefits of cloud computing are frequently touted, there are inherent technical challenges associated with scaling execution efficiently and cost-effectively. We describe here a cost-aware elastic provisioner designed to dynamically and cost-effectively provision cloud infrastructure based on the requirements of user-submitted scientific workflows. Our provisioner is used in the Globus Galaxies platform -- a Software-as-a-Service provider of scientific analysis capabilities using commercial cloud infrastructure. Using workloads from production usage of this platform we investigate the performance of our provisioner in terms of cost, spot instance termination rate, and execution time. We demonstrate cost savings across six production gateways of up to 95% and 12% improvement in total execution time when compared to a worst case scenario using a single instance type in a single availability zone.


Computational and structural biotechnology journal | 2015

A case study for cloud based high throughput analysis of NGS data using the globus genomics system

Krithika Bhuvaneshwar; Dinanath Sulakhe; Robinder Gauba; Alex Rodriguez; Ravi K. Madduri; Utpal J. Dave; Lukasz Lacinski; Ian T. Foster; Yuriy Gusev; Subha Madhavan

Next generation sequencing (NGS) technologies produce massive amounts of data requiring a powerful computational infrastructure, high quality bioinformatics software, and skilled personnel to operate the tools. We present a case study of a practical solution to this data management and analysis challenge that simplifies terabyte scale data handling and provides advanced tools for NGS data analysis. These capabilities are implemented using the “Globus Genomics” system, which is an enhanced Galaxy workflow system made available as a service that offers users the capability to process and transfer data easily, reliably and quickly to address end-to-endNGS analysis requirements. The Globus Genomics system is built on Amazon s cloud computing infrastructure. The system takes advantage of elastic scaling of compute resources to run multiple workflows in parallel and it also helps meet the scale-out analysis needs of modern translational genomics research.


extreme science and engineering discovery environment | 2013

Experiences in building a next-generation sequencing analysis service using galaxy, globus online and Amazon web service

Ravi K. Madduri; Paul Dave; Dinanath Sulakhe; Lukasz Lacinski; Bo Liu; Ian T. Foster

We describe Globus Genomics, a system that we have developed for rapid analysis of large quantities of next-generation sequencing (NGS) genomic data. This system is notable for its high degree of end-to-end automation, which encompasses every stage of the data analysis pipeline from initial data access (from remote sequencing center or database, by the Globus Online file transfer system) to on-demand resource acquisition (on Amazon EC2, via the Globus Provision cloud manager); specification, configuration, and reuse of multi-step processing pipelines (via the Galaxy workflow system); and efficient scheduling of these pipelines over many processors (via the Condor scheduler). The system allows biomedical researchers to perform rapid analysis of large NGS datasets using just a web browser in a fully automated manner, without software installation.


international conference on cloud computing | 2015

Cost-Aware Elastic Cloud Provisioning for Scientific Workloads

Ryan Chard; Kyle Chard; Kris Bubendorfer; Lukasz Lacinski; Ravi K. Madduri; Ian T. Foster

Cloud computing provides an efficient model to host and scale scientific applications. While cloud-based approaches can reduce costs as users pay only for the resources used, it is often challenging to scale execution both efficiently and cost-effectively. We describe here a cost-aware elastic cloud provisioner designed to elastically provision cloud infrastructure to execute analyses cost-effectively. The provisioner considers real-time spot instance prices across availability zones, leverages application profiles to optimize instance type selection, over-provisions resources to alleviate bottlenecks caused by oversubscribed instance types, and is capable of reverting to on-demand instances when spot prices exceed thresholds. We evaluate the usage of our cost-aware provisioner using four production scientific gateways and show that it can produce cost savings of up to 97.2% when compared to naive provisioning approaches.


Journal of Physics: Conference Series | 2012

End-To-End Solution for Integrated Workload and Data Management using GlideinWMS and Globus Online

Parag Mhashilkar; Zachary Miller; Rajkumar Kettimuthu; G. Garzoglio; Burt Holzman; Cathrin Weiss; Xi Duan; Lukasz Lacinski

Grid computing has enabled scientific communities to effectively share computing resources distributed over many independent sites. Several such communities, or Virtual Organizations (VO), in the Open Science Grid and the European Grid Infrastructure use the GlideinWMS system to run complex application work-flows. GlideinWMS is a pilot-based workload management system (WMS) that creates an on-demand, dynamically-sized overlay Condor batch system on Grid resources. While the WMS addresses the management of compute resources, however, data management in the Grid is still the responsibility of the VO. In general, large VOs have resources to develop complex custom solutions, while small VOs would rather push this responsibility to the infrastructure. The latter requires a tight integration of the WMS and the data management layers, an approach still not common in modern Grids. In this paper we describe a solution developed to address this shortcoming in the context of Center for Enabling Distributed Peta-scale Science (CEDPS) by integrating GlideinWMS with Globus Online (GO). Globus Online is a fast, reliable file transfer service that makes it easy for any user to move data. The solution eliminates the need for the users to provide custom data transfer solutions in the application by making this functionality part of the GlideinWMS infrastructure. To achieve this, GlideinWMS uses the file transfer plug-in architecture of Condor. The paper describes the system architecture and how this solution can be extended to support data transfer services other than Globus Online when used with Condor or GlideinWMS.


workflows in support of large scale science | 2013

Distributed tools deployment and management for multiple galaxy instances in globus genomics

Dinanath Sulakhe; Alex Rodriguez; Nick Prozorovsky; Nilesh Kavthekar; Ravi K. Madduri; Amol Parikh; Paul Dave; Lukasz Lacinski; Ian T. Foster

Workflow systems play an important role in the analysis of the fast-growing genomics data produced by low-cost next generation sequencing (NGS) technologies. Many biomedical research groups lack the expertise to assemble and run the sophisticated computational pipelines required for high-throughput analysis of such data. There is an urgent need for services that can allow researchers to run their analytical workflows where they can define their own research methodologies by selecting the tools of their interest. We present the challenges associated with managing multiple Galaxy instances on the cloud for various research groups using Globus Genomics, a cloud based platform-as-a-service (PaaS) that provides the Galaxy workflow system as a hosted service along with data management capabilities using Globus Online. We address the unique challenges, our strategy, and a tool for automatically deploying and managing hundreds of analytical tools coming from the public Galaxy Tool Shed, new tools wrapped by our group, and tools wrapped by end users across multiple Galaxy instances hosted with Globus Genomics.


Archive | 2017

Bayesian Causalities, Mappings, and Phylogenies: A Social Science Gateway for Modeling Ethnographic, Archaeological, Historical Ecological, and Biological Variables

Douglas R. White; Paul Rodriguez; Eric Blau; Stuart Martin; Lukasz Lacinski; Thomas D. Uram; Feng Ren; Wesley Roberts; Tolga Oztan

Extending the innovative “Def Wy” procedures for modeling evolutionary network effects (Dow, Cross-Cult Res 41:336–363, 2007; Dow and Eff, Cross-Cult Res 43:134–151, 2009; Dow and Eff, Cross-Cult Res 43:206–229, 2009), a Complex Social Science http://intersci.ss.uci.edu (CoSSci) Gateway was developed to provide complex analyses of ethnographic, archaeological, historical, ecological, and biological datasets with easy open access. Analysis begins with dependent variable y with n observations and X independent and other variables, and imputes missing data for all variates. Several (n × n) W* matrices measure evolutionary network effects such as diffusion or phylogenetic ancestries. W* is row-normalized to sum to 1 and combined to obtain a W, multiplied by X as WX, and allowing X and y multiplication by W:

Collaboration


Dive into the Lukasz Lacinski's collaboration.

Top Co-Authors

Avatar

Ian T. Foster

Argonne National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Ravi K. Madduri

Argonne National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Dinanath Sulakhe

Argonne National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Kyle Chard

Argonne National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Alex Rodriguez

Argonne National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Utpal J. Dave

Argonne National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Bo Liu

University of Chicago

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ryan Chard

Victoria University of Wellington

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge