Jared Jackson | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Jared Jackson is active.

Explore More

Publication

Featured researches published by Jared Jackson.

high performance distributed computing | 2010

AzureBlast: a case study of developing science applications on the cloud

Wei Lu; Jared Jackson; Roger S. Barga

Cloud computing has emerged as a new approach to large scale computing and is attracting a lot of attention from the scientific and research computing communities. Despite its growing popularity, it is still unclear just how well the cloud model of computation will serve scientific applications. In this paper we analyze the applicability of cloud to the sciences by investigating an implementation of a well known and computationally intensive algorithm called BLAST. BLAST is a very popular life sciences algorithm used commonly in bioinformatics research. The BLAST algorithm makes an excellent case study because it is both crucial to many life science applications and its characteristics are representative of many applications important to data intensive scientific research. In our paper we introduce a methodology that we use to study the applicability of cloud platforms to scientific computing and analyze the results from our study. In particular we examine the best practices of handling the large scale parallelism and large volumes of data. While we carry out our performance evaluation on Microsofts Windows Azure the results readily generalize to other cloud platforms.

ieee international conference on escience | 2008

The Trident Scientific Workflow Workbench

Roger S. Barga; Jared Jackson; Nelson Araujo; Dean Guo; Nitin Gautam; Yogesh Simmhan

In our demonstration we present Trident, a scientific workflow workbench built on top of a commercial workflow system to leverage existing functionality to the extent possible. Trident is being developed in collaboration with the scientific computing community for use in a number of ongoing eScience projects that make use of scientific workflows, in particular the Pan-STARRS sky survey project and the Ocean Observatory Initiative. In our demonstration of Trident we will illustrate the ability to utilize both local and cloud resources for storage and execution, as well as services such as provenance, monitoring, logging and scheduling workflows over clusters. Our goal is to release Trident in early 2009 as an open source accelerator for others to use for eScience projects and to continue extending with support for new workflow features and services.

ieee international conference on cloud computing technology and science | 2010

Performing Large Science Experiments on Azure: Pitfalls and Solutions

Wei Lu; Jared Jackson; Jaliya Ekanayake; Roger S. Barga; Nelson Araujo

Carrying out science at extreme scale is the next generational challenge facing the broad field of scientific research. Cloud computing offers to potential for an increasing number of researchers to have ready access to the large scale compute resources required to tackle new challenges in their field. Unfortunately barriers of complexity remain for researchers untrained in cloud programming. In this paper we examine how cloud based architectures can be used to solve large scale research experiments in a manner that is easily accessible for researchers with limited programming experience, using their existing computational tools. We examine the top challenges identified in our own large-scale science experiments running on the Windows Azure platform and then describe a Cloud-based parameter sweep prototype (dubbed Cirrus) which provides a framework of solutions for each challenge.

ieee congress on services | 2008

Trident: Scientific Workflow Workbench for Oceanography

Roger S. Barga; Jared Jackson; Nelson Araujo; Dean Guo; Nitin Gautam; Keith Grochow; Edward D. Lazowska

We introduce Trident, a scientific workflow workbench that is built on top of a commercial workflow system to leverage existing functionality. Trident is being developed in collaboration with the scientific community for oceanography, but the workbench itself can be used for any science project for scientific workflow.

ieee international symposium on parallel & distributed processing, workshops and phd forum | 2011

CloudClustering: Toward an Iterative Data Processing Pattern on the Cloud

Ankur Dave; Wei Lu; Jared Jackson; Roger S. Barga

As the emergence of cloud computing brings the potential for large-scale data analysis to a broader community, architectural patterns for data analysis on the cloud, especially those addressing iterative algorithms, are increasingly useful. MapReduce suffers performance limitations for this purpose as it is not inherently designed for iterative algorithms. In this paper we describe our implementation of Cloud Clustering, a distributed k-means clustering algorithm on Microsofts Windows Azure cloud. The k-means algorithm makes a good case study because its characteristics are representative of many iterative data analysis algorithms. Cloud Clustering adopts a novel architecture to improve performance without sacrificing fault tolerance. To achieve this goal, we introduce a distributed fault tolerance mechanism called the buddy system, and we make use of data affinity and check pointing. Our goal is to generalize this architecture into a pattern for large-scale iterative data analysis on the cloud.

international conference on cloud computing | 2011

A Scalable Communication Runtime for Clouds

Jaliya Ekanayake; Jared Jackson; Wei Lu; Roger S. Barga; Atilla Soner Balkir

Leveraging cloud computing to acquire the necessary computation resources to scale out parallel applications is becoming common practice. However, many such applications also require communication and synchronization between processes. Although, commercial cloud platforms provide ready access to scalable compute and storage services, implementing communication and synchronization between cooperating processes and efficiently exchanging arbitrary size messages remains a challenge for application developers. In clouds, durable queues provide basic abstractions for communication. However, they are not sufficient for applications that require transferring arbitrary size messages or for applications that require higher level abstractions such as broadcast. Furthermore, direct socket based communication is susceptible to various fluctuations common in data center environments. We envision a solution to this problem that leverages scalable storage services, queues, and direct socket based communication. Publish/subscribe (pub/sub) is a well-known communication pattern that can achieve the above capabilities in a loosely coupled fashion, which is highly desirable in cloud environments where most services are asynchronous. In this paper, we describe the architecture of a pub/sub library implemented on a commercial cloud computing platform, which can be used to develop various parallel applications. We also present an evaluation of our implementation using both micro benchmarks and a real world application. Together, these demonstrate that our approach is both effective and scalable in performing communication and synchronization in cloud scale applications.

ieee international conference on escience | 2008

Capturing Workflow Event Data for Monitoring, Performance Analysis, and Management of Scientific Workflows

Matthew D. Valerio; Satya S. Sahoo; Roger S. Barga; Jared Jackson

To effectively support real-time monitoring and performance analysis of scientific workflow execution, varying levels of event data must be captured and made available to interested parties. This paper discusses the creation of an ontology-aware workflow monitoring system for use in the Trident system which utilizes a distributed publish/subscribe event model. The implementation of the publish/subscribe system is discussed and performance results are presented.

interactive tabletops and surfaces | 2010

e-science on the surface

Tom Bartindale; Jared Jackson; Patrick Olivier

Often large amounts of computing power and storage resources are needed to facilitate e-science experiments, and much research has gone into providing software and hardware for these high-end needs. Once datasets are produced by these systems, few tools exist to comprehensively help researchers analyze, share and publish their findings and conclusions. We demonstrate a tool developed to allow the annotation, visualizing and sharing of large related data sets. Based around tangible objects, this system demonstrates a number of novel interactions for scientific research.

Archive | 2009