Sivaramakrishnan Narayanan

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Sivaramakrishnan Narayanan is active.

Explore More

Publication

Featured researches published by Sivaramakrishnan Narayanan.

high performance distributed computing | 2004

An approach for automatic data virtualization

Li Weng; Gagan Agrawal; T. Kur; Sivaramakrishnan Narayanan; Joel H. Saltz

Analysis of large and/or geographically distributed scientific datasets is emerging as a key component of grid computing. One challenge in this area is that scientific datasets are typically stored as binary or character flat-files, which makes specification of processing much harder. In view of this, there has been recent interest in data virtualization, and data services to support such virtualization. This paper presents an approach for automatically creating data services to support data virtualization. Specifically, we show how a relational table like data abstraction can be supported for complex multidimensional scientific datasets that are resident on a cluster. We have designed and implemented a tool that processes SQL queries (with select and where statements) on multi-dimensional datasets. We have designed a meta-data description language that is used for specifying the data layout. From such description, our tool automatically generates efficient data subsetting and access functions. We have extensively evaluated our system. The key observations from our experiments are as follows. First, our tool can correctly and efficiently handle a variety of different data layouts. Second, our system scales well as the number of nodes or the amount of data is scaled. Third, the performance of the automatically generated code for indexing and contracting functions is quite comparable to the performance of hand-written codes.

Parallel Processing Letters | 2003

DATABASE SUPPORT FOR DATA-DRIVEN SCIENTIFIC APPLICATIONS IN THE GRID

Sivaramakrishnan Narayanan; Tahsin M. Kurç; Joel H. Saltz

In this paper we describe a services oriented software system to provide basic database support for efficient execution of applications that make use of scientific datasets in the Grid. This system supports two core operations: efficient selection of the data of interest from distributed databases and efficient transfer of data from storage nodes to compute nodes for processing. We present its overall architecture and main components and describe preliminary experimental results.

latin american web congress | 2003

Applying database support for large scale data driven science in distributed environments

Sivaramakrishnan Narayanan; Tahsin M. Kurç; Xi Zhang; Joel H. Saltz

There is a rapidly growing set of applications, referred to as data driven applications, in which analysis of large amounts of data drives the next steps taken by the scientist, e.g., running new simulations, doing additional measurements, extending the analysis to larger data collections. Critical steps in data analysis are to extract the data of interest from large and potentially distributed datasets and to move it from storage clusters to compute clusters for processing. We have developed a middleware framework, called GridDB-Lite, that is designed to efficiently support these two steps. We describe the application of GridDB-Lite in large scale oil reservoir simulation studies and experimentally evaluate several optimizations that can be employed in the GridDB-Lite runtime system.

IEEE Computer | 2008

Analysis and Semantic Querying in Large Biomedical Image Datasets

Vijay Kumar; Sivaramakrishnan Narayanan; Tahsin M. Kurç; Jun Kong; Metin N. Gurcan; Joel H. Saltz

Biomedical image analysis plays an important role in diagnosing, prognosing, and treating complex diseases. The authors describe a set of techniques for analyzing, processing, and querying large image datasets using semantic and spatial information.

international conference on computational science | 2003

Driving scientific applications by data in distributed environments

Joel H. Saltz; Tahsin M. Kurç; Mike Gray; Shannon Hastings; Stephen Langella; Sivaramakrishnan Narayanan; Ryan Martino; Steven L. Bryant; Malgorzata Peszynska; Mary F. Wheeler; Alan Sussman; Michael D. Beynon; Christian Hansen; Don Stredney; Sessanna D

Traditional simulation-based applications for exploring a parameter space to understand a physical phenomenon or to optimize a design are rapidly overwhelmed by data volume when large numbers of simulations of different parameters are carried out. Optimizing reservoir management through simulation-based studies, in which large numbers of realizations are sought using detailed geologic descriptions, is an example of such applications. In this paper, we describe a software architecture to facilitate large scale simulation studies, involving ensembles of long-running simulations and analysis of vast volumes of output data. This architecture is built on top of two frameworks we have developed: IPARS and DataCutter. These frameworks make it possible to implement tools and applications to run large-scale simulatios, and generate and investigate terabyte-scale datasets efficiently.

ieee international conference on high performance computing data and analytics | 2009

HPC and Grid Computing for Integrative Biomedical Research

Tahsin M. Kurç; Shannon Hastings; Vijay Kumar; Stephen Langella; Ashish Sharma; Tony Pan; Scott Oster; David Ervin; Justin Permar; Sivaramakrishnan Narayanan; Yolanda Gil; Ewa Deelman; Mary W. Hall; Joel H. Saltz

Integrative biomedical research projects query, analyze, and integrate many different data types and make use of datasets obtained from measurements or simulations of structure and function at multiple biological scales. With the increasing availability of high-throughput and high-resolution instruments, the integrative biomedical research imposes many challenging requirements on software middleware systems. In this paper, we look at some of these requirements using example research pattern templates. We then discuss how middleware systems, which incorporate Grid and high-performance computing, could be employed to address the requirements.

high performance distributed computing | 2004

Strategies for using additional resources in parallel hash-based join algorithms

Xi Zhang; Tahsin M. Kurç; Tony Pan; Sivaramakrishnan Narayanan; Pete Wyckoff; Joel H. Saltz

Hash-based join is a compute- and memory-intensive algorithm. It achieves good performance and scales well to large datasets, if sufficient memory is available to hold the hash table and the distribution of computing had across nodes is balanced. We compare three adaptive algorithms that start with a partitioning of the hash table across a group of nodes and expand during the hash table building phase to additional resources, when memory on a node is used up. The split-based algorithm partitions the hash table range assigned to the node, on which memory is full, into two segments and assigns one of the segments to a new node in the system. The replication-based algorithm replicates the hash table range on a new node. The hybrid algorithm combines the first and second strategies in order to address each strategys short comings. We perform an experimental performance evaluation of these algorithms on a PC cluster. Our results show that among the three algorithms, in most cases the hybrid algorithm either performs close to the better of the two or is the best algorithm.

acm symposium on applied computing | 2009

Parallel materialization of large ABoxes

Sivaramakrishnan Narayanan; Tahsin M. Kurç; Joel H. Saltz

This paper is concerned with the efficient computation of materialization in a knowledge base with a large ABox. We present a framework for performing this task on a shared-nothing parallel machine. The framework partitions TBox and ABox axioms using a min-min strategy. It utilizes an existing system, like SwiftOWLIM, to perform local inference computations and coordinates exchange of relevant information between processors. Our approach is able to exploit parallelism in the axioms of the TBox to achieve speedup in a cluster. However, this approach is limited by the complexity of the TBox. We present an experimental evaluation of the framework using datasets from the Lehigh University Benchmark (LUBM).

very large data bases | 2005

Servicing seismic and oil reservoir simulation data through grid data services

Sivaramakrishnan Narayanan; Tahsin M. Kurç; Joel H. Saltz

This paper presents the implementation of a two layer infrastructure for servicing queries against large datasets generated in oil reservoir simulation studies in the Grid. The first layer implements object-relational virtualization of file-based dataset stored on a storage cluster. The second layer provides an implementation of Grid Data Services via Open Grid Services Architecture Data Access and Integration (OGSA-DAI) middleware.

Archive | 2007