Is this you? Create Your Porfile

Haran Boral

University of Wisconsin-Madison

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Haran Boral is active.

Explore More

Publication

Featured researches published by Haran Boral.

IEEE Transactions on Knowledge and Data Engineering | 1990

Prototyping Bubba, a highly parallel database system

Haran Boral; William Alexander; Larry Clay; George P. Copeland; Scott Danforth; Michael J. Franklin; Brian E. Hart; Marc G. Smith; Patrick Valduriez

Bubba is a highly parallel computer system for data-intensive applications. The basis of the Bubba design is a scalable shared-nothing architecture which can scale up to thousands of nodes. Data are declustered across the nodes (i.e. horizontally partitioned via hashing or range partitioning) and operations are executed at those nodes containing relevant data. In this way, parallelism can be exploited within individual transactions as well as among multiple concurrent transactions to improve throughput and response times for data-intensive applications. The current Bubba prototype runs on a commercial 40-node multicomputer and includes a parallelizing compiler, distributed transaction management, object management, and a customized version of Unix. The current prototype is described and the major design decisions that went into its construction are discussed. The lessons learned from this prototype and its predecessors are presented. >

measurement and modeling of computer systems | 1987

Multi-disk management algorithms

Miron Livny; Setrag Khoshafian; Haran Boral

We investigate two schemes for placing data on multiple disks. We show that declustering (spreading each file across several disks) is inherently better than clustering (placing each file on a single disk) due to a number of reasons including parallelism and uniform load on all disks.

international conference on data engineering | 1987

A query processing strategy for the decomposed storage model

Setrag Khoshafian; George P. Copeland; Thomas Jagodits; Haran Boral; Patrick Valduriez

Handling parallelism in database systems involves the specification of a storage model, a placement strategy, and a query processing strategy. An important goal is to determine the appropriate combination of these three strategies in order to obtain the best performance advantages. In this paper we present a novel and promising query processing strategy for a decomposed storage model. We discuss some of the qualitative advantages of the scheme. We also compare the performance of the proposed “pivot” strategy with conventional query processing for the n-ary storage model. The comparison is performed using the Wisconsin Benchmarks.

international conference on management of data | 1984

A methodology for database system performance evaluation

Haran Boral; David J. DeWitt

This paper presents a methodology for evaluating the performance of database management systems and database machines in a multiuser environment. Three main factors that affect transaction throughput in a multiuser environment are identified: multiprogramming level, degree of data sharing among simultaneously executing transactions, and transaction mix. We demonstrate that only four basic query types are needed to construct a benchmark that will evaluate the performance of a system under a wide variety of workloads. Finally, we present the results of applying our techniques to the Britton-Lee IDM 500 database machine

ACM Transactions on Database Systems | 1981

Processor allocation strategies for multiprocessor database machines

Haran Boral; David J. DeWitt

In this paper four alternative strategies for assigning processors to queries in multiprocessor database machines are described and evaluated. The results demonstrate that SIMD database machines are indeed a poor design when their performance is compared with that of the three MIMD strategies presented. Also introduced is the application of data-flow machine techniques to the processing of relational algebra queries. A strategy that employs data-flow techniques is shown to be superior to the other strategies described by several experiments. Furthermore, if the data-flow query processing strategy is employed, the results indicate that a two-level storage hierarchy (in which relations are paged between a shared data cache and mass storage) does not have a significant impact on performance.

high performance transaction systems workshop | 1987

A Single-User Performance Evaluation of the Teradata Database Machine

David J. DeWitt; Marc G. Smith; Haran Boral

In this report we present the results of an initial performance evaluation of the Teradata DBC/1012 parallel database machine, based on an expanded version of the single-user Wisconsin benchmark. In our experiments we measured the effect of relation size, memory size, and indices on response time for selection, join, and aggregation queries, single-tuple updates, and relation sorts. We analyze and interpret the experiment results based on our understanding of the system hardware and software, and conclude with a subjective assessment of the machine.

international conference on management of data | 1980

Design considerations for data-flow database machines

Haran Boral; David J. DeWitt

This paper presents a discussion of the application of data-flow machine concepts to the design and implementation of database machines which execute relational algebra queries. We analyze the performance of multiprocessor nested-loops and sort-merge join algorithms and show that the nested-loops algorithm is generally superior. Three levels of operand granularity for data-flow database machines are introduced and compared using the nested-loops join algorithm. We demonstrate, that relation-level granularity is too coarse and that tuple-level granularity is too fine. The third level of granularity, a page of a relation, is shown to be the best choice from both hardware and software viewpoints. Finally, a preliminary design for a data-flow database machine which utilizes page-level granularity and supports distributed control of instruction execution is presented.

international conference on management of data | 1984

Towards a self-adapting centralized concurrency control algorithm

Haran Boral; Israel Gold

We introduce the notion of self-adapting concurrency control algorithms --- concurrency control algorithms that consist of several rw and several ww synchronization techniques, and employ combinations of the techniques in a manner that attains a performance objective. We Consider synchronization techniques that use locking and certification. A general proof method for such algorithms is outlined and applied.

international conference on management of data | 1982

A framework for research in database management for statistical analysis or a primer on statistical database management problems for computer scientists

Doug Bates; Haran Boral; David J. DeWitt

This paper is intended to introduce those familiar with database management issues to the problems of managing large statistical databases. We begin with a characterization of statistical databases based on the structure and use of the data in the database. Several data management problems are then described. In particular, we discuss the problem of repetitive computations on large segments of the database during the lifetime of a statistical analysis. The organization of a data management system which avoids this problem by caching previously computed results and automatically maintaining their integrity is presented. We conclude with a list of problems that this organization raises and a discussion of related work.

IWDM '89 Proceedings of the Sixth International Workshop on Database Machines | 1989

An Experiment on Response Time Scalability in Bubba

Marc G. Smith; Bill Alexander; Haran Boral; George P. Copeland; Tom Keller; Herb Schwetman; Chii-Ren Young

We describe results from an experiment that investigates the scalability of response time performance in shared-nothing systems, such as the Bubba parallel database machine. In particular, we show how—and how much—potential response time improvements for certain transaction types can be impaired in shared-nothing architectures by the increased cost of transaction startup, communication, and synchronization as the degree of execution parallelism is increased. We further show how these effects change under increased levels of concurrency and heterogeneity in the transaction workload. From the results, we conclude that although parallelism must be limited in some circumstances, in general the benefits of increased parallelism in shared-nothing systems exceed the costs.

Explore More