Mario Schkolnick
IBM
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Mario Schkolnick.
Acta Informatica | 1994
Rudolf Bayer; Mario Schkolnick
SummaryConcurrent operations on B-trees pose the problem of insuring that each operation can be carried out without interfering with other operations being performed simultaneously by other users. This problem can become critical if these structures are being used to support access paths, like indexes, to data base systems. In this case, serializing access to one of these indexes can create an unacceptable bottleneck for the entire system. Thus, there is a need for locking protocols that can assure integrity for each access while at the same time providing a maximum possible degree of concurrency. Another feature required from these protocols is that they be deadlock free, since the cost to resolve a deadlock may be high.Recently, there has been some questioning on whether B-tree structures can support concurrent operations. In this paper, we examine the problem of concurrent access to B-trees. We present a deadlock free solution which can be tuned to specific requirements. An analysis is presented which allows the selection of parameters so as to satisfy these requirements.The solution presented here uses simple locking protocols. Thus, we conclude that B-trees can be used advantageously in a multi-user environment.
Journal of the ACM | 1978
Jon Louis Bentley; H. T. Kung; Mario Schkolnick; Clark D. Thompson
A maximal vector of a set ~s one which is not less than any other vector m all components We derive a recurrence relation for computing the average number of maxunal vectors m a set of n vectors m d-space under the assumpUon that all (nl) a relative ordermgs are equally probable. Solving the recurrence shows that the average number of maxmaa is O((ln n) a-~) for fixed d We use this result to construct an algorithm for finding all the maxima that have expected running tmae hnear m n (for sets of vectors drawn under our assumptions) We then use the result to find an upper bound on the expected number of convex hull points m a random point set
Communications of The ACM | 1981
Donald D. Chamberlin; Morton M. Astrahan; Michael W. Blasgen; Jim Gray; W. Frank King; Bruce G. Lindsay; Raymond A. Lorie; James W. Mehl; Thomas G. Price; Franco Putzolu; Patricia G. Selinger; Mario Schkolnick; Donald R. Slutz; Irving L. Traiger; Bradford W. Wade; Robert A. Yost
System R, an experimental database system, was constructed to demonstrate that the usability advantages of the relational data model can be realized in a system with the complete function and high performance required for everyday production use. This paper describes the three principal phases of the System R project and discusses some of the lessons learned from System R about the design of relational systems and database systems in general.
ACM Transactions on Database Systems | 1988
Sheldon J. Finkelstein; Mario Schkolnick; Paolo Tiberio
This paper describes the concepts used in the implementation of DBDSGN, an experimental physical design tool for relational databases developed at the IBM San Jose Research Laboratory. Given a workload for System R (consisting of a set of SQL statements and their execution frequencies), DBDSGN suggests physical configurations for efficient performance. Each configuration consists of a set of indices and an ordering for each table. Workload statements are evaluated only for atomic configurations of indices, which have only one index per table. Costs for any configuration can be obtained from those of the atomic configurations. DBDSGN uses information supplied by the System R optimizer both to determine which columns might be worth indexing and to obtain estimates of the cost of executing statements in different configurations. The tool finds efficient solutions to the index-selection problem; if we assume the cost estimates supplied by the optimizer are the actual execution costs, it finds the optimal solution. Optionally, heuristics can be used to reduce execution time. The approach taken by DBDSGN in solving the index-selection problem for multiple-table statements significantly reduces the complexity of the problem. DBDSGNs principles were used in the Relational Design Tool (RDT), an IBM product based on DBDSGN, which performs design for SQL/DS, a relational system based on System R. System R actually uses DBDSGNs suggested solutions as the tool expects because cost estimates and other necessary information can be obtained from System R using a new SQL statement, the EXPLAIN statement. This illustrates how a system can export a model of its internal assumptions and behavior so that other systems (such as tools) can share this model.
ACM Transactions on Database Systems | 1986
Giovanni Maria Sacco; Mario Schkolnick
The hot-set model, characterizing the buffer requirements of relational queries, is presented. This model allows the system to determine the optimal buffer space to be allocated to a query; it can also be used by the query optimizer to derive efficient execution plans accounting for the available buffer space, and by a query scheduler to prevent thrashing. The hot-set model is compared with the working-set model. A simulation study is presented.
ACM Transactions on Database Systems | 1981
Donald D. Chamberlin; Morton M. Astrahan; W. F. King; Raymond A. Lorie; James W. Mehl; Thomas G. Price; Mario Schkolnick; P. Griffiths Selinger; Donald R. Slutz; B. W. Wade; Robert A. Yost
System R supports a high-level relational user language called SQL which may be used by ad hoc users at terminals or as an embedded data sublanguage in PL/I or COBOL. Host-language programs with embedded SQL statements are processed by the System R precompiler which replaces the SQL statements by calls to a machine-language access module. The precompilation approach removes much of the work of parsing, name binding, and access path selection from the path of a running program, enabling highly efficient support for repetitive transactions. Ad hoc queries are processed by a similar approach of name binding and access path selection which takes place on-line when the query is specified. By providing a flexible spectrum of binding times, System R permits transaction-oriented programs and ad hoc query users to share a database without loss of efficiency. System R is an experimental database management system designed and built by members of the IBM San Jose Research Laboratory as part of a research program on the relational model of data. This paper describes the architecture of System R, and gives some preliminary measurements of system performance in both the ad hoc query and the “canned program” environments.
Ibm Systems Journal | 1999
Michael W. Blasgen; Morton M. Astrahan; Donald D. Chamberlin; Jim Gray; W. F. King; Bruce G. Lindsay; Raymond A. Lorie; James W. Mehl; Thomas G. Price; Gianfranco R. Putzolu; Mario Schkolnick; P. G. Sellinger; Donald R. Slutz; H. R. Strong; Irving L. Traiger; Bradford W. Wade; Robert A. Yost
We have described the architecture of System R, including the Relational Data System and the Research Storage System. The RDS supports a flexible spectrum of binding times, ranging from precompilation of “canned transactions” to on-line execution of ad hoc queries. The advantages of this approach may be summarized as follows: 1. For repetitive transactions, all the work of parsing, name binding, and access path selection is done once at precompilation time and need not be repeated. 2. Ad hoc queries are compiled on line into small machine-language routines that execute more efficiently than an interpreter. 3. Users are given a single language, SQL, for use in ad hoc queries as well as in writing PL/I and COBOL transaction programs. 4. The SQL parser, access path selection routines, and machine language code generator are used in common between query processing and precompilation of transaction programs. 5. When an index used by a transaction program is dropped, a new access path is automatically selected for the transaction without user intervention.
ACM Transactions on Database Systems | 1985
Mario Schkolnick; Paolo Tiberio
In this paper, cost formulas are derived for the updates of data and indexes in a relational database. The costs depend on the data scan type and the predicates involved in the update statements. We show that update costs have a considerable influence, both in the context of the physical database design problem and in access path selection in query optimization for relational DBMSs.
Information Systems | 1987
Morton M. Astrahan; Mario Schkolnick; Kyu-Young Whang
Abstract Counts of unique values are frequently needed information in database systems. Especially, they are essential in query optimization and physical database design. Traditionally, exact counts were obtained by sorting, which is an expensive operation. In this paper we present three algorithms for counting unique values by probabilistic methods. These algorithms require only one pass over the data, and produce approximations to the true count with certain standard deviations. For deviations acceptable in practical environments (~10%), the algorithms require only modest amounts of memory space and computation time. We have implemented all three algorithms in System R. We also present the results of the experiments on accuracy and performance of these algorithms.
very large data bases | 1979
Vincent Y. Lum; Sakti P. Ghosh; Mario Schkolnick; Robert W. Taylor; D. Jefferson; Stanley Y. W. Su; James P. Fry; Toby J. Teorey; B. Yao; D. S. Rund; Beverly K. Kahn; Shamkant B. Navathe; Diane C. P. Smith; L. Aguilar; W. J. Barr; P. E. Jones
This is a summary of a bigger report based on the results arrived at the New Orleans data base design workshop. This paper outlines the four major areas of data base design. It discusses the important issues, some of the results which have been achieved and future research problems.