Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Nils Nieuwejaar is active.

Publication


Featured researches published by Nils Nieuwejaar.


IEEE Transactions on Parallel and Distributed Systems | 1996

File-access characteristics of parallel scientific workloads

Nils Nieuwejaar; David Kotz; Apratim Purakayastha; C. Sclatter Ellis; Michael L. Best

Phenomenal improvements in the computational performance of multiprocessors have not been matched by comparable gains in I/O system performance. This imbalance has resulted in I/O becoming a significant bottleneck for many scientific applications. One key to overcoming this bottleneck is improving the performance of multiprocessor file systems. The design of a high-performance multiprocessor file system requires a comprehensive understanding of the expected workload. Unfortunately, until recently, no general workload studies of multiprocessor file systems have been conducted. The goal of the CHARISMA project was to remedy this problem by characterizing the behavior of several production workloads, on different machines, at the level of individual reads and writes. The first set of results from the CHARISMA project describe the workloads observed on an Intel iPSC/860 and a Thinking Machines CM-5. This paper is intended to compare and contrast these two workloads for an understanding of their essential similarities and differences, isolating common trends and platform-dependent variances. Using this comparison, we are able to gain more insight into the general principles that should guide multiprocessor file-system design.


conference on high performance computing (supercomputing) | 1994

Dynamic file-access characteristics of a production parallel scientific workload

David Kotz; Nils Nieuwejaar

Multiprocessors have permitted astounding increases in computational performance, but many cannot meet the intense I/O requirements of some scientific applications. An important component of any solution to this I/O bottleneck is a parallel file system that can provide high-bandwidth access to tremendous amounts of data in parallel to hundreds or thousands of processors. Most successful systems are based on a solid understanding of the expected workload, but thus far there have been no comprehensive workload characterizations of multiprocessor file systems. This paper presents the results of a three week tracing study in which all file-related activity on a massively parallel computer was recorded. Our instrumentation differs from previous efforts in that it collects information about every I/O request and about the mix of jobs running an a production environment. We also present the results of a trace-driven caching simulation and recommendations for designers of multiprocessor file systems.<<ETX>>


international parallel processing symposium | 1995

Characterizing parallel file-access patterns on a large-scale multiprocessor

Apratim Purakayastha; Carla Schlatter Ellis; David Kotz; Nils Nieuwejaar; Michael L. Best

High-performance parallel file systems are needed to satisfy tremendous I/O requirements of parallel scientific applications. The design of such high-performance parallel file systems depends on a comprehensive understanding of the expected workload, but so far there have been very few usage studies of multiprocessor file systems. This paper is part of the CHARISMA project, which intends to fill this void by measuring real file-system workloads on various production parallel machines. In particular, here we present results from the CM-5 at the National Center for Supercomputing Applications. Our results are unique because we collect information about nearly every individual I/O request from the mix of jobs running on the machine. Analysis of the traces leads to various recommendations for parallel file-system design. >


parallel computing | 1997

The Galley parallel file system

Nils Nieuwejaar; David Kotz

Abstract Most current multiprocessor file systems are designed to use multiple disks in parallel, using the high aggregate bandwidth to meet the growing I/O requirements of parallel scientific applications. Many multiprocessor file systems provide applications with a conventional Unix-like interface, allowing the application to access multiple disks transparently. This interface conceals the parallelism within the file system, increasing the ease of programmability, but making it difficult or impossible for sophisticated programmers and libraries to use knowledge about their I/O needs to exploit that parallelism. In addition to providing an insufficient interface, most current multiprocessor file systems are optimized for a different workload than they are being asked to support. We introduce Galley, a new parallel file system that is intended to efficiently support realistic scientific multiprocessor workloads. We discuss Galleys file structure and application interface, as well as the performance advantages offered by that interface.


IOPADS | 1995

Low-level Interfaces for High-level Parallel I/O

Nils Nieuwejaar; David Kotz

As the I/O needs of parallel scientific applications increase, file systems for multiprocessors are being designed to provide applications with parallel access to multiple disks. Many parallel file systems present applications with a conventional Unix-like interface that allows the application to access multiple disks transparently. By tracing all the activity of a parallel file system in a production, scientific computing environment, we show that many applications exhibit highly regular, but non-consecutive I/O access patterns. Since the conventional interface does not provide an efficient method of describing these patterns, we present three extensions to the interface that support strided, nested-strided, and nested-batched I/O requests. We show how these extensions can be used to express common access patterns.


IEEE Parallel & Distributed Technology: Systems & Applications | 1995

File-system workload on a scientific multiprocessor

David Kotz; Nils Nieuwejaar

The Charisma project records individual read and write requests in live, multiprogramming parallel workloads. This information can be used to design more efficient multiprocessor systems. We present the first results from the project: a characterization of the tile-system workload on an iPSC/860 multiprocessor running production, parallel scientific applications at NASA Ames Research Center. We use the resulting information to address the following questions: What did the job mix look like (that is, how many jobs ran concurrently?) How many files were read and written? Which were temporary files? What were their sizes? What were typical read and write request sizes, and how were they spaced in the file? Were the accesses sequential? What forms of locality were there? How might caching be useful? What are the implications for file-system design?. >


Operating Systems Review | 1996

Flexibility and performance of parallel file systems

David Kotz; Nils Nieuwejaar

As we gain experience with parallel file systems, it becomes increasingly clear that a single solution does not suit all applications. For example, it appears to be impossible to find a single appropriate interface, caching policy, file structure, or disk-management strategy. Furthermore, the proliferation of file-system interfaces and abstractions make applications difficult to port.


workshop on i/o in parallel and distributed systems | 1996

Performance of the gallery parallel file system

Nils Nieuwejaar; David Kotz

As the 1/0 needs of parallel scientific applications increase, file systems for multiprocessors are being designed to provide applications with parallel access to multiple disks. Many parallel file systems present applications with a conventional Unix-like interface that allows the application to access multiple disks transparently. This interface conceals the parallelism within the file system, which increases the ease of programmability, but makes it difficult or impossible for sophisticated programmers and libraries to use knowledge about their 1/0 needs to exploit that parallelism. Furthermore, most current parallel file systems are optimized for a different workload than they are being asked to support. We introduce Galley, a new parallel file system that is intended to efficiently support realistic parallel workloads. Initial experiments, reported in this paper, indicate that Galley is capable of providing high-performance 1/0 to applications that access data in patterns that have been observed to be common.


Proceedings of the Third International ACPC Conference with Special Emphasis on Parallel Databases and Parallel I/O: Parallel Computation | 1996

Flexibility and Performance of Parallel File Systems

David Kotz; Nils Nieuwejaar

As we gain experience with parallel file systems, it becomes increasingly clear that a single solution does not suit all applications. For example, it appears to be impossible to find a single appropriate interface, caching policy, file structure, or disk-management strategy. Furthermore, the proliferation of file-system interfaces and abstractions make applications difficult to port.


international conference on supercomputing | 1996

The galley parallel file system

Nils Nieuwejaar; David Kotz

Collaboration


Dive into the Nils Nieuwejaar's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Michael L. Best

Georgia Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge