Peter L. Bird
University of Michigan
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Peter L. Bird.
international symposium on microarchitecture | 1997
Charles Lefurgy; Peter L. Bird; I-Cheng K. Chen; Trevor N. Mudge
Proposes a method for compressing programs in embedded processors where the instruction memory size dominates the cost. A post-compilation analyzer examines a program and replaces common sequences of instructions with a single instruction codeword. A microprocessor executes the compressed instruction sequences by fetching codewords from the instruction memory, expanding them back to the original sequence of instructions in the decode stage, and issuing them to the execution stages. We apply our technique to the PowerPC, ARM and i386 instruction sets and achieve an average size reduction of 39%, 34% and 26%, respectively, for SPEC CINT95 programs.
international conference on supercomputing | 1993
Peter L. Bird; Alasdair Rawsthorne; Nigel P. Topham
This paper examines the effectiveness of decoupling as an optimization technique for high-performance computer architectures. Decoupled access execute architectures are described, and the concept of control decoupling is introduced and justified. A description of a highly-decoupled architecture is given, and a metric for the effectiveness of decoupling on particular programs, the Loss of Decoupling frequency is introduced. Finally, a number of real benchmark programs are examined and the applicability of decoupling them is analyzed.
conference on high performance computing (supercomputing) | 1995
Nigel P. Topham; Alasdair Rawsthorne; Callum McLean; Muriel Mewissen; Peter L. Bird
Decoupled architectures provide a key to the problem of sustained supercomputer performance through their ability to hide large memory latencies. When a program executes in a decoupled mode the perceived memory latency at the processor is zero; effectively the entire physical memory has an access time equivalent to the processors register file, and latency is completely hidden. However, the asynchronous functional units within a decoupled architecture must occasionally synchronize, incurring a high penalty. The goal of compiling and optimizing for decoupled architectures is to partition the program between the asynchronous functional units in such a way that latencies are hidden but synchronization events are executed infrequently. This paper describes a model for decoupled compilation, and explains the effectiveness of compilation for decoupled systems. A number of new compiler optimizations are introduced and evaluated quantitatively using the Perfect Club scientific benchmarks. We show that with a suitable repertiore of optimizations, it is possible to hide large latencies most of the time for most of the programs in the Perfect Club.
joint international conference on vector and parallel processing parallel processing | 1992
Peter L. Bird; Nigel P. Topham; Sathiamoorthy Manoharan
High performance scientific computers require high bandwidth memory systems. The size of data sets for large scientific codes implies the need for a memory system which contains independent, interleaved banks. We compare the performance of two different control structures for an interleaved memory: bank buffering and address stream lookahead. We show that a centralized lookahead memory controller provides equivalent memory performance to a bank buffering system while simplifying implementation.
compiler construction | 1982
Peter L. Bird
This paper discusses an implementation of Glanvilles code generator generator for producing a code generator for a production Pascal compiler on an Amdahl 470. We successfully replaced the hand written code generator of an existing compiler with one which was produced automatically from a formal specification. This paper first outlines Glanvilles original scheme, then describes extensions which were necessary for generating code for a production compiler.
conference on high performance computing (supercomputing) | 1991
Peter L. Bird; Richard Uhlig
No abstract available
conference on high performance computing (supercomputing) | 1991
Peter L. Bird; Uwe F. Pleban
No abstract available
Computer Physics Communications | 1985
Peter L. Bird
Abstract The topological sort can be used for the rapid detection of parallelism in sequential programs. Using this algorithm, one can detect both intrablock and interblock parallelism. The algorithm requires only information normally collected by an optimizing compiler.
Sensors and Actuators A-physical | 2001
K.L. Kraver; Matthew R. Guthaus; Timothy D. Strong; Peter L. Bird; Geun Sig Cha; Wolfgang Höld; Richard B. Brown
Archive | 1997
I-Cheng K. Chen; Peter L. Bird; Trevor N. Mudge