Robert C. Zak
Sun Microsystems
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Robert C. Zak.
Journal of Parallel and Distributed Computing | 1996
Charles E. Leiserson; Zahi S. Abuhamdeh; David C. Douglas; Carl R. Feynman; Mahesh N. Ganmukhi; Jeffrey V. Hill; W. Daniel Hillis; Bradley C. Kuszmaul; Margaret A. St. Pierre; David S. Wells; Monica C. Wong-Chan; Shaw-Wen Yang; Robert C. Zak
The Connection Machine Model CM-5 Supercomputer is a massively parallel computer system designed to offer performance in the range of 1 teraflops (1012floating-point operations per second). The CM-5 obtains its high performance while offering ease of programming, flexibility, and reliability. The machine contains three communication networks: a data network, a control network, and a diagnostic network. This paper describes the organization of these three networks and how they contribute to the design goals of the CM-5.
acm symposium on parallel algorithms and architectures | 1992
Charles E. Leiserson; Zahi S. Abuhamdeh; David C. Douglas; Carl R. Feynman; Mahesh N. Ganmukhi; Jeffrey V. Hill; W. Daniel Hillis; Bradley C. Kuszmaul; Margaret A. St. Pierre; David S. Wells; Monica C. Wong; Shaw-Wen Yang; Robert C. Zak
The Connection Machine Model CM-5 Supercomputer is a massively parallel computer system designed to offer performance in the range of 1 teraflops (1012 floating-point operations per second). The CM-5 obtains its high performance while offering ease of programming, flexibility, and reliability. The machine contains three communication networks: a data network, a control network, and a diagnostic network. This paper describes the organization of these three networks and how they contribute to the design goals of the CM-5.
conference on high performance computing (supercomputing) | 2002
Lisa Noordergraaf; Robert C. Zak
The system interconnect is often the performance bottleneck in SMP computers. Although modern SMPs include event counters on processors and interconnects, these provide limited information about the interaction of processors vying for shared resources. Additionally, transaction sources and addresses are not readily available, making analysis of access patterns and data locality difficult. Enhanced system interconnect instrumentation is required to extract this information. This paper describes instrumentation implemented for monitoring the system interconnect on Sun Fire™ servers. The instrumentation supports sophisticated programmable filtering of event counters, allowing us to construct histograms of system interconnect activity, and a FIFO to capture trace sequences. Our implementation results in a very small hardware footprint, making it appropriate for inclusion in commodity hardware. We also describe a sampling of software tools and results based on this infrastructure. Applications have included performance profiling, architectural studies, and hardware bringup and debugging.
international conference on computer design | 1992
Robert C. Zak; Jeffrey V. Hill
A testability architecture for VLSI devices which is IEEE 1149.1 compliant and includes extensions for partitionable internal scan chains is described. The architecture includes a fully synchronous scan cell library and an explicit synchronization barrier between test clock synchronous logic and system clock synchronous logic. The explicit synchronization barrier guarantees coherent sampling of device state. These techniques result in VLSI designs that are amenable to static timing analysis and are extensible to at-speed built-in-self-test (BIST). Results for five VLSI devices used in the CM5, a massively parallel supercomputer, show low logic overhead (8-20%) and high in-system stuck-at fault coverage (>99.5%).<<ETX>>
international conference on computer design | 1988
Stephen A. Ward; Robert C. Zak
Static-column dynamic RAMs (random-access memories) offer fast access to successive locations within a single row, a fact which has been used in their implementation in fast cacheless memory systems. Such caches are necessarily nonassociative (direct-mapped), limiting their performance relative to set-associative caches of similar total capacity. The authors describe an architecture for dynamic RAM chips which circumvents this limitation by providing several alternative static row buffers on each chip. Cacheless memory systems utilizing these devices are able to achieve the performance characteristics of relatively expensive set-associative cached memories using only economical high-density RAM parts. Simulation results on set-associate dynamic RAMs (SADRAMs) are presented and some plausible roles for SADRAMs in various architectural contexts are noted.<<ETX>>
Archive | 1995
Erik Hagersten; Robert C. Zak
Archive | 2001
Christopher J. Jackson; Robert C. Zak
Archive | 1996
Jon P. Wade; Daniel R. Cassiday; Robert D. Lordi; Guy L. Steele; Margaret A. St. Pierre; Monica C. Wong-Chan; Zahi S. Abuhamdeh; David C. Douglas; Mahesh N. Ganmukhi; Jeffrey V. Hill; W. Daniel Hillis; Scott J. Smith; Shaw-Wen Yang; Robert C. Zak
Archive | 1991
David C. Douglas; Mahesh N. Ganmukhi; Jeffrey V. Hill; W. Daniel Hillis; Bradley C. Kuszmaul; Charles E. Leiserson; David S. Wells; Monica C. Wong; Shaw-Wen Yang; Robert C. Zak
Archive | 1993
Robert C. Zak; Charles E. Leiserson; Bradley C. Kuzmaul; Shaw-Wen Yang; W. Daniel Hillis; David C. Douglas; David Potter