Kristan D. Davis
IBM
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Kristan D. Davis.
international test conference | 2014
Chen-Yong Cher; K. Paul Muller; Ruud A. Haring; David L. Satterfield; Thomas E. Musta; Thomas M. Gooding; Kristan D. Davis; Marc Boris Dombrowa; Gerard V. Kopcsay; Robert M. Senger; Yutaka Sugawara; Krishnan Sugavanam
Fault injection through accelerated irradiation is an effective way to evaluate the overall soft error resiliency of microprocessors. In this work, we report on irradiation experiments on a Blue Gene/Q (BG/Q) compute processor chip running selected applications. Blue Gene/Q is the third generation of IBMs massively parallel, energy efficient Blue Gene series of supercomputers. In the experiments, we found 69 code fails. Out of these, 26 code fails are relevant for the calculation of the mean-time-between-failures (MTBF) for a 20 PetaFLOP, 96 rack system running a comparable workload mix. The expected MTBF for check-stops due to cosmic radiation and alpha particles from chip packaging materials is calculated to be 51 days for sea-level at New York City running the application mix studied. If the most vulnerable application is run exclusively, the projected MTBF is 35 days. These are outstanding results for a machine of this magnitude. The beaming experiment and projected MTBF validate the necessity to include autonomous hardware detection and recovery at the cost of design effort, silicon area and power.
asia and south pacific design automation conference | 2014
Chen-Yong Cher; K. Paul Muller; Ruud A. Haring; David L. Satterfield; Thomas E. Musta; Thomas M. Gooding; Kristan D. Davis; Marc Boris Dombrowa; Gerard V. Kopcsay; Robert M. Senger; Yutaka Sugawara; Krishnan Sugavanam
Soft Error Resiliency (SER) is a major concern for Petascale high performance computing (HPC) systems. In designing Blue Gene/Q (BG/Q) [8], many mechanisms were deployed to target SER including extensive use of Silicon-On-Insulator (SOI), radiation-hardened latches [7,13], detection and correction in on-chip arrays, and very low radiation packaging materials. On the other hand, it is well known that application behavior has major impacts on the masking (or “derating” factor) in system SER calculations. The principal goal of this project is to understand the interaction between BG/Q hardware and high-performance applications when it comes to SER by performing and evaluating a chip irradiation experiment.
Archive | 2007
Kristan D. Davis; Thomas Leo Haze; Andrew T. Koch; Kyle A. Lucke; Martin John Thompson
Archive | 2013
Lynn A. Boger; James E. Carey; Kristan D. Davis; Philip J. Sanders
Archive | 2013
Lynn A. Boger; James E. Carey; Kristan D. Davis; Philip J. Sanders
Archive | 2013
Lynn A. Boger; James E. Carey; Kristan D. Davis; Philip J. Sanders
Archive | 2010
Kristan D. Davis; Kahn C. Evans; Alan Gara; David L. Satterfield
Archive | 2013
Kristan D. Davis; Daniel A. Faraj
Archive | 2013
Kristan D. Davis; Daniel A. Faraj
Archive | 2013
Kristan D. Davis; Daniel A. Faraj