Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ben J. Nathanson is active.

Publication


Featured researches published by Ben J. Nathanson.


Ibm Journal of Research and Development | 2005

Blue Gene/L advanced diagnostics environment

Mark E. Giampapa; Ralph Bellofatto; Matthias A. Blumrich; Dong Chen; Marc Boris Dombrowa; Alan Gara; Ruud A. Haring; Philip Heidelberger; Dirk Hoenicke; Gerard V. Kopcsay; Ben J. Nathanson; Burkhard Steinmacher-Burow; Martin Ohmacht; Valentina Salapura; Pavlos M. Vranas

This paper describes the Blue Gene®/L advanced diagnostics environment (ADE) used throughout all aspects of the Blue Gene/L project, including design, logic verification, bring-up, diagnostics, and manufacturing test. The Blue Gene/L ADE consists of a lightweight multithreaded coherence-managed kernel, runtime libraries, device drivers, system programming interfaces, compilers, and host-based development tools. It provides complete and flexible access to all features of the Blue Gene/L hardware. Prior to the existence of hardware, ADE was used on Very high-speed integrated circuit Hardware Description Language (VHDL) models, not only for logic verification, but also for performance measurements, code-path analysis, and evaluation of architectural tradeoffs. During early hardware bring-up, the ability to run in a cycle-reproducible manner on both hardware and VHDL proved invaluable in fault isolation and analysis. However, ADE is also capable of supporting high-performance applications and parallel test cases, thereby permitting us to stress the hardware to the limits of its capabilities. This paper also provides insights into system-level and device-level programming of Blue Gene/L to assist developers of high-performance applications o more fully exploit the performance of the machine.


Ibm Journal of Research and Development | 2005

Blue Gene/L compute chip: memory and Ethernet subsystem

Martin Ohmacht; Reinaldo A. Bergamaschi; Subhrajit Bhattacharya; Alan Gara; Mark E. Giampapa; Balaji Gopalsamy; Ruud A. Haring; Dirk Hoenicke; David John Krolak; James A. Marcella; Ben J. Nathanson; Valentina Salapura; Michael E. Wazlowski

The Blue Gene®/L compute chip is a dual-processor system-on-a-chip capable of delivering an arithmetic peak performance of 5.6 gigaflops. To match the memory speed to the high compute performance, the system implements an aggressive three-level on-chip cache hierarchy. The implemented hierarchy offers high bandwidth and integrated prefetching on cache hierarchy levels 2 and 3 (L2 and L3) to reduce memory access time. A Gigabit Ethernet interface driven by direct memory access (DMA) is integrated in the cache hierarchy, requiring only an external physical link layer chip to connect to the media. The integrated L3 cache stores a total of 4 MB of data, using multibank embedded dynamic random access memory (DRAM). The 1,024-bit-wide data port of the embedded DRAM provides 22.4 GB/s bandwidth to serve the speculative prefetching demands of the two processor cores and the Gigabit Ethernet DMA engine. To reduce hardware overhead due to cache coherence intervention requests, memory coherence is maintained by software. This is particularly efficient for regular highly parallel applications with partitionable working sets. The system further integrates an on-chip double-data-rate (DDR) DRAM controller for direct attachment of main memory modules to optimize overall memory performance and cost. For booting the system and low-latency interprocessor communication and synchronization, a 16-KB static random access memory (SRAM) and hardware locks have been added to the design.


Ibm Journal of Research and Development | 2005

Verification strategy for the Blue Gene/L chip

Michael E. Wazlowski; Narasimha R. Adiga; Daniel K. Beece; Ralph Bellofatto; Matthias A. Blumrich; Dong Chen; Marc Boris Dombrowa; Alan Gara; Mark E. Giampapa; Ruud A. Haring; Philip Heidelberger; Dirk Hoenicke; Ben J. Nathanson; Martin Ohmacht; R. Sharrar; Sarabjeet Singh; Burkhard Steinmacher-Burow; Robert B. Tremaine; Mickey Tsao; A. R. Umamaheshwaran; Pavlos M. Vranas

The Blue Gene®/L compute chip contains two PowerPC® 440 processor cores, private L2 prefetch caches, a shared L3 cache and double-data-rate synchronous dynamic random access memory (DDR SDRAM) memory controller, a collective network interface, a torus network interface, a physical network interface, an interrupt controller, and a bridge interface to slower devices. System-on-a-chip verification problems require a multilevel verification strategy in which the strengths of each layer offset the weaknesses of another layer. The verification strategy we adopted relies on the combined strengths of random simulation, directed simulation, and code-driven simulation at the unit and system levels. The strengths and weaknesses of the various techniques and our reasons for choosing them are discussed. The verification platform is based on event simulation and cycle simulation running on a farm of Intel-processor-based machines, several PowerPC-processor-based machines, and the internally developed hardware accelerator Awan. The cost/performance tradeoffs of the different platforms are analyzed. The success of the first Blue Gene/L nodes, which worked within days of receiving them and had only a small number of undetected bugs (none fatal), reflects both careful design and a comprehensive verification strategy.


Ibm Journal of Research and Development | 2013

IBM Blue Gene/Q memory subsystem with speculative execution and transactional memory

Martin Ohmacht; Amy Wang; Thomas Gooding; Ben J. Nathanson; Indira Nair; Geert Janssen; Marcel Schaal; Burkhard Steinmacher-Burow

The memory subsystem of the IBM Blue Gene®/Q Compute chip features multi-versioning and access conflict detection. Its ordered and unordered transaction modes implement both speculative execution (SE) and transactional memory (TM). Blue Gene/Qs large shared second-level cache serves as storage for speculative versions, allowing up to 30 MB of speculative state for the 64 threads of a Blue Gene/Q node, which in the extreme can be associated with a single large transaction. Using the shared access to speculative data, the SE model implements forwarding, allowing data produced by one thread to be accessed by another thread while both are still speculative. This paper presents an overview of Blue Gene/Qs approach to TM and SE: the memory subsystem hardware and operating system extensions, IBM XL compiler support via OpenMP® extensions, and a cost estimation model for executing code speculatively. The model is validated using synthetic benchmarks.


Archive | 1994

System for independently transferring data using two independently controlled DMA engines coupled between a FIFO buffer and two separate buses respectively

Carl A. Bender; Gerard M. Salem; Richard A. Swetz; Singpui Zee; Ben J. Nathanson


Archive | 2002

Data capture technique for high speed signaling

Wayne Melvin Barrett; Dong Chen; Paul W. Coteus; Alan Gara; Rory Dana Jackson; Gerard V. Kopcsay; Ben J. Nathanson; Pavlos M. Vranas


Archive | 1989

Method and apparatus for implementing inter-processor interrupts using shared memory storage in a multi-processor computer system

Gordon W. Braudaway; Ben J. Nathanson


Archive | 1995

Flexible bridge between processor channel and switching mechanism.

Carl A. Bender; Gerard M. Salem; Richard A. Swetz; Singpui Zee; Ben J. Nathanson


Archive | 1995

Message data error detection circuit

Carl A. Bender; Gerard M. Salem; Richard A. Swetz; Singpui Zee; Ben J. Nathanson


Archive | 1990

Inter-processor interrupts in an n-element multi-processor.

Gordon W. Braudaway; Ben J. Nathanson

Researchain Logo
Decentralizing Knowledge