Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Patrick McCarthy is active.

Publication


Featured researches published by Patrick McCarthy.


conference on high performance computing (supercomputing) | 2006

Designing a highly-scalable operating system: the Blue Gene/L story

José E. Moreira; Michael Brutman; José G. Castaños; Thomas Eugene Engelsiepen; Mark E. Giampapa; Tom Gooding; Roger L. Haskin; Todd Inglett; Derek Lieber; Patrick McCarthy; Michael Mundy; Jeffrey J. Parker; Brian Paul Wallenfelt

Blue Gene/L, is currently the worlds fastest and most scalable supercomputer. It has demonstrated essentially linear scaling all the way to 131,072 processors in several benchmarks and real applications. The operating systems for the compute and I/O nodes of Blue Gene/L are among the components responsible for that scalability. Compute nodes are dedicated to running application processes, whereas I/O nodes are dedicated to performing system functions. The operating systems adopted for each of these nodes reflect this separation of junction. Compute nodes run a lightweight operating system called the compute node kernel. I/O nodes run a port of the Linux operating system. This paper discusses the architecture and design of this solution for Blue Gene/L in the context of the hardware characteristics that led to the design decisions. It also explains and demonstrates how those decisions are instrumental in achieving the performance and scalability for which Blue Gene/L is famous


Ibm Journal of Research and Development | 2005

Blue Gene/L programming and operating environment

José E. Moreira; George S. Almasi; Charles J. Archer; Ralph Bellofatto; Peter Bergner; José R. Brunheroto; Michael Brutman; José G. Castaños; Paul G. Crumley; Manish Gupta; Todd Inglett; Derek Lieber; David Limpert; Patrick McCarthy; Mark Megerian; Mark P. Mendell; Michael Mundy; Don Reed; Ramendra K. Sahoo; Alda Sanomiya; Richard Shok; Brian E. Smith; Greg Stewart

With up to 65,536 compute nodes and a peak performance of more than 360 teraflops, the Blue Gene®/L (BG/L) supercomputer represents a new level of massively parallel systems. The system software stack for BG/L creates a programming and operating environment that harnesses the raw power of this architecture with great effectiveness. The design and implementation of this environment followed three major principles: simplicity, performance, and familiarity. By specializing the services provided by each component of the system architecture, we were able to keep each one simple and leverage the BG/L hardware features to deliver high performance to applications. We also implemented standard programming interfaces and programming languages that greatly simplified the job of porting applications to BG/L. The effectiveness of our approach has been demonstrated by the operational success of several prototype and production machines, which have already been scaled to 16,384 nodes.


Ibm Journal of Research and Development | 2009

Breaking the petaflops barrier

Don Grice; H. Brandt; C. Wright; Patrick McCarthy; A. Emerich; T. Schimke; Charles J. Archer; J. Carey; Philip J. Sanders; J. A. Fritzjunker; S. Lewis; P. R. Germann

In this paper, we discuss the impact of petascale computing and the major issues to getting beyond a petaflops. We describe IBM approaches to petascale computing but with a major focus on the Los Alamos National Laboratory Roadrunner machine. We provide an overview of the hardware and software structures, focusing on the new triblade compute node architecture and the corresponding data control and synchronization software support to enable high-performance computing applications on this architecture. The fundamental technology drivers and issues for petascale computing and beyond are software complexity, energy efficiency, and system reliability, availability, and serviceability.


conference on high performance computing (supercomputing) | 2006

Design and implementation of a one-sided communication interface for the IBM eServer Blue Gene® supercomputer

Michael Blocksome; Charles J. Archer; Todd Inglett; Patrick McCarthy; Michael Mundy; Joe Ratterman; A. Sidelnik; Brian E. Smith; George S. Almasi; José G. Castaños; Derek Lieber; José E. Moreira; Sriram Krishnamoorthy; Vinod Tipparaju; Jarek Nieplocha

This paper discusses the design and implementation of a one-sided communication interface for the IBM Blue Gene/L supercomputer. This interface facilitates ARMCI and the Global Arrays toolkit and can be used by other one-sided communication libraries. New protocols, interrupt driven communication, and compute node kernel enhancements were required to enable these libraries. Three possible methods for enabling ARMCI on the Blue Gene/L software stack are discussed. A detailed look into the development process shows how the implementation of the one-sided communication interface was completed. This was accomplished on a compressed time scale with the collaboration of various organizations within IBM and open source communities. In addition to enabling the one-sided libraries, bandwidth enhancements were made for communication along a diagonal on the Blue Gene/L torus network. The maximum bandwidth improved by a factor of three. This work will enable a variety of one-sided applications to run on Blue Gene/L


international parallel and distributed processing symposium | 2008

Asynchronous task dispatch for high throughput computing for the eServer IBM Blue Gene® Supercomputer

Amanda Peters; Alan King; Tom Budnik; Patrick McCarthy; Paul Michaud; Mike Mundy; James C. Sexton; Greg Stewart

High Throughput Computing (HTC) environments strive to provide large amounts of processing capacity to customers over long periods of time by exploiting existing resources on the network according to Basney and Livny [1]. A single Blue Gene/L rack can provide thousands of CPU resources into HTC environments. This paper discusses the implementation of an asynchronous task dispatch system that exploits a recently released feature of the Blue Gene/L control system - called HTC mode - and presents data on experimental runs consisting of the asynchronous submission of multiple batches of thousands of tasks for financial workloads. The methodology developed here demonstrates how systems with very large processor counts and light-weight kernels can be configured to deliver capacity computing at the individual processor level in future petascale computing systems.


conference on high performance computing (supercomputing) | 2006

Design and Implementation of a One-Sided Communication Interface for the IBM eServer Blue Gene

Michael Blocksome; Charles J. Archer; Todd Inglett; Patrick McCarthy; Michael Mundy; Joseph D. Ratterman; A. Sidelnik; Brian E. Smith; George S. Almasi; José G. Castaños; Derek Lieber; José E. Moreira; Sriram Krishnamoorthy; Vinod Tipparaju; Jaroslaw Nieplocha

This paper discusses the design and implementation of a one-sided communication interface for the IBM Blue Gene/L supercomputer. This interface facilitates ARMCI and the Global Arrays toolkit and can be used by other one-sided communication libraries. New protocols, interrupt driven communication, and compute node kernel enhancements were required to enable these libraries. Three possible methods for enabling ARMCI on the Blue Gene/L software stack are discussed. A detailed look into the development process shows how the implementation of the one-sided communication interface was completed. This was accomplished on a compressed time scale with the collaboration of various organizations within IBM and open source communities. In addition to enabling the one-sided libraries, bandwidth enhancements were made for communication along a diagonal on the Blue Gene/L torus network. The maximum bandwidth improved by a factor of three. This work will enable a variety of one-sided applications to run on Blue Gene/L


Archive | 2007

Dynamically rerouting node traffic on a massively parallel computer system using hint bits

David L. Darrington; Patrick McCarthy; Amanda Peters; Albert Sidelnik; Brian E. Smith; Brent Allen Swartz


Archive | 2007

Identifying Messaging Completion on a Parallel Computer

Charles J. Archer; Camesha R. Hardwick; Patrick McCarthy; Brian Paul Wallenfelt


Archive | 2006

Fast interrupt disabling and processing in a parallel computing environment

Charles J. Archer; Michael Alan Blocksome; Todd Inglett; Derek Lieber; Patrick McCarthy; Michael Mundy; Jeffrey J. Parker; Joseph D. Ratterman; Brian E. Smith


Archive | 2006

EFFICIENT DEFERRED INTERRUPT HANDLING IN A PARALLEL COMPUTING ENVIRONMENT

Charles J. Archer; Michael Alan Blocksome; Todd Inglett; Derek Lieber; Patrick McCarthy; Michael Mundy; Jeffrey J. Parker; Joseph D. Ratterman; Brian E. Smith

Collaboration


Dive into the Patrick McCarthy's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Todd Inglett

University of Rochester

View shared research outputs
Researchain Logo
Decentralizing Knowledge