Thomas Sterling
Indiana University
Publications
Featured research published by Thomas Sterling.
International Conference on Exascale Applications and Software | 2014
Thomas Sterling; Matthew Anderson; P. Kevin Bohan; Maciej Brodowicz; Abhishek Kulkarni; Bo Zhang
Achieving the performance potential of an Exascale machine depends on realizing both operational efficiency and scalability in high performance computing applications. This requirement has motivated the emergence of several new programming models which emphasize fine- and medium-grain task parallelism in order to address the aggravating effects of asynchrony at scale. The performance modeling of Exascale systems for these programming models requires the development of fundamentally new approaches due to the demands of both scale and complexity. This work presents a performance modeling case study of the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) proxy application where the performance modeling approach has been incorporated directly into a runtime system with two modalities of operation: computation and performance modeling simulation. The runtime system exposes performance sensitivities and projects operation to larger scales while also realizing the benefits of removing global barriers and extracting more parallelism from LULESH. Comparisons between the computation and performance modeling simulation results are presented.
International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems | 2013
Matthew Anderson; Maciej Brodowicz; Abhishek Kulkarni; Thomas Sterling
Conventional programming practices on multicore processors in high performance computing architectures are not universally effective in terms of efficiency and scalability for many algorithms in scientific computing. One possible solution for improving efficiency and scalability in applications on this class of machines is the use of a many-tasking runtime system employing many lightweight, concurrent threads. Yet a priori estimation of the potential performance and scalability impact of such runtime systems on existing applications developed around the bulk synchronous parallel (BSP) model is not well understood. In this work, we present a case study of a BSP particle-in-cell benchmark code which has been ported to a many-tasking runtime system. The 3-D Gyrokinetic Toroidal code (GTC) is examined in its original MPI form and compared with a port to the High Performance ParalleX 3 (HPX-3) runtime system. Phase overlap, oversubscription behavior, and work rebalancing in the implementation are explored. Results for GTC using the SST/macro simulator complement the implementation results. Finally, an analytic performance model for GTC is presented in order to guide future implementation efforts.
Computing in Science and Engineering | 2013
Steven Gottlieb; Thomas Sterling
The guest editors discuss some recent advances in exascale computing, as well as remaining issues.
Proceedings of The International Symposium on Grids and Clouds (ISGC) 2012 — PoS(ISGC 2012) | 2012
Thomas Sterling; Matthew Anderson
The application of emergent Clouds to the domain of high performance computing is considered by examining the various operational modalities comprising the field of supercomputing and by analyzing their suitability to Clouds based on underlying factors of performance degradation. It is found that while throughput computing may be readily supported for such HPC workflows as parameter sweeps, capability computing and even weak-scaled “cooperative” computing may not be well served using conventional practices. But the possible advance of revolutionary methods to manage asynchrony, exploit message-driven computing techniques, and employ declarative synchronization semantic constructs such as those found in the experimental ParalleX execution model may provide an alternative paradigm for bringing Clouds more closely aligned to Science, Technology, Engineering, and Mathematics (STEM) applications. Experimental results capturing an Adaptive Mesh Refinement (AMR) application in numerical relativity using the ParalleX-based HPX-3 runtime system demonstrate many of the required properties for HPC Clouds.
Archive | 2018
Thomas Sterling; Matthew Anderson; Maciej Brodowicz
This textbook presents concepts, knowledge, and skills to provide an introductory foundation for high performance computing technology, systems, and applications. It also includes vignettes on contributors and milestone accomplishments that constitute the mainstream culture of the supercomputing field and its evolution. But due to space constraints and the limitations in time of a one-semester course, not everything that might be discussed has been incorporated. This final chapter is intended to provide a sense of the expanse of the field beyond the elements already discussed. It also takes the liberty of giving an intuition of where this most rapidly changing of fields may go in the next few years.
Archive | 2018
Thomas Sterling; Matthew Anderson; Maciej Brodowicz
Symmetric multiprocessing is the most widespread class of shared-memory compute nodes. While nodes of this type can be used as self-contained computers, they serve more frequently as a building block of larger systems such as clusters. This chapter discusses typical components of a symmetric multiprocessing node and their functions, parameters, and associated interfaces. An updated version of Amdahl's law that takes overhead into account is introduced, along with other formulae determining peak computational performance of a node and the impact of memory hierarchy on a metric known as “cycles per instruction”. Finally, a number of commonly used industry-standard interfaces are discussed that permit the attachment of additional peripheral devices and expansion boards, the performing of input/output functions, and the inspection of a node's internal state.
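The overhead-aware speedup formula referenced in this chapter can be illustrated with a short sketch. The C fragment below assumes a simple model (not necessarily the chapter's exact formulation) in which total time is the serial portion plus the parallelized portion plus a per-processor overhead; the values of T0, f, and v are placeholders chosen for illustration only.

#include <stdio.h>

/* Sketch of an overhead-aware Amdahl speedup model (assumed form):
 *   T(g) = T0*(1 - f) + T0*f/g + g*v
 * where T0 is single-processor time, f the parallelizable fraction,
 * g the processor count, and v an assumed per-processor overhead. */
static double speedup(double t0, double f, double v, int g)
{
    double t = t0 * (1.0 - f) + t0 * f / (double)g + (double)g * v;
    return t0 / t;
}

int main(void)
{
    const double t0 = 100.0; /* illustrative baseline time in seconds */
    const double f  = 0.95;  /* illustrative parallelizable fraction  */
    const double v  = 0.01;  /* illustrative per-processor overhead   */
    for (int g = 1; g <= 1024; g *= 4)
        printf("g = %4d  speedup = %6.2f\n", g, speedup(t0, f, v, g));
    return 0;
}

Unlike classic Amdahl scaling, the overhead term in this sketch eventually causes speedup to fall as the processor count grows, which is the qualitative behavior the updated law is meant to capture.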
Archive | 2018
Thomas Sterling; Matthew Anderson; Maciej Brodowicz
Efficient resource management is critical to achieving high computational throughput on supercomputers of all sizes, ranging from small one-rack clusters to the largest installations in the world in dedicated data centers. This needs to be accomplished while accommodating the frequently conflicting requirements of all users of the system by providing a common, flexible, and easy-to-use interface. This chapter discusses various aspects of job scheduling software, describing its place in the system, fundamental components, capabilities, and associated nomenclature. Practical aspects of interaction with resource management systems are addressed by introducing a prospective user to the essential features, properties, commands, and environments of two broadly employed open-source job management suites, SLURM and Portable Batch System, illustrated with many usage examples.
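For readers unfamiliar with such job management suites, a minimal SLURM batch script of the kind introduced in this chapter might look as follows; the partition name and executable are placeholders, and site-specific options will differ.

#!/bin/bash
#SBATCH --job-name=hello_mpi       # name reported by squeue
#SBATCH --nodes=2                  # number of compute nodes requested
#SBATCH --ntasks-per-node=16       # MPI ranks to launch per node
#SBATCH --time=00:10:00            # wall-clock limit (HH:MM:SS)
#SBATCH --partition=general        # placeholder partition/queue name

# Launch the application on all allocated tasks.
srun ./hello_mpi

Such a script would typically be submitted with sbatch, monitored with squeue, and cancelled with scancel; Portable Batch System offers the analogous qsub, qstat, and qdel commands.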
Archive | 2018
Thomas Sterling; Matthew Anderson; Maciej Brodowicz
Computer architecture is the organization of the components making up a computer system and the semantics or meaning of the operations that guide its function. As such, the computer architecture governs the design of a family of computers and defines the logical interface that is targeted by programming languages and their compilers. The organization determines the mix of functional units of which the system is composed and the structure of their interconnectivity. The architecture semantics is the meaning of what the systems do under user direction and how their functional units are controlled to work together. An important embodiment of semantics is the instruction set architecture (ISA) of the system. The ISA is a logical (usually binary) representative encoding of the basic set of distinct operations that a computer architecture may perform, and by which application programs specify the useful work to be done. At the machine level the hardware (sometimes controlled by firmware) directly interprets and executes a sequence or partially ordered set of these basic operations. This is true for all computer cores, from the few in the smallest mobile phones to the potentially millions making up the world's largest supercomputers. High performance computer architecture extends structure to a hierarchy of functional elements, whether small and limited in capability or possibly entire processor cores themselves. In this chapter many different classes of structure are presented, each exploiting concurrency in its own particular way. But in all cases this broader definition of general architecture for high performance computing emphasizes aspects of the system that contribute to achieving performance. A high performance computer is designed to go fast, and its organization and semantics are specially devised to deliver computational speed. This chapter introduces the basic foundations of computer architecture in general and for high performance computer systems in particular. It is here, at the structural and logical levels, that parallelism of operation in its many forms and sizes is first presented. This chapter provides a first examination of the principal forms of supercomputer architecture and the underlying concepts that govern their performance.
Archive | 2018
Thomas Sterling; Matthew Anderson; Maciej Brodowicz
Accelerators, most notably graphics processing units (GPUs), are an increasingly common component of large computing installations due to their high peak performance and competitive power efficiency figures. However, because of the substantially different architecture, their programming requires an intimate knowledge of the internal organization of specific devices, complicating their integration as a system component of equal utility to that of conventional CPUs and reducing code portability. The evolution of GPU programming toolkits and libraries such as Nvidia's Compute Unified Device Architecture has made this task easier, but has not fully eliminated a significant learning curve and barrier to application. In contrast, OpenACC is considered to be one of the easiest accelerator programming environments to master, albeit at the cost of lower computational efficiency for certain problem types. It leverages the same directive-based approach as OpenMP, with which it also shares many keywords and concepts.
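To give a flavor of the directive-based approach described above, the following C sketch offloads a simple vector operation with OpenACC; the compiler invocation and problem size are illustrative, and the chapter's own examples may differ.

/* Minimal OpenACC sketch: offload a vector update to an accelerator.
 * Compile with an OpenACC-capable compiler, e.g. "nvc -acc triad.c". */
#include <stdio.h>
#define N 1000000

int main(void)
{
    static float a[N], b[N], c[N];
    for (int i = 0; i < N; i++) { b[i] = 1.0f; c[i] = 2.0f; }

    /* The data region stages arrays on the device; the parallel loop
       directive asks the compiler to generate an accelerator kernel. */
    #pragma acc data copyin(b, c) copyout(a)
    {
        #pragma acc parallel loop
        for (int i = 0; i < N; i++)
            a[i] = b[i] + 2.0f * c[i];
    }

    printf("a[0] = %f\n", a[0]);
    return 0;
}

As with OpenMP, removing the directives leaves a valid serial C program, which is a large part of OpenACC's appeal.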
Archive | 2018
Thomas Sterling; Matthew Anderson; Maciej Brodowicz
The message-passing interface (MPI) is by far the most popular library for use in applications on distributed-memory architectures. It is a standard interface for message-passing calls, and is powerful, flexible, and usable. At the risk of hyperbole, there was probably no greater achievement of practical utility for the advancement of high performance computing (HPC) than the development of MPI. While the application programming interface itself contains hundreds of commands, the HPC practitioner generally only needs a small subset of these to create a wide array of parallel applications. This chapter introduces the key MPI calls most commonly found in HPC applications. It covers point-to-point communication, MPI types, MPI collective operations, and nonblocking point-to-point communication.
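As a brief illustration of the small subset of calls the chapter focuses on, the C program below combines rank and size queries, a point-to-point exchange, and a collective reduction; the file name and launch command are illustrative only.

/* Minimal MPI sketch: each rank exchanges its id with its ring
 * neighbors and the ids are then summed onto rank 0.
 * Build with "mpicc ring.c -o ring"; run with "mpirun -np 4 ./ring". */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv)
{
    int rank, size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* Point-to-point: send to the next rank, receive from the previous. */
    int next = (rank + 1) % size, prev = (rank + size - 1) % size, recvd;
    MPI_Sendrecv(&rank, 1, MPI_INT, next, 0,
                 &recvd, 1, MPI_INT, prev, 0,
                 MPI_COMM_WORLD, MPI_STATUS_IGNORE);

    /* Collective: sum all rank ids onto rank 0. */
    int sum = 0;
    MPI_Reduce(&rank, &sum, 1, MPI_INT, MPI_SUM, 0, MPI_COMM_WORLD);
    if (rank == 0)
        printf("sum of ranks 0..%d = %d\n", size - 1, sum);

    MPI_Finalize();
    return 0;
}

The same exchange could instead use nonblocking MPI_Isend and MPI_Irecv followed by MPI_Wait, the pattern the chapter covers for overlapping communication with computation.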