Michael J. Lijewski
Lawrence Berkeley National Laboratory
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Michael J. Lijewski.
Journal of Physics: Conference Series | 2007
Phillip Colella; John B. Bell; Noel Keen; Terry J. Ligocki; Michael J. Lijewski; Brian Van Straalen
In this paper, we discuss some of the issues in obtaining high performance for block-structured adaptive mesh refinement software for partial differential equations. We show examples in which AMR scales to thousands of processors. We also discuss a number of metrics for performance and scalability that can provide a basis for understanding the advantages and disadvantages of this approach.
computing frontiers | 2007
Shoaib Kamil; Ali Pinar; Daniel K. Gunter; Michael J. Lijewski; Leonid Oliker; John Shalf
As we enter the era of peta-scale computing, system architects must plan for machines composed of tens or even hundreds of thousands of processors. Although fully connected networks such as fat-tree configurations currently dominate HPC interconnect designs, such approaches are inadequate for ultra-scale concurrencies due to the superlinear growth of component costs. Traditional low-degree interconnect topologies, such as 3D tori, have reemerged as a competitive solution due to the linear scaling of system components relative to the node count; however, such networks are poorly suited for the requirements of many scientific applications at extreme concurrencies. To address these limitations, we propose HFAST, a hybrid switch architecture that uses circuit switches to dynamically reconfigure lower-degree interconnects to suit the topological requirements of a given scientific application. This work presents several new research contributions. We develop an optimization strategy for HFAST mappings and demonstrate that efficiency gains can be attained across a broad range of static numerical computations. Additionally, we conduct an extensive analysis of the communication characteristics of a dynamically adapting mesh calculation and show that the HFAST approach can achieve significant advantages, even when compared with traditional fat-tree configurations. Overall results point to the promising potential of utilizing hybrid reconfigurable networks to interconnect future peta-scale architectures, for both static and dynamically adapting applications.
Journal of Parallel and Distributed Computing | 2014
Anshu Dubey; Ann S. Almgren; John B. Bell; Martin Berzins; Steven R. Brandt; Greg L. Bryan; Phillip Colella; Daniel T. Graves; Michael J. Lijewski; Frank Löffler; Brian W. O'Shea; Brian Van Straalen; Klaus Weide
Over the last decade block-structured adaptive mesh refinement (SAMR) has found increasing use in large, publicly available codes and frameworks. SAMR frameworks have evolved along different paths. Some have stayed focused on specific domain areas, others have pursued a more general functionality, providing the building blocks for a larger variety of applications. In this survey paper we examine a representative set of SAMR packages and SAMR-based codes that have been in existence for half a decade or more, have a reasonably sized and active user base outside of their home institutions, and are publicly available. The set consists of a mix of SAMR packages and application codes that cover a broad range of scientific domains. We look at their high-level frameworks, their design trade-offs and their approach to dealing with the advent of radical changes in hardware architecture. The codes included in this survey are BoxLib, Cactus, Chombo, Enzo, FLASH, and Uintah. A survey of mature openly available state-of-the-art structured AMR libraries and codes.Discussion of their frameworks, challenges and design trade-offs.Directions being pursued by the codes to prepare for the future many-core and heterogeneous platforms.
Computational Geosciences | 2012
George Shu Heng Pau; John B. Bell; Ann S. Almgren; Kirsten Fagnan; Michael J. Lijewski
We describe a second-order accurate sequential algorithm for solving two-phase multicomponent flow in porous media. The algorithm incorporates an unsplit second-order Godunov scheme that provides accurate resolution of sharp fronts. The method is implemented within a block structured adaptive mesh refinement (AMR) framework that allows grids to dynamically adapt to features of the flow and enables efficient parallelization of the algorithm. We demonstrate the second-order convergence rate of the algorithm and the accuracy of the AMR solutions compared to uniform fine-grid solutions. The algorithm is then used to simulate the leakage of gas from a Liquified Petroleum Gas (LPG) storage cavern, demonstrating its capability to capture complex behavior of the resulting flow. We further examine differences resulting from using different relative permeability functions.
Other Information: PBD: 13 May 1998 | 1998
Marcus S. Day; Phillip Colella; Michael J. Lijewski; Charles A. Rendleman; Daniel L. Marcus
Author(s): Day, Marcus S.; Colella, Phillip; Lijewski, Michael J.; Rendleman, Charles A.; Marcus, Daniel L.
international supercomputing conference | 2013
Cy P. Chan; Didem Unat; Michael J. Lijewski; Weiqun Zhang; John B. Bell; John Shalf
The design of hardware for next-generation exascale computing systems will require a deep understanding of how software optimizations impact hardware design trade-offs. In order to characterize how co-tuning hardware and software parameters affects the performance of combustion simulation codes, we created ExaSAT, a compiler-driven static analysis and performance modeling framework. Our framework can evaluate hundreds of hardware/software configurations in seconds, providing an essential speed advantage over simulators and dynamic analysis techniques during the co-design process. Our analytic performance model shows that advanced code transformations, such as cache blocking and loop fusion, can have a significant impact on choices for cache and memory architecture. Our modeling helped us identify tuned configurations that achieve a 90% reduction in memory traffic, which could significantly improve performance and reduce energy consumption. These techniques will also be useful for the development of advanced programming models and runtimes, which must reason about these optimizations to deliver better performance and energy efficiency.
international parallel and distributed processing symposium | 2014
Samuel Williams; Michael J. Lijewski; Ann S. Almgren; Brian Van Straalen; Erin Carson; Nicholas Knight; James Demmel
Geometric multigrid solvers within adaptive mesh refinement (AMR) applications often reach a point where further coarsening of the grid becomes impractical as individual sub domain sizes approach unity. At this point the most common solution is to use a bottom solver, such as BiCGStab, to reduce the residual by a fixed factor at the coarsest level. Each iteration of BiCGStab requires multiple global reductions (MPI collectives). As the number of BiCGStab iterations required for convergence grows with problem size, and the time for each collective operation increases with machine scale, bottom solves in large-scale applications can constitute a significant fraction of the overall multigrid solve time. In this paper, we implement, evaluate, and optimize a communication-avoiding s-step formulation of BiCGStab (CABiCGStab for short) as a high-performance, distributed-memory bottom solver for geometric multigrid solvers. This is the first time s-step Krylov subspace methods have been leveraged to improve multigrid bottom solver performance. We use a synthetic benchmark for detailed analysis and integrate the best implementation into BoxLib in order to evaluate the benefit of a s-step Krylov subspace method on the multigrid solves found in the applications LMC and Nyx on up to 32,768 cores on the Cray XE6 at NERSC. Overall, we see bottom solver improvements of up to 4.2x on synthetic problems and up to 2.7x in real applications. This results in as much as a 1.5x improvement in solver performance in real applications.
Lawrence Berkeley National Laboratory | 2006
John B. Bell; Marcus S. Day; Ann S. Almgren; Michael J. Lijewski; Charles A. Rendleman; Robert K. Cheng; Ian G. Shepherd
There is considerable technological interest in developingnew fuel-flexible combustion systems that can burn fuels such ashydrogenor syngas. Lean premixed systems have the potential to burn thesetypes of fuels with high efficiency and low NOx emissions due to reducedburnt gas temperatures. Although traditional scientific approaches basedon theory and laboratory experiment have played essential roles indeveloping our current understanding of premixed combustion, they areunable to meet the challenges of designing fuel-flexible lean premixedcombustion devices. Computation, with itsability to deal with complexityand its unlimited access to data, hasthe potential for addressing thesechallenges. Realizing this potential requires the ability to perform highfidelity simulations of turbulent lean premixed flames under realisticconditions. In this paper, we examine the specialized mathematicalstructure of these combustion problems and discuss simulation approachesthat exploit this structure. Using these ideas we can dramatically reducecomputational cost, making it possible to perform high-fidelitysimulations of realistic flames. We illustrate this methodology byconsidering ultra-lean hydrogen flames and discuss how this type ofsimulation is changing the way researchers study combustion.
Advances in Water Resources | 2010
George Shu Heng Pau; John B. Bell; Karsten Pruess; Ann S. Almgren; Michael J. Lijewski; Keni Zhang
Journal Name: Proceedings of the Combustion Institute; Journal Volume: 31; Journal Issue: 1; Related Information: Journal Publication Date: 01/2007 | 2007
John B. Bell; Marcus S. Day; Joseph F. Grcar; Michael J. Lijewski; James F. Driscoll; Sergei Filatyev