Jack Poulson
University of Texas at Austin
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jack Poulson.
ACM Transactions on Mathematical Software | 2013
Jack Poulson; Bryan Marker; Robert A. van de Geijn; Jeff R. Hammond; Nichols A. Romero
Parallelizing dense matrix computations to distributed memory architectures is a well-studied subject and generally considered to be among the best understood domains of parallel computing. Two packages, developed in the mid 1990s, still enjoy regular use: ScaLAPACK and PLAPACK. With the advent of many-core architectures, which may very well take the shape of distributed memory architectures within a single processor, these packages must be revisited since the traditional MPI-based approaches will likely need to be extended. Thus, this is a good time to review lessons learned since the introduction of these two packages and to propose a simple yet effective alternative. Preliminary performance results show the new solution achieves competitive, if not superior, performance on large clusters.
SIAM Journal on Scientific Computing | 2013
Jack Poulson; Björn Engquist; Siwei Li; Lexing Ying
A parallelization of a sweeping preconditioner for three-dimensional Helmholtz equations without large cavities is introduced and benchmarked for several challenging velocity models. The setup and application costs of the sequential preconditioner are shown to be
ieee international conference on high performance computing data and analytics | 2012
Bryan Marker; Jack Poulson; Don S. Batory; Robert A. van de Geijn
O(\gamma^2 N^{4/3})
SIAM Journal on Scientific Computing | 2014
Jack Poulson; Laurent Demanet; Nicholas Maxwell; Lexing Ying
and
Concurrency and Computation: Practice and Experience | 2012
Bryan Marker; Ernie Chan; Jack Poulson; Robert A. van de Geijn; Rob F. Van der Wijngaart; Timothy G. Mattson; Theodore E. Kubaska
O(\gamma N \log N)
SIAM Journal on Scientific Computing | 2016
Martin D. Schatz; Robert A. van de Geijn; Jack Poulson
, where
SIAM Journal on Scientific Computing | 2014
Austin R. Benson; Jack Poulson; Kenneth Tran; Björn Engquist; Lexing Ying
\gamma(\omega)
acm sigplan symposium on principles and practice of parallel programming | 2012
Bryan Marker; Andy Terrel; Jack Poulson; Don S. Batory; Robert A. van de Geijn
denotes the modestly frequency-dependent number of grid points per perfectly matched layer. Several computational and memory improvements are introduced relative to using black-box sparse-direct solvers for the auxiliary problems, and competitive runtimes and iteration counts are reported for high-frequency problems distributed over thousands of cores. Two open-source packages are released along with this paper: Parallel Sweeping Preconditioner (PSP) and the underlying distributed multifrontal solver, Clique.
Seg Technical Program Expanded Abstracts | 2011
Björn Engquist; Jack Poulson; Lexing Ying
To implement dense linear algebra algorithms for distributed-memory computers, an expert applies knowledge of the domain, the target architecture, and how to parallelize common operations. This is often a rote process that becomes tedious for a large collection of algorithms. We have developed a way to encode this expert knowledge such that it can be applied by a system to generate mechanically the same (and sometimes better) highly-optimized code that an expert creates by hand. This paper illustrates how we have encoded a subset of this knowledge and how our system applies it and searches a space of generated implementations automatically.
Mathematical Modelling and Numerical Analysis | 2014
Paul Tsuji; Jack Poulson; Björn Engquist; Lexing Ying
The butterfly algorithm is a fast algorithm which approximately evaluates a discrete analogue of the integral transform