Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Keith M. Campbell is active.

Publication


Featured researches published by Keith M. Campbell.


acm sigplan symposium on principles and practice of parallel programming | 2015

On optimizing machine learning workloads via kernel fusion

Arash Ashari; Shirish Tatikonda; Matthias Boehm; Berthold Reinwald; Keith M. Campbell; John Keenleyside; P. Sadayappan

Exploitation of parallel architectures has become critical to scalable machine learning (ML). Since a wide range of ML algorithms employ linear algebraic operators, GPUs with BLAS libraries are a natural choice for such an exploitation. Two approaches are commonly pursued: (i) developing specific GPU accelerated implementations of complete ML algorithms; and (ii) developing GPU kernels for primitive linear algebraic operators like matrix-vector multiplication, which are then used in developing ML algorithms. This paper extends the latter approach by developing fused kernels for a combination of primitive operators that are commonly found in popular ML algorithms. We identify the generic pattern of computation (alpha * X^T (v * (X * y)) + beta * z) and its various instantiations. We develop a fused kernel to optimize this computation on GPUs -- with specialized techniques to handle both sparse and dense matrices. This approach not only reduces the cost of data loads due to improved temporal locality but also enables other optimizations like coarsening and hierarchical aggregation of partial results. We also present an analytical model that considers input data characteristics and available GPU resources to estimate near-optimal settings for kernel launch parameters. The proposed approach provides speedups ranging from 2 to 67 for different instances of the generic pattern compared to launching multiple operator-level kernels using GPU accelerated libraries. We conclude by demonstrating the effectiveness of the approach in improving end-to-end performance on an entire ML algorithm.


Archive | 2007

Power management in a power-constrained processing system

Joseph E. Bolan; Keith M. Campbell; Vijay Kumar; Malcolm Scott Ware


Archive | 2008

Securing Blade Servers In A Data Center

Keith M. Campbell; Rajiv N. Kantesaia; Caroline M. Metry; Michael N. Womack


Archive | 2009

Thermal-Based Job Scheduling Among Server Chassis Of A Data Center

Keith M. Campbell; Jeffery M. Franke; John K. Whetzel


Archive | 2008

Security system to prevent tampering with a server blade

Keith M. Campbell; Raymond Todd Greggs; James Gordon McLean; Caroline M. Metry


Archive | 2006

METHOD FOR DISPLAY OF BLADE VIDEO LOCATION AND STATUS INFORMATION

Keith M. Campbell; Raymond Todd Greggs; James Gordon McLean; Caroline M. Metry


Archive | 2009

Autoconfiguration Of An IPv6 Component In A Segmented Network

Joseph E. Bolan; Keith M. Campbell; Phuong Thanh Nguyen; Norman C. Strole


Archive | 2009

Data center job migration and scheduling based on server chassis fan speed threshold

Keith M. Campbell; Jeffery M. Franke; John K. Whetzel


Archive | 2006

Coordination of a voicemail response with calendar scheduling

Keith M. Campbell; Caroline M. Metry


Archive | 2009

Providing expansion card settings

Keith M. Campbell; Patrick L. Caporale; Caroline M. Metry; Pravin Patel

Researchain Logo
Decentralizing Knowledge