Wim Bohm
Colorado State University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Wim Bohm.
international conference on parallel architectures and compilation techniques | 1999
Jeff Hammes; Bob Rinker; Wim Bohm; Walid A. Najjar; Bruce A. Draper; Ross Beveridge
This paper presents the Cameron Project, which aims to provide a high level, algorithmic language and optimizing compiler for the development of image processing applications on reconfigurable computing systems (RCSs). SA-C, a single assignment variant of the C programming language, is designed to exploit both coarse-grain and fine-grain parallelism in image processing applications. Khoros, a software development environment commonly used for image processing, has been modified to support SA-C program development. SA-C supports image processing with true multidimensional arrays, and with sophisticated array access and windowing mechanisms. Reduction operators such as medians and histograms are also provided. The optimizing compiler targets RCSs, which are fine-grained parallel processors made up of field programmable gate arrays (FPGAs), memories and interconnection hardware. They can be used as inexpensive co-processors with conventional workstations or PCs. This paper discusses compiler optimizations to generate optimal FPGA code using dataflow analysis techniques applied to data dependence graphs. Initial results are presented.
Proceedings of IEEE International Symposium on Parallel Algorithms Architecture Synthesis | 1997
Jean-Luc Gaudiot; Wim Bohm; Walid A. Najjar; Tom DeBoni; John Feo; Patrick Miller
Programming a massively-parallel machine is a daunting task for any human programmer, and parallelization may even be impossible for any compiler. Instead, the functional programming paradigm may prove to be an ideal solution by providing an implicitly parallel interface to the programmer. We describe the Sisal (Stream and Iteration in a Single Assignment Language) project. Its goal is to provide a general-purpose user interface for a wide range of parallel processing platforms.
IEEE Transactions on Very Large Scale Integration Systems | 2001
Robert Rinker; Margaret Carter; Amitkumar Patel; Monica Chawathe; Charlie Ross; Jeffrey Hammes; Walid A. Najjar; Wim Bohm
We describe a system, developed as part of the Cameron project, which compiles programs written in a single-assignment subset of C called SA-C into dataflow graphs and then into VHDL. The primary application domain is image processing. The system consists of an optimizing compiler which produces dataflow graphs and a dataflow graph to VHDL translator. The method used for the translation is described here, along with some results on an application. The objective is not to produce yet another design entry tool, but rather to shift the programming paradigm from HDLs to an algorithmic level, thereby extending the realm of hardware design to the application programmer.
ieee international symposium on parallel & distributed processing, workshops and phd forum | 2011
Doug Hains; Zach Cashero; Mark Ottenberg; Wim Bohm; Sanjay V. Rajopadhye
CUDASW++ is a parallelization of the Smith-Waterman algorithm for CUDA graphical processing units that computes the similarity scores of a query sequence paired with each sequence in a database. The algorithm uses one of two kernel functions to compute the score between a given pair of sequences: the inter-task kernel or the intra-task kernel. We have identified the intra-task kernel as a major bottleneck in the CUDASW++ algorithm. We have developed a new intra-task kernel that is faster than the original intra-task kernel used in CUDASW++. We describe the development of our kernel as a series of incremental changes that provide insight into a number of issues that must be considered when developing any algorithm for the CUDA architecture. We analyze the performance of our kernel compared to the original and show that the use of our intra-task kernel substantially improves the overall performance of CUDASW++ on the order of three to four giga-cell updates per second on various benchmark databases.
Proceedings Fifth IEEE International Workshop on Computer Architectures for Machine Perception | 2000
Bruce A. Draper; Walid A. Najjar; Wim Bohm; Jeff Hammes; Bob Rinker; Charlie Ross; Monica Chawathe; José Bins
This paper presents a high-level language for expressing image processing algorithms, and an optimizing compiler that targets FPGAs. The language is called SA-C, and this paper focuses on the language features that 1) support image processing, and 2) enable efficient compilation to FPGAs. It then describes the compilation process, in which SA-C algorithms are translated into non-recursive data flow graphs, which in turn are translated into VHDL. Finally, it presents performance numbers for some well-known image processing routines, written in SAC and automatically compiled to an Annapolis Microsystems WildForce board with Xilinx 4036XL FPGAs.
Archive | 1999
Jeffrey Hammes; Wim Bohm
Archive | 2006
Wim Bohm; Charles Ross
Archive | 2003
A Walid; Wim Bohm; A Bruce; Jeff Hammes; June R. P. Ross; Charles Ross
international symposium on low power electronics and design | 2001
Robert Rinker; Margaret Carter; Amitkumar Patel; Monica Chawathe; Charlie Ross; Jeffrey Hammes; Walid A. Najjar; Wim Bohm
compilers, architecture, and synthesis for embedded systems | 1999
Jeffrey Hammes; Robert G. Rinker; Wim Bohm; Walid A. Najjar