Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Adam B. Collura is active.

Publication


Featured researches published by Adam B. Collura.


symposium on computer arithmetic | 2011

The IBM zEnterprise-196 Decimal Floating-Point Accelerator

Steven R. Carlough; Adam B. Collura; Silvia Melitta Mueller; Michael Kroener

Decimal floating-point Arithmetic is widely used in commercial computing applications, such as financial transactions, where rounding errors prevent the use of binary floating-point operations. The revised IEEE Standard for Floating-Point Arithmetic (IEEE-754-2008) defined standardized decimal floating-point (DFP) formats. As more software applications adopt the IEEE decimal floating-point standard, hardware accelerators that support it are becoming more prevalent. This paper describes the second generation decimal floating-point accelerator implemented on the IBM zEnterprise-196 processor. The 4-cycle deep pipeline was designed to optimize the latency of fixed-point decimal operations while significantly improving the bandwidth of DFP operations. A detailed description of the unit and a comparison to previous implementations found in literature is provided.


high-performance computer architecture | 2013

Two level bulk preload branch prediction

James J. Bonanno; Adam B. Collura; Daniel Lipetz; Ulrich Mayer; Brian R. Prasky; Anthony Saporito

This paper describes the large capacity hierarchical branch predictor in the 5.5 GHz IBM zEnterprise EC12 microprocessor. Performance analyses in a simulation model and on zEC12 hardware demonstrate the benefit of this hierarchy compared to a smaller one level predictor. Novel structures and algorithms for two level branch prediction are presented. Prediction information about multiple branches is bulk transferred from the second level into the first upon detecting a perceived miss in the first level. The second level does not directly make branch predictions. Access to the second level is limited when it is unlikely to be productive. The second level is systematically searched in an order that is likely to provide hits as early as possible. On the workloads analyzed in the simulation model, measurements show a maximum core performance benefit of 13.8%. On the two workloads analyzed on zEC12 hardware 3.4% and 5.3% system performance improvements are achieved.


Archive | 2008

Processor and method for workaround trigger activated exceptions

Steven R. Carlough; Adam B. Collura; Wen H. Li; Eric M. Schwarz; Chung-Lung Kevin Shum


Archive | 2012

Decimal Multi-Precision Overflow and Tininess Detection

Steven R. Carlough; Adam B. Collura; Michael Kroener; Silvia Melitta Mueller


Archive | 2012

CACHE LINE HISTORY TRACKING USING AN INSTRUCTION ADDRESS REGISTER FILE

Adam B. Collura; Brian R. Prasky


Archive | 2014

CHECKPOINTS FOR A SIMULTANEOUS MULTITHREADING PROCESSOR

Adam B. Collura; Brian R. Prasky; Anthony Saporito


Archive | 2010

Decimal Floating Point Mechanism and Process of Multiplication without Resultant Leading Zero Detection

Steven R. Carlough; Adam B. Collura; Michael Kroener; Silvia Melitta Mueller


Archive | 2010

DECIMAL ADDER WITH END AROUND CARRY

Steven R. Carlough; Adam B. Collura; Klaus Michael Kroener; Silvia Melitta Mueller


Archive | 2015

MODULO-M BINARY COUNTER

Steven R. Carlough; Adam B. Collura


Archive | 2013

Cache line history tracking using an instruction address register file storing memory location identifier

Adam B. Collura; Brian R. Prasky

Researchain Logo
Decentralizing Knowledge