Adam B. Collura
IBM
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Adam B. Collura.
symposium on computer arithmetic | 2011
Steven R. Carlough; Adam B. Collura; Silvia Melitta Mueller; Michael Kroener
Decimal floating-point Arithmetic is widely used in commercial computing applications, such as financial transactions, where rounding errors prevent the use of binary floating-point operations. The revised IEEE Standard for Floating-Point Arithmetic (IEEE-754-2008) defined standardized decimal floating-point (DFP) formats. As more software applications adopt the IEEE decimal floating-point standard, hardware accelerators that support it are becoming more prevalent. This paper describes the second generation decimal floating-point accelerator implemented on the IBM zEnterprise-196 processor. The 4-cycle deep pipeline was designed to optimize the latency of fixed-point decimal operations while significantly improving the bandwidth of DFP operations. A detailed description of the unit and a comparison to previous implementations found in literature is provided.
high-performance computer architecture | 2013
James J. Bonanno; Adam B. Collura; Daniel Lipetz; Ulrich Mayer; Brian R. Prasky; Anthony Saporito
This paper describes the large capacity hierarchical branch predictor in the 5.5 GHz IBM zEnterprise EC12 microprocessor. Performance analyses in a simulation model and on zEC12 hardware demonstrate the benefit of this hierarchy compared to a smaller one level predictor. Novel structures and algorithms for two level branch prediction are presented. Prediction information about multiple branches is bulk transferred from the second level into the first upon detecting a perceived miss in the first level. The second level does not directly make branch predictions. Access to the second level is limited when it is unlikely to be productive. The second level is systematically searched in an order that is likely to provide hits as early as possible. On the workloads analyzed in the simulation model, measurements show a maximum core performance benefit of 13.8%. On the two workloads analyzed on zEC12 hardware 3.4% and 5.3% system performance improvements are achieved.
Archive | 2008
Steven R. Carlough; Adam B. Collura; Wen H. Li; Eric M. Schwarz; Chung-Lung Kevin Shum
Archive | 2012
Steven R. Carlough; Adam B. Collura; Michael Kroener; Silvia Melitta Mueller
Archive | 2012
Adam B. Collura; Brian R. Prasky
Archive | 2014
Adam B. Collura; Brian R. Prasky; Anthony Saporito
Archive | 2010
Steven R. Carlough; Adam B. Collura; Michael Kroener; Silvia Melitta Mueller
Archive | 2010
Steven R. Carlough; Adam B. Collura; Klaus Michael Kroener; Silvia Melitta Mueller
Archive | 2015
Steven R. Carlough; Adam B. Collura
Archive | 2013
Adam B. Collura; Brian R. Prasky