Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Florian A. Auernhammer.
Ibm Journal of Research and Development | 2015
William J. Starke; Jeffrey A. Stuecheli; David Daly; John Steven Dodson; Florian A. Auernhammer; Patricia M. Sagmeister; Guy Lynn Guthrie; Charles F. Marino; Michael S. Siegel; Bart Blaner
In this paper, we describe the IBM POWER8™ cache, interconnect, memory, and input/output subsystems, collectively referred to as the “nest.” This paper focuses on the enhancements made to the nest to achieve balanced and scalable designs, ranging from small 12-core single-socket systems, up to large 16-processor-socket, 192-core enterprise rack servers. A key aspect of the design has been increasing the end-to-end data and coherence bandwidth of the system, now featuring more than twice the bandwidth of the POWER7® processor. The paper describes the new memory-buffer chip, called Centaur, providing up to 128 MB of eDRAM (embedded dynamic random-access memory) buffer cache per processor, along with an improved DRAM (dynamic random-access memory) scheduler with support for prefetch and write optimizations, providing industry-leading memory bandwidth combined with low memory latency. It also describes new coherence-transport enhancements and the transition to directly integrated PCIe® (PCI Express®) support, as well as additions to the cache subsystem to support higher levels of virtualization and scalability including snoop filtering and cache sharing.
architectures for networking and communications systems | 2008
Florian A. Auernhammer; Patricia M. Sagmeister
Network processors use highly parallel architectures to improve performance and reach multi-gigabit line-speeds. In this paper, we emulate a pipeline in a highly parallel non-programmable industrial InfiniBand Host Channel Adapter to make a performance and bottleneck analysis and, at the same time, explore the potential of a pipelined architecture. Therefore, starting from the original Host Channel Adapter model with multiple send- and receive-side packet-processing units, we compare its performance capabilities with that of a pipelined design by introducing a central arbiter synchronizing the state machines of the different packet-processing instances to achieve a pipelined behavior. We show that the pipelined model achieves a performance comparable to that of the parallel design in most of our micro-benchmarks, making it a valid option for next-generation high-speed adapters. At the same time, our approach enables a deeper analysis of the original architecture and a better understanding of the actual processing requirements, and therefore offers valuable insights for future designs.
Archive | 2015
Florian A. Auernhammer; Patricia M. Sagmeister
Archive | 2011
Florian A. Auernhammer
Archive | 2012
Florian A. Auernhammer; Patricia M. Sagmeister
Archive | 2012
Florian A. Auernhammer; Nikolaos Chrysos; Rolf Clauberg; Andreas C. Doering; Ronald P. Luijten; Patricia M. Sagmeister
Archive | 2009
Florian A. Auernhammer; Patricia M. Sagmeister
Archive | 2008
Andreas Christian Doering; Patricia M. Sagmeister; Jonathan Rohrer; Silvio Dragone; Rolf Clauberg; Florian A. Auernhammer; Maria Gabrani
Archive | 2009
Florian A. Auernhammer
Archive | 2009
Florian A. Auernhammer; Patricia M. Sagmeister