Aaron R. Kunze
Intel
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Aaron R. Kunze.
acm special interest group on data communication | 2004
Ravi Kokku; Taylor L. Riché; Aaron R. Kunze; Jayaram Mudigonda; Jamie Jason; Harrick M. Vin
Most packet processing applications receive and process multiple types of packets. Today, the processors available within packet processing systems are allocated to packet types at design time. In this paper, we explore the benefits and challenges of adapting allocations of processors to packet types in packet processing systems. We demonstrate that, for all the applications and traces considered, run-time adaptation can reduce energy consumption by 70--80% and processor provisioning level by 40--50%. The adaptation benefits are maximized if processor allocations can be adapted at fine time-scales and if the total available processing power can be allocated to packet types in small granularities. We show that, of these two factors, allocating processing power to packet types is small granularity is more important---if the allocation granularity is large, then even a very fine adaptation time-scale yields meager benefits.
Archive | 2005
Harrick Vin; Jayaram Mudigonda; Jamie Jason; Erik J. Johnson; Roy Ju; Aaron R. Kunze; Ruiqi Lian
The programming environments-languages, compilers, and runtime systems -for NPs are in their infancy. At the same time, NPs represent a much larger trend in the processor industry: multicore, lightweight threaded architectures designed for throughput-driven applications. Once this trend hits the mainstream-programming marketplace, the need for a programming environment that is as easy to use as the programming environments for todays workstations and servers will become universally important to programmers. The Shangri-La architecture represents a complete programming environment for the domain of packet processing on multicore, lightweight threaded architectures in general, and NPs specifically. Shangri-La encompasses a language that exposes domain constructs instead of hardware constructs, keeping the programmer and code separate from architectural details; a sophisticated compiler complex that uses profile information to guide the mapping of code to processors and data structures to memory automatically; and runtime system to ensure maximum performance benefits in the face of fluctuating traffic conditions-both natural and malicious. Currently effort is being made on two major tasks: creating a prototype implementation of the proposed architecture, and researching the more difficult questions that will appear as development proceeds. The prototype system, which builds on the Open Research Compiler infrastructure and targets the Intel IXP2400 network processor, will provide a platform for further research and development.
architectures for networking and communications systems | 2006
Aaron R. Kunze; Stephen D. Goglin; Erik J. Johnson
The complexity of packet-processing applications continues to grow, with encryption, compression, and XML processing becoming common on packet-processing devices at the edge of enterprise and service provider networks. While performance remains a key differentiator for these devices, the complexity and rate of change in the supported applications has made general-purpose platforms an attractive alternative to ASICs and network processors. General-purpose platforms offer excellent programmability and a wealth of existing software, in the form of operating systems, libraries, and applications that can be used to build a packet-processing system; however, the performance of general-purpose operating systems is unacceptable for many environments. This has driven developers to either make derivative versions of existing operating systems or to use special-purpose operating systems with a less comprehensive and familiar library of existing software. As part of the Symerton project, we propose using virtualization to address these issues. We have designed a system that has a virtual machine dedicated to high-performance networking, and a virtual machine dedicated to hosting non-performance critical tasks in a general-purpose operating system. Using a proof-of-concept implementation, we show that the resulting system outperforms a general-purpose operating system by an average of 22% for a real networking application. We also discuss tradeoffs that will need to be considered in further development of systems using this design.
architectures for networking and communications systems | 2005
Arun Raghunath; Aaron R. Kunze; Erik J. Johnson; Vinod K. Balakrishnan
Network edge packet-processing systems, as are commonly implemented on network processor platforms, are increasingly required to support a rich set of services. These multi-service systems are also subjected to widely varying and unpredictable traffic. Current network processor systems do not simultaneously deal well with a variety of services and fluctuating workloads. For example, current methods of worst-case, static provisioning can meet performance requirements for any workload, but provisioning each service for its worst case reduces the total number of services that can be supported. Alternately, profile-driven automatic-partitioning compilers create efficient binaries for multi-service applications for specific workloads but they are sensitive to workload fluctuations. Run-time adaptation is a potential solution to this problem. With run-time adaptation, the mapping of services to system resources can be dynamically adjusted based on the workload. We have implemented an adaptive system that automatically changes the mapping of services to processors, and handles migration of services between different processor core types to match the current workload. In this paper we explain our adaptive system built on the Intelreg IXP2400 network processor. We demonstrate that it outperforms multiple different profile-driven compiled solutions for most workloads and performs within 20% of the optimal compiled solution for the remaining workloads.
Archive | 2005
Harrick Vin; Jayaram Mudigonda; Jamie Jason; Erik J. Johnson; Roy Ju; Aaron R. Kunze; Ruiqi Lian
The programming environments-languages, compilers, and runtime systems -for NPs are in their infancy. At the same time, NPs represent a much larger trend in the processor industry: multicore, lightweight threaded architectures designed for throughput-driven applications. Once this trend hits the mainstream-programming marketplace, the need for a programming environment that is as easy to use as the programming environments for todays workstations and servers will become universally important to programmers. The Shangri-La architecture represents a complete programming environment for the domain of packet processing on multicore, lightweight threaded architectures in general, and NPs specifically. Shangri-La encompasses a language that exposes domain constructs instead of hardware constructs, keeping the programmer and code separate from architectural details; a sophisticated compiler complex that uses profile information to guide the mapping of code to processors and data structures to memory automatically; and runtime system to ensure maximum performance benefits in the face of fluctuating traffic conditions-both natural and malicious. Currently effort is being made on two major tasks: creating a prototype implementation of the proposed architecture, and researching the more difficult questions that will appear as development proceeds. The prototype system, which builds on the Open Research Compiler infrastructure and targets the Intel IXP2400 network processor, will provide a platform for further research and development.
Archive | 2003
Erik J. Johnson; Aaron R. Kunze
Archive | 1998
Christian Dreke; James W. Edwards; Walter K. Hazzard; Aaron R. Kunze
Archive | 2007
Aaron R. Kunze; Erik J. Johnson; Hermann Gartler
Archive | 2004
Aaron R. Kunze; Jim Edwards; Christian Dreke
Archive | 2001
Aaron R. Kunze; Erik J. Johnson; John A. Wiegert