Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where James Allan Kahle is active.

Publication


Featured researches published by James Allan Kahle.


Ibm Journal of Research and Development | 2005

Introduction to the cell multiprocessor

James Allan Kahle; Michael Norman Day; Harm Peter Hofstee; Charles Ray Johns; T. R. Maeurer; David Shippy

This paper provides an introductory overview of the Cell multiprocessor. Cell represents a revolutionary extension of conventional microprocessor architecture and organization. The paper discusses the history of the project, the program objectives and challenges, the disign concept, the architecture and programming models, and the implementation.


international solid-state circuits conference | 2005

The design and implementation of a first-generation CELL processor

D. Pham; S. Asano; Mark Bolliger; M.N. Day; H.P. Hofstee; Charles Ray Johns; James Allan Kahle; Atsushi Kameyama; John M. Keaty; Y. Masubuchi; Mack W. Riley; D. Shippy; Daniel Lawrence Stasiak; Masakazu Suzuoki; M. Wang; James D. Warnock; Steve Weitzel; Dieter Wendel; T. Yamazaki; Kazuaki Yazawa

A CELL processor is a multi-core chip consisting of a 64b power architecture processor, multiple streaming processors, a flexible IO interface, and a memory interface controller. This SoC is implemented in 90nm SOI technology. The chip is designed with a high degree of modularity and reuse to maximize the custom circuit content and achieve a high-frequency clock-rate.


IEEE Journal of Solid-state Circuits | 2006

Overview of the architecture, circuit design, and physical implementation of a first-generation cell processor

D. Pham; T. Aipperspach; David William Boerstler; M. Bolliger; R. Chaudhry; D. Cox; P. Harvey; P.M. Harvey; H.P. Hofstee; C. Johns; James Allan Kahle; Atsushi Kameyama; John M. Keaty; Y. Masubuchi; M. Pham; Jürgen Pille; Stephen D. Posluszny; M. Riley; D.L. Stasiak; M. Suzuoki; Osamu Takahashi; James D. Warnock; Stephen Douglas Weitzel; Dieter Wendel; K. Yazawa

This paper reviews the design challenges that current and future processors must face, with stringent power limits, high-frequency targets, and the continuing system integration trends. This paper then describes the architecture, circuit design, and physical implementation of a first-generation Cell processor and the design techniques used to overcome the above challenges. A Cell processor consists of a 64-bit Power Architecture processor coupled with multiple synergistic processors, a flexible IO interface, and a memory interface controller that supports multiple operating systems including Linux. This multi-core SoC, implemented in 90-nm SOI technology, achieved a high clock rate by maximizing custom circuit design while maintaining reasonable complexity through design modularity and reuse.


international solid state circuits conference | 1994

A 2.2 W, 80 MHz superscalar RISC microprocessor

Gianfranco Gerosa; S. Gary; C. Dietz; D. Pham; K. Hoover; J. Alvarez; H. Sanchez; P. Ippolito; Tai Ngo; S. Litch; J. Eno; J. Golab; N. Vanderschaaf; James Allan Kahle

A 28 mW/MHz at 80 MHz structured-custom RISC microprocessor design is described. This 32-b implementation of the PowerPC architecture is fabricated in a 3.3 V, 0.5 /spl mu/m, 4-level metal CMOS technology, resulting in 1.6 million transistors in a 7.4 mm by 11.5 mm chip size. Dual 8-kilobyte instruction and data caches coupled to a high performance 32/64-b system bus and separate execution units (float, integer, loadstore, and system units) result in peak instruction rates of three instructions per clock cycle. Low-power design techniques are used throughout the entire design, including dynamically powered down execution units. Typical power dissipation is kept under 2.2 W at 80 MHz. Three distinct levels of software-programmable, static, low-power operation-for system power management are offered, resulting in standby power dissipation from 2 mW to 350 mW. CPU to bus clock ratios of 1/spl times/, 2/spl times/, 3/spl times/, and 4/spl times/ are implemented to allow control of system power while maintaining processor performance. As a result, workstation level performance is packed into a low-power, low-cost design ideal for notebooks and desktop computers. >


international solid-state circuits conference | 2010

The implementation of POWER7 TM : A highly parallel and scalable multi-core high-end server processor

Dieter Wendel; Ronald Nick Kalla; Robert Cargoni; Joachim Clables; Joshua Friedrich; Roland Frech; James Allan Kahle; Balaram Sinharoy; William J. Starke; Scott A. Taylor; Steve Weitzel; Sam Gat-Shang Chu; Saiful Islam; Victor Zyuban

The next processor of the POWER ™ family, called POWER7™ is introduced. Eight quad-threaded cores are integrated together with two memory controllers and high-speed system links on a 567mm2 die, employing 1.2B transistors in 45nm CMOS SOI technology [4]. High on-chip performance and therefore bandwidth is achieved using 11 layers of low-к copper wiring and devices with enhanced dual-stress liners. The technology features deep trench [DT] capacitors that are used to build the 32MB embedded DRAM L3 based on a 0.067µm2 DRAM cell. DT capacitors are used also to reduce on-chip voltage-island supply noise. Focusing on speed, the dual-supply ripple-domino SRAM concepts follows the schemes described elsewhere.


international symposium on microarchitecture | 2005

The Cell Processor Architecture

James Allan Kahle

This talk will present the Cell processor, jointly developed by the STI (Sony-Toshiba-IBM) partnership. Cell is a non-homogeneous chip multiprocessor intended for general-purpose applications but with a particular emphasis on multimedia performance. The Cell processor combines a 64bit Power Architecture(TM) core with 8 Synergistic Processors. In many cases, it delivers more than an order of magnitude more performance than conventional PC processors. Cell achieves this performance and power efficiency improvement by a new division of labor between the Power core and the Synergistic Processors. Cell allows for a wide variety of programming models, a selection of which will be presented in this talk. We will end the talk by discussing some applications that seem to fit the Cell processor particularly well, and by indicating areas of further exploration.


international conference on computer design | 1989

IBM second-generation RISC machine organization

H. B. Bakoglu; Gregory F. Grohoski; Larry Edward Thatcher; James Allan Kahle; Charles Roberts Moore; David P. Tuttle; Warren E. Maule; William Rudolph Hardell; Dwain Alan Hicks; Myhong Nguyenphu; Robert K. Montoye; W. T. Glover; Sudhir Dhawan

A highly concurrent second-generation RISC (reduced-instruction-set computer) that combines a powerful RISC architecture with sophisticated hardware design techniques to achieve a short cycle time and a low cycles-per-instruction (CPI) ratio is described. Like earlier RISC processors, this design uses a register-oriented instruction set, the CPU is hardwired rather than microcoded, and it features a pipelined implementation. Unlike earlier RISC processors, however, several advanced architectural and implementation features are used, including separate instruction and data caches, zero-cycle branches, multiple-instruction dispatch, and simultaneous execution of fixed- and floating-point instructions. The CPU has a four-word data bus to main memory, a four-word instruction-fetch bus from the I-cache arrays, and a two-word data bus between the D-cache and floating-point unit. The CPU has a full 64-b floating-point engine, and thirty-two 64-b floating point registers in addition to thirty-two 32-b fixed-point registers. In a single cycle, four instructions can be executed simultaneously.<<ETX>>


asia and south pacific design automation conference | 2006

Key features of the design methodology enabling a multi-core SoC implementation of a first-generation CELL processor

D. Pham; Hans-Werner Anderson; Erwin Behnen; Mark Bolliger; Sanjay Gupta; H. Peter Hofstee; Paul Harvey; Charles Ray Johns; James Allan Kahle; Atsushi Kameyama; John M. Keaty; Bob Le; Sang Lee; Tuyen V. Nguyen; John George Petrovick; Mydung Pham; Juergen Pille; Stephen D. Posluszny; Mack W. Riley; Joseph Roland Verock; James D. Warnock; Steve Weitzel; Dieter Wendel

This paper reviews the design challenges that current and future processors must face, with stringent power limits and high frequency targets, and the design methods required to overcome the above challenges and address the continuing Giga-scale system integration trend. This paper then describes the details behind the design methodology that was used to successfully implement a first-generation CELL processor - a multi-core SoC. Key features of this methodology are broad optimization with fast rule-based analysis engines using macro-level abstraction for constraints propagation up/down the design hierarchy, coupled with accurate transistor level simulation for detailed analysis. The methodology fostered the modular design concept that is inherent to the CELL architecture, enabling a high frequency design by maximizing custom circuit content through re-use, and balanced power, frequency, and die size targets through global convergence capabilities. The design has roughly 241 million transistors implemented in 90 nm SOI technology with 8 levels of copper interconnects and one local interconnect layer. The chip has been tested at various temperatures, voltages, and frequencies. Correct operation has been observed in the lab on first pass silicon at frequencies well over 4GHz.


custom integrated circuits conference | 2005

The design methodology and implementation of a first-generation CELL processor: a multi-core SoC

D. Pham; Erwin Behnen; Mark Bolliger; H.P. Hofstee; Charles Ray Johns; James Allan Kahle; Atsushi Kameyama; John M. Keaty; B. Le; Y. Masubuchi; Stephen D. Posluszny; Mack W. Riley; M. Suzuoki; M. Wang; James D. Warnock; Steve Weitzel; Dieter Wendel; K. Yazawa

This paper reviews the design challenges that current and future processors must face with stringent power limits and high frequency targets, and the design methods required to address the continuing system integration trends. This paper then describes the implementation of a first-generation CELL processor and the design methods used to overcome the above challenges. A CELL processor consists of a 64 bit power architecture processor coupled with multiple synergistic processors, a flexible IO interface, and a memory interface controller that supports multiple operating systems including Linux. This multicore SoC, implemented in 90nm SOI technology, achieved a high clock rate by maximizing custom circuit design while maintaining reasonable complexity through design modularity and reuse.


international solid-state circuits conference | 1994

A 3.0 W 75SPECint92 85SPECfp92 superscalar RISC microprocessor

D. Pham; M. Alexander; A. Arizpe; B. Burgess; C. Dietz; Lee Evan Eisen; R. El-Kareh; J. Eno; S. Gary; G. Gerosa; B. Goins; J. Golab; R. Golla; R. Harris; B. Ho; Y.-W. Ho; K. Hoover; C. Hunter; P. Ippolito; R. Jessani; James Allan Kahle; K.R. Kishore; B. Kuttanna; S. Litch; S. Mallick; Tai Ngo; D. Ogden; C. Olson; S.-H. Park; R. Patel

This superscalar microprocessor is a 32b implementation of the PowerPC Architecture. With an estimated performance/power ratio of 25SPECint92/W at 80 MHz, this RISC style chip offers workstation-level performance packed into a low-power consumption, low-cost design ideal for notebooks and desktop computers.<<ETX>>

Researchain Logo
Decentralizing Knowledge