Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Joel Abraham Silberman is active.

Publication


Featured researches published by Joel Abraham Silberman.


international solid-state circuits conference | 1998

A 1.0-GHz single-issue 64-bit powerPC integer processor

Joel Abraham Silberman; Naoaki Aoki; David William Boerstler; Jeffrey L. Burns; Sang Hoo Dhong; Axel Essbaum; Uttam Shyamalindu Ghoshal; David F. Heidel; Peter Hofstee; Kyung Tek Lee; David Meltzer; Hung Ngo; Kevin J. Nowka; Stephen D. Posluszny; Osamu Takahashi; Ivan Vo; Brian Zoric

This 64 b single-issue integer processor, comprised of about one million transistors, is fabricated in a 0.15 /spl mu/m effective channel length, six-metal-layer CMOS technology. Intended as a vehicle to explore circuit, clocking, microarchitecture, and methodology options for high-frequency processors, the processor prototype implements 60 fixed-point compare, logical, arithmetic, and rotate-merge-mask instructions of the PowerPC instruction-set architecture with single-cycle latency. The processor executes programs written in this instruction subset from cache with a 1 ns cycle. In addition, the prototype implements 36 PowerPC load/store instructions that execute as single-cycle operations (zero wait cycles) with 1.15 ns latency. Full data forwarding and full at speed scan testing are supported.


international solid-state circuits conference | 2005

A 4.8GHz fully pipelined embedded SRAM in the streaming processor of a CELL processor

Sang Hoo Dhong; Osamu Takahashi; Michael Wayne White; T. Asano; Takaaki Nakazato; Joel Abraham Silberman; Atsushi Kawasumi; H. Yoshihara

A 6-stage fully pipelined embedded SRAM is implemented in a 90nm SOI technology. The array uses a conventional 6-transistor memory cell and sense amplifier to achieve the cycle time while minimizing the impact of device variation. A sum-addressed pre-decoder allows partial activation for power savings.


vlsi test symposium | 1998

High speed serializing/de-serializing design-for-test method for evaluating a 1 GHz microprocessor

David F. Heidel; Sang Hoo Dhong; H. Peter Hofstee; Michael Immediato; Kevin J. Nowka; Joel Abraham Silberman; Kevin Stawiasz

As microprocessor speeds approach 1 GHz and beyond the difficulties of at-speed testing continue to increase. In particular, automated test equipment which operates at these frequencies is very limited. This paper discusses a design-for-test method which serializes parallel circuit inputs and de-serializes circuit outputs to achieve 1 GHz operation on test equipment operating at frequencies below 100 MHz. This method has been used to successfully characterize the operation of a 1 GHz microprocessor chip.


international symposium on microarchitecture | 1998

Designing for a gigahertz [guTS integer processor]

Harm Peter Hofstee; Sang Hoo Dhong; David Meltzer; Kevin J. Nowka; Joel Abraham Silberman; J.I. Burns; Stephen D. Posluszny; Osamu Takahashi

At the IEEE International Solid State Circuits Conference this February, the IBM Austin Research Laboratory presented an experimental 64-bit integer processor called guTS (gigahertz unit Test Site). The goal of the guTS project was to demonstrate that circuit techniques, and circuit-centric design, could significantly increase the performance of microprocessors, thus providing headroom for future performance growth beyond contributions from microarchitecture and CMOS technology. To clearly distinguish the design contributions of this project from innovations in CMOS technology we chose a fabrication technology that was in production in 1997. The guTS processor is a full-custom, nearly 100% dynamic design. Its single-issue core implements 96 instructions from the integer subset of the PowerPC instruction set architecture, and covers in excess of 90% of instructions executed in typical code. Address translation, floating-point, and I/O-related instructions are omitted. All instructions, including loads and stores, execute in one cycle. We measured core speeds in excess of a gigahertz. We focus here on the circuit-centric design approach that enabled the gigahertz result. This approach requires designers to operate across the boundaries of microarchitecture, logic, circuit, and physical design. We explain why developments in CMOS technology increasingly favor this approach.


international conference on computer design | 1998

Design methodology for a 1.0 GHz microprocessor

Stephen D. Posluszny; Nobumasa Aoki; David William Boerstler; Jeffrey L. Burns; Sang Hoo Dhong; Uttam Shyamalindu Ghoshal; H. Peter Hofstee; David P. LaPotin; Kyung Tek Lee; David Meltzer; Hung C. Ngo; Kevin J. Nowka; Joel Abraham Silberman; Osamu Takahashi; Ivan Vo

This paper describes the design methodology used to build an experimental 1.0 GigaHertz PowerPC integer microprocessor at IBMs Austin Research Laboratory. The high frequency requirements dictated the chip composition to be almost entirely custom macros using dynamic circuit techniques. The methodology presented will cover design and verification tools as well as circuit constraints and microarchitecture philosophy. The microarchitecture, circuits and tools were defined by the high frequency requirements of the processor as well as the aggressive design schedule and size of the design team.


international solid-state circuits conference | 2012

A 3D system prototype of an eDRAM cache stacked over processor-like logic using through-silicon vias

Matthew R. Wordeman; Joel Abraham Silberman; Gary W. Maier; Michael R. Scheuermann

3D integration (3DI) holds promise for improved performance of integrated systems by increasing interconnect bandwidth [1]. A processor stacked with cache memory is one potential application of 3DI [2]. This work describes the design and operation of a prototype of a 3D system, constructed by stacking a memory layer, built with eDRAM [3] and logic blocks from the IBM Power7™ processor L3 cache, and a “processor proxy” layer in 45nm CMOS technology [4] enhanced to include through-silicon vias (TSVs) [5]. Unlike the previously reported 3D eDRAM [6], the 3D stack described here is constructed using 50μm pitch μC4s joining the front side of one thick chip to TSV connections on the back side of a thinned chip. TSVs are formed of Cu-filled vias that are ~20μm in diameter and <;100μm deep [5].


international solid-state circuits conference | 2000

A 1 GHz single-issue 64 b PowerPC processor

Peter Hofstee; Naoaki Aoki; David William Boerstler; Paula Kristine Coulman; Sang Hoo Dhong; Brian Flachs; N. Kojima; O. Kwon; Kyung Tek Lee; David Meltzer; Kevin J. Nowka; J. Park; J. Peter; Stephen D. Posluszny; M. Shapiro; Joel Abraham Silberman; Osamu Takahashi; B. Weinberger

This 64 b single-issue PowerPC processor contains 19M transistors and is fabricated in 0.12 /spl mu/m L/sub eff/ six-layer copper interconnect CMOS. Nominal processor clock frequency is 1.0 GHz. At the fast end of the process distribution the processor reaches 1.15 GHz (1.87 V, 101/spl deg/C, 112 W). As in a previous design, nearly the entire processor is implemented using delayed-reset and self-resetting dynamic circuit macros. New contributions include: (1) a fully pipelined, four execution-stage IEEE double-precision floating-point unit (FPU) with fused multiply-add. 2) Sum-addressed memory management units (MMUs) and 64 kB 2-cycle caches. (3) Support for the full 64 b PowerPC instruction set. (4) Dynamic PLA-based control. (5) A microarchitecture and floorplan that balances critical paths. (6) Delayed-reset dynamic circuits that support stress testing (burn-in). 7) Improved clock generation and distribution.


design automation conference | 2000

“Timing closure by design,” a high frequency microprocessor design methodology

Stephen D. Posluszny; Naoaki Aoki; David William Boerstler; Paula Kristine Coulman; Sang Hoo Dhong; B. Flachs; Peter Hofstee; N. Kojima; O. Kwon; K. Lee; David Meltzer; Kevin J. Nowka; J. Park; J. Peter; Joel Abraham Silberman; Osamu Takahashi; P. Villarrubial

This paper presents a design methodology emphasizing early and quick timing closure for high frequency microprocessor designs. This methodology was used to design a Gigahertz class PowerPC microprocessor with 19 million transistors. Characteristics of “Timing Closure by Design are 1) logic partitioned on timing boundaries, 2) predictable control structures (PLAs), 3) static interfaces for dynamic circuits, 4) low skew clock distribution, 5) deterministic method of macro placement, 6) simplified timing analysis, and 7) refinement method of chip integration with early timing analysis.


symposium on vlsi circuits | 2005

The circuits and physical design of the synergistic processor element of a CELL processor

Osamu Takahashi; R. Cook; Scott R. Cottier; Sang Hoo Dhong; Brian Flachs; Koji Hirairi; Atsushi Kawasumi; H. Murakami; H. Noro; H. Oh; S. Onishi; Juergen Pille; Joel Abraham Silberman; S. Yong

A 32b 4-way SIMD dual-issue synergistic processor element of a CELL processor is developed with 20.9 million transistors in 14.8mm/sub 2/ using a 90nm SOI technology. CMOS static gates implement the majority of the logic. Dynamic circuits are used in critical areas, occupying 19% of the non-SRAM area. ISA, microarchitecture, and physical implementation are tightly coupled to achieve a compact and power efficient design. Correct operation has been observed up to 5.6GHz at 1.4V supply and 56/spl deg/C.


international symposium on microarchitecture | 2005

Power-conscious design of the Cell processor's synergistic processor element

Osamu Takahashi; Scott R. Cottier; Sang Hoo Dhong; Brian Flachs; Joel Abraham Silberman

The authors describe the low-power design of the synergistic processor element (SPE) of the cell processor developed by Sony, Toshiba and IBM. CMOS static gates implement most of the logic, and dynamic circuits are used in critical areas. Tight coupling of the instruction set architecture, microarchitecture, and physical implementation achieves a compact, power-efficient design.

Researchain Logo
Decentralizing Knowledge