Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where David Meltzer is active.

Publication


Featured researches published by David Meltzer.


international solid-state circuits conference | 1998

A 1.0-GHz single-issue 64-bit powerPC integer processor

Joel Abraham Silberman; Naoaki Aoki; David William Boerstler; Jeffrey L. Burns; Sang Hoo Dhong; Axel Essbaum; Uttam Shyamalindu Ghoshal; David F. Heidel; Peter Hofstee; Kyung Tek Lee; David Meltzer; Hung Ngo; Kevin J. Nowka; Stephen D. Posluszny; Osamu Takahashi; Ivan Vo; Brian Zoric

This 64 b single-issue integer processor, comprised of about one million transistors, is fabricated in a 0.15 /spl mu/m effective channel length, six-metal-layer CMOS technology. Intended as a vehicle to explore circuit, clocking, microarchitecture, and methodology options for high-frequency processors, the processor prototype implements 60 fixed-point compare, logical, arithmetic, and rotate-merge-mask instructions of the PowerPC instruction-set architecture with single-cycle latency. The processor executes programs written in this instruction subset from cache with a 1 ns cycle. In addition, the prototype implements 36 PowerPC load/store instructions that execute as single-cycle operations (zero wait cycles) with 1.15 ns latency. Full data forwarding and full at speed scan testing are supported.


international symposium on microarchitecture | 1998

Designing for a gigahertz [guTS integer processor]

Harm Peter Hofstee; Sang Hoo Dhong; David Meltzer; Kevin J. Nowka; Joel Abraham Silberman; J.I. Burns; Stephen D. Posluszny; Osamu Takahashi

At the IEEE International Solid State Circuits Conference this February, the IBM Austin Research Laboratory presented an experimental 64-bit integer processor called guTS (gigahertz unit Test Site). The goal of the guTS project was to demonstrate that circuit techniques, and circuit-centric design, could significantly increase the performance of microprocessors, thus providing headroom for future performance growth beyond contributions from microarchitecture and CMOS technology. To clearly distinguish the design contributions of this project from innovations in CMOS technology we chose a fabrication technology that was in production in 1997. The guTS processor is a full-custom, nearly 100% dynamic design. Its single-issue core implements 96 instructions from the integer subset of the PowerPC instruction set architecture, and covers in excess of 90% of instructions executed in typical code. Address translation, floating-point, and I/O-related instructions are omitted. All instructions, including loads and stores, execute in one cycle. We measured core speeds in excess of a gigahertz. We focus here on the circuit-centric design approach that enabled the gigahertz result. This approach requires designers to operate across the boundaries of microarchitecture, logic, circuit, and physical design. We explain why developments in CMOS technology increasingly favor this approach.


international conference on computer design | 1998

Design methodology for a 1.0 GHz microprocessor

Stephen D. Posluszny; Nobumasa Aoki; David William Boerstler; Jeffrey L. Burns; Sang Hoo Dhong; Uttam Shyamalindu Ghoshal; H. Peter Hofstee; David P. LaPotin; Kyung Tek Lee; David Meltzer; Hung C. Ngo; Kevin J. Nowka; Joel Abraham Silberman; Osamu Takahashi; Ivan Vo

This paper describes the design methodology used to build an experimental 1.0 GigaHertz PowerPC integer microprocessor at IBMs Austin Research Laboratory. The high frequency requirements dictated the chip composition to be almost entirely custom macros using dynamic circuit techniques. The methodology presented will cover design and verification tools as well as circuit constraints and microarchitecture philosophy. The microarchitecture, circuits and tools were defined by the high frequency requirements of the processor as well as the aggressive design schedule and size of the design team.


signal processing systems | 2000

Trends in compilable DSP architecture

John Glossner; Jaime H. Moreno; Mayan Moudgill; Jeff H. Derby; Erdem Hokenek; David Meltzer; Uzi Shvadron; Malcolm Scott Ware

We review the evolution of DSP architectures and compiler technology, and describe how compiler techniques are being used to optimize emerging DSP architectures. Such new architectures are characterized by the exploitation of data and instruction level parallelism while being an amenable target for a compiler, thereby reducing or eliminating the need to rely on assembly language programming and/or architecture-specific compiler intrinsics to achieve highly efficient code. We also summarize our research results on an ultra low power compilable DSP architecture.


international solid-state circuits conference | 2000

A 1 GHz single-issue 64 b PowerPC processor

Peter Hofstee; Naoaki Aoki; David William Boerstler; Paula Kristine Coulman; Sang Hoo Dhong; Brian Flachs; N. Kojima; O. Kwon; Kyung Tek Lee; David Meltzer; Kevin J. Nowka; J. Park; J. Peter; Stephen D. Posluszny; M. Shapiro; Joel Abraham Silberman; Osamu Takahashi; B. Weinberger

This 64 b single-issue PowerPC processor contains 19M transistors and is fabricated in 0.12 /spl mu/m L/sub eff/ six-layer copper interconnect CMOS. Nominal processor clock frequency is 1.0 GHz. At the fast end of the process distribution the processor reaches 1.15 GHz (1.87 V, 101/spl deg/C, 112 W). As in a previous design, nearly the entire processor is implemented using delayed-reset and self-resetting dynamic circuit macros. New contributions include: (1) a fully pipelined, four execution-stage IEEE double-precision floating-point unit (FPU) with fused multiply-add. 2) Sum-addressed memory management units (MMUs) and 64 kB 2-cycle caches. (3) Support for the full 64 b PowerPC instruction set. (4) Dynamic PLA-based control. (5) A microarchitecture and floorplan that balances critical paths. (6) Delayed-reset dynamic circuits that support stress testing (burn-in). 7) Improved clock generation and distribution.


design automation conference | 2000

“Timing closure by design,” a high frequency microprocessor design methodology

Stephen D. Posluszny; Naoaki Aoki; David William Boerstler; Paula Kristine Coulman; Sang Hoo Dhong; B. Flachs; Peter Hofstee; N. Kojima; O. Kwon; K. Lee; David Meltzer; Kevin J. Nowka; J. Park; J. Peter; Joel Abraham Silberman; Osamu Takahashi; P. Villarrubial

This paper presents a design methodology emphasizing early and quick timing closure for high frequency microprocessor designs. This methodology was used to design a Gigahertz class PowerPC microprocessor with 19 million transistors. Characteristics of “Timing Closure by Design are 1) logic partitioned on timing boundaries, 2) predictable control structures (PLAs), 3) static interfaces for dynamic circuits, 4) low skew clock distribution, 5) deterministic method of macro placement, 6) simplified timing analysis, and 7) refinement method of chip integration with early timing analysis.


international symposium on low power electronics and design | 2001

Clocking strategies and scannable latches for low power appliacations

Victor Zyuban; David Meltzer

This paper covers a range of issues in the design of clocking schemes for low-power applications. First we revisit, extend and improve the power-performance optimization methodology for latches, attempting to make it more formal and comprehensive. The data switching factor and the glitching activity are taken into consideration, using a formal analytical approach, then a notion of energy-efficient family of configurations is introduced to make the comparison of different latch styles in the power-performance space more fair, also the power of the clock distribution is taken into account. Practical issues of building a low overhead scan mechanism are considered, and the power overhead of the scannable design is analyzed. A low-power LSSD extension to single-phase latches is proposed, and results of a comparative study of LSSD-scannable latches are shown, supported by experimental data measured on a 0.18 /spl mu/m test chip.


Archive | 1997

Multi-port multiple-simultaneous-access DRAM chip

Jeffrey H. Dreibelbis; Wayne F. Ellis; Thomas J. Heller; Michael Ignatowski; Howard Leo Kalter; David Meltzer


Archive | 1988

High speed buffer store arrangement for quick wide transfer of data

Frederick J. Aichelmann; Rex Harold Blumberg; David Meltzer; James Herbert Pomerene; Thomas R. Puzak; Rudolph Nathan Rechtschaffen; Frank John Sparacio


Archive | 2003

Vector register file with arbitrary vector addressing

Clair John Glossner; Erdem Hokenek; David Meltzer; Mayan Moudgill

Researchain Logo
Decentralizing Knowledge