S. Borkar
Intel
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by S. Borkar.
IEEE Journal of Solid-state Circuits | 2011
Jason Howard; Saurabh Dighe; Sriram R. Vangal; Gregory Ruhl; Nitin Borkar; Shailendra Jain; Vasantha Erraguntla; Michael Konow; Michael Riepen; Matthias Gries; Guido Droege; Tor Lund-Larsen; Sebastian Steibl; S. Borkar; Vivek De; R Van Der Wijngaart
This paper describes a multi-core processor that integrates 48 cores, 4 DDR3 memory channels, and a voltage regulator controller in a 64 2D-mesh network-on-chip architecture. Located at each mesh node is a five-port virtual cut-through packet-switched router shared between two IA-32 cores. Core-to-core communication uses message passing while exploiting 384 KB of on-die shared memory. Fine grain power management takes advantage of 8 voltage and 28 frequency islands to allow independent DVFS of cores and mesh. At the nominal 1.1 V supply, the cores operate at 1 GHz while the 2D-mesh operates at 2 GHz. As performance and voltage scales, the processor dissipates between 25 W and 125 W. The processor is implemented in 45 nm Hi-K CMOS and has 1.3 billion transistors.
international solid-state circuits conference | 2003
J. Tschanz; Siva G. Narendra; Yibin Ye; Bradley Bloechel; S. Borkar; Vivek De
Sleep transistors and body bias are used to control active leakage for a 32b integer execution core implemented in a 100nm dual V, CMOS technology. A PMOS sleep transistor degrades performance by 4% but offers 20/spl times/ leakage reduction which is further improved with body bias. Time constants for leakage convergence range from 30ns to 300ns allowing 9-44% power savings for idle periods greater than 100 clock cycles.
IEEE Journal of Solid-state Circuits | 2004
Peter Hazucha; Tanay Karnik; S. Walstra; Bradley Bloechel; J. Tschanz; J. Maiz; Krishnamurthy Soumyanath; Gregory E. Dermer; Siva G. Narendra; Vivek De; S. Borkar
We designed a soft error rate (SER) tolerant latch utilizing local redundancy. We implemented a test chip containing both the standard and SER-tolerant latches in a 90-nm dual-V/sub T/ CMOS process. Accelerated measurements with a neutron beam at Los Alamos National Laboratory demonstrated 10/spl times/ better reliability of the SER-tolerant latch over the standard latch at no speed degradation. The worst case energy and area penalties were 39% and 44%, respectively. Both the energy and area penalties are negligible for standard-latch transistor sizes at least double the minimum width. We analyzed the effects of the recovery time, threshold voltage assignment, and leakage on the SER robustness. The proposed latch can improve reliability of critical sequential logic elements in microprocessors and other circuits.
international solid-state circuits conference | 2002
Siva G. Narendra; M. Haycock; V. Govindarajulu; Vasantha Erraguntla; Howard Wilson; Sriram R. Vangal; Amaresh Pangal; E. Seligman; Rajendran Nair; Ali Portland Keshavarzi; Bradley Bloechel; Gregory E. Dermer; R. Mooney; Nitin Borkar; S. Borkar; Vivek De
A router chip, that incorporates on-chip forward body biasing capability with 2% area overhead, achieves 1 GHz operation at 1.1 V supply in a 150 nm logic technology, compared to 1.25 V required for the original design having no body bias. Switching power is 23% less and chip leakage is reduced by 3.5/spl times/ in standby mode by withdrawing forward bias.
symposium on vlsi circuits | 2002
Ali Keshavarzi; Siva G. Narendra; Bradley Bloechel; S. Borkar; Vivek De
Device and test chip measurements show that forward body bias (FBB) can be used effectively to improve performance and reduce complexity of a 130nm dual-V/sub T/ technology, reduce leakage power during burn-in and standby, improve circuit delay and robustness, and reduce active power. FBB allows performance advantages of low temperature operation to be realized fully without requiring transistor redesign, and also improves V/sub T/ variations, mismatch, and g/sub m/ /spl times/ r/sub 0/ product.
international solid-state circuits conference | 2004
J. Kennedy; Robert M. Ellis; James E. Jaussi; Randy Mooney; S. Borkar; Jung-Hwan Choi; Jae-Kwan Kim; Chan-Kyong Kim; Woo-Seop Kim; Chang-Hyun Kim; Soo-In Cho; Steffen Loeffler; Jochen Hoffmann; Wolfgang Hokenmaier; R. Houghton; Thomas Vogelsang
We describe a DRAM interface operating at 3.6 Gb/s/pin implemented in 130-nm CMOS logic and 110-nm DRAM process technologies. It utilizes simultaneous bidirectional (SBD) signaling in a daisy-chained (repeated), point-to-point configuration to enable high performance scalable memory subsystems; and also provides direct attach capability for DRAMs to memory controllers or other logic devices. Source-synchronous strobes are used for data capture, minimizing strobe-to-data jitter. A low-jitter differential clock retimes the data at each DRAM on a per DIMM basis preventing jitter from accumulating in repeated data. The phase of this clock is adjusted on each DRAM to minimize the latency of the repeaters. 80 mW of total power is dissipated per DRAM I/O at 3.6 Gb/s. We present results from a system using both memory controller and DRAM repeater test chips.
international solid-state circuits conference | 2002
Sriram R. Vangal; Mark A. Anders; Nitin Borkar; E. Seligman; V. Govindarajulu; Vasantha Erraguntla; Howard Wilson; A. Pangal; V. Veeramachaneni; J. Tschanz; Yibin Ye; Dinesh Somasekhar; Bradley Bloechel; Greg Dermer; Ram K. Krishnamurthy; Krishnamurthy Soumyanath; Sanu K. Mathew; Siva G. Narendra; Mircea R. Stan; S. Thompson; Vivek De; S. Borkar
A 32 b integer execution core implements 12 instructions. Circuit and body bias techniques together increase the core clock frequency to 5 GHz. In a 130 nm six-metal dual-V/sub T/ CMOS process, the 2.3 mm/sup 2/ prototype contains 160 k transistors, with RF-ALU units dissipating 515 mW at 1.6 V.
international solid-state circuits conference | 2002
Mark A. Anders; Sanu K. Mathew; Bradley Bloechel; S. Thompson; Ram K. Krishnamurthy; Krishnamurthy Soumyanath; S. Borkar
32b Han-Carlson ALU and 8-entry /spl times/ 2-ALU instruction scheduler loop for 6.5 GHz single-cycle integer execution at 1.2 V and 25/spl deg/C uses dual-Vt CMOS technology. A single-ended, leakage-tolerant dynamic scheme enables up to 9-wide ORs with 23% critical path speed improvement, 40% active leakage power reduction compared to Koggie-Stone implementation, dense layout occupying 44, 100 /spl mu/m/sup 2/, and performance scalable to 8 GHz at 1.5 V, 25/spl deg/C.
symposium on vlsi circuits | 2004
Tanay Karnik; J. Tschanz; Bradley Bloechel; Peter Hazucha; P. Armstrong; Siva G. Narendra; Ali Keshavarzi; Krishnamurthy Soumyanath; Gregory E. Dermer; J. Maiz; S. Borkar; Vivek De
Soft error rate measurements for flip-flops on two testchips in 180nm and 130nm logic technologies show that using forward body bias improves alpha SER by 35% and neutron SER by 23%, while applying reverse body bias degrades SER by 9% to 36%. Body bias impact on SER remains virtually unchanged with technology scaling.
symposium on vlsi circuits | 2001
Siva G. Narendra; J. Tschanz; Ali Keshavarzi; S. Borkar; Vivek De
History effect measurements on different circuits in a 150 nm SOI technology show no adverse impact on worst-case delay vs. leakage trade-offs. The performance advantage of SOI over bulk is shown to come mostly from capacitance reduction. Hence, it will diminish with technology scaling.