Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sriram R. Vangal is active.

Publication


Featured researches published by Sriram R. Vangal.


ieee international conference on high performance computing data and analytics | 2010

The 48-core SCC Processor: the Programmer's View

Timothy G. Mattson; Michael Riepen; Thomas Lehnig; Paul Brett; Patrick Kennedy; Jason Howard; Sriram R. Vangal; Nitin Borkar; Gregory Ruhl; Saurabh Dighe

The number of cores integrated onto a single die is expected to climb steadily in the foreseeable future. This move to many-core chips is driven by a need to optimize performance per watt. How best to connect these cores and how to program the resulting many-core processor, however, is an open research question. Designs vary from GPUs to cache-coherent shared memory multiprocessors to pure distributed memory chips. The 48-core SCC processor reported in this paper is an intermediate case, sharing traits of message passing and shared memory architectures. The hardware has been described elsewhere. In this paper, we describe the programmers view of this chip. In particular we describe RCCE: the native message passing model created for the SCC processor.


IEEE Journal of Solid-state Circuits | 2006

A 6.2-GFlops Floating-Point Multiply-Accumulator With Conditional Normalization

Sriram R. Vangal; Yatin Hoskote; Nitin Borkar; Atila Alvandpour

A pipelined single-precision floating-point multiply-accumulator (FPMAC) featuring a single-cycle accumulate loop using base 32 and internal carry-save arithmetic with delayed addition is described. A combination of algorithmic, logic, and circuit techniques enables multiply-accumulate operations at speeds exceeding 3 GHz with single-cycle throughput. The optimizations allow removal of the costly normalization step from the critical accumulate loop. This logic is conditionally powered down using dynamic sleep transistors on long accumulate operations, saving active and leakage power. In addition, an improved leading-zero anticipator (LZA) and overflow prediction logic applicable to carry-save format is presented. In a 90-nm seven-metal dual-VT CMOS process, the 2 mm2 custom design contains 230K transistors. The fully functional first silicon achieves 6.2 GFlops of performance while dissipating 1.2 W at 3.1 GHz, 1.3-V supply


Archive | 2002

Network protocol engine

Sriram R. Vangal; Yatin Hoskote; Nitin Borkar; Jianping Xu; Vasantha Erraguntla; Shekhar Borkar


Archive | 2003

Network protocol processor

Sriram R. Vangal; Yatin Hoskote; Vasantha Erraguntla; Nitin Borkar


Archive | 2002

Network protocol off-load engines

Sriram R. Vangal; Yatin Hoskote; Vasantha Erraguntla; Jianping Xu


Archive | 2010

Performing Variation-Aware Profiling And Dynamic Core Allocation For A Many-Core Processor

Saurabh Dighe; Sriram R. Vangal; Nitin Borkar; Vivek De


Archive | 2002

Tracking out-of-order packets

Sriram R. Vangal; Yatin Hoskote; Nitin Borkar; Jianping Xu; Vasantha K. Erranguntla; Shekhar Borkar


international solid-state circuits conference | 2002

5GHz 32b integer-execution core in 130nm dual-VT CMOS

Sriram R. Vangal; Nitin Borkar; Erik Seligman; V. Govindarajulu; Vasantha Erraguntla; Howard Wilson; Amaresh Pangal; V. Veeramachaneni; Mark A. Anders; James W. Tschanz; Yibin Ye; Dinesh Somasekhar; Bradley Bloechel; Gregory E. Dermer; Ram K. Krishnamurthy; Siva G. Narendra; Mircea R. Stan; Simon G. Thompson; Vivek De; Shekhar Borkar


Archive | 2002

Packet-based clock signal

Sriram R. Vangal; Yatin Hoskote; Nitin Borkar; Jianping Xu; Vasantha Erraguntla; Shekhar Borkar


Archive | 2007

Crossbar channel router having a distributed arbitration scheme

Arvind Singh; Sriram R. Vangal; Yatin Hoskote

Collaboration


Dive into the Sriram R. Vangal's collaboration.

Researchain Logo
Decentralizing Knowledge