Sriram R. Vangal
Linköping University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Sriram R. Vangal.
ieee international conference on high performance computing data and analytics | 2010
Timothy G. Mattson; Michael Riepen; Thomas Lehnig; Paul Brett; Patrick Kennedy; Jason Howard; Sriram R. Vangal; Nitin Borkar; Gregory Ruhl; Saurabh Dighe
The number of cores integrated onto a single die is expected to climb steadily in the foreseeable future. This move to many-core chips is driven by a need to optimize performance per watt. How best to connect these cores and how to program the resulting many-core processor, however, is an open research question. Designs vary from GPUs to cache-coherent shared memory multiprocessors to pure distributed memory chips. The 48-core SCC processor reported in this paper is an intermediate case, sharing traits of message passing and shared memory architectures. The hardware has been described elsewhere. In this paper, we describe the programmers view of this chip. In particular we describe RCCE: the native message passing model created for the SCC processor.
IEEE Journal of Solid-state Circuits | 2006
Sriram R. Vangal; Yatin Hoskote; Nitin Borkar; Atila Alvandpour
A pipelined single-precision floating-point multiply-accumulator (FPMAC) featuring a single-cycle accumulate loop using base 32 and internal carry-save arithmetic with delayed addition is described. A combination of algorithmic, logic, and circuit techniques enables multiply-accumulate operations at speeds exceeding 3 GHz with single-cycle throughput. The optimizations allow removal of the costly normalization step from the critical accumulate loop. This logic is conditionally powered down using dynamic sleep transistors on long accumulate operations, saving active and leakage power. In addition, an improved leading-zero anticipator (LZA) and overflow prediction logic applicable to carry-save format is presented. In a 90-nm seven-metal dual-VT CMOS process, the 2 mm2 custom design contains 230K transistors. The fully functional first silicon achieves 6.2 GFlops of performance while dissipating 1.2 W at 3.1 GHz, 1.3-V supply
Archive | 2002
Sriram R. Vangal; Yatin Hoskote; Nitin Borkar; Jianping Xu; Vasantha Erraguntla; Shekhar Borkar
Archive | 2003
Sriram R. Vangal; Yatin Hoskote; Vasantha Erraguntla; Nitin Borkar
Archive | 2002
Sriram R. Vangal; Yatin Hoskote; Vasantha Erraguntla; Jianping Xu
Archive | 2010
Saurabh Dighe; Sriram R. Vangal; Nitin Borkar; Vivek De
Archive | 2002
Sriram R. Vangal; Yatin Hoskote; Nitin Borkar; Jianping Xu; Vasantha K. Erranguntla; Shekhar Borkar
international solid-state circuits conference | 2002
Sriram R. Vangal; Nitin Borkar; Erik Seligman; V. Govindarajulu; Vasantha Erraguntla; Howard Wilson; Amaresh Pangal; V. Veeramachaneni; Mark A. Anders; James W. Tschanz; Yibin Ye; Dinesh Somasekhar; Bradley Bloechel; Gregory E. Dermer; Ram K. Krishnamurthy; Siva G. Narendra; Mircea R. Stan; Simon G. Thompson; Vivek De; Shekhar Borkar
Archive | 2002
Sriram R. Vangal; Yatin Hoskote; Nitin Borkar; Jianping Xu; Vasantha Erraguntla; Shekhar Borkar
Archive | 2007
Arvind Singh; Sriram R. Vangal; Yatin Hoskote