Shantanu R. Gupta
Intel
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Shantanu R. Gupta.
high-performance computer architecture | 2013
Amin Ansari; Shuguang Feng; Shantanu R. Gupta; Josep Torrellas; Scott A. Mahlke
Power dissipation limits combined with increased silicon integration have led microprocessor vendors to design chip multiprocessors (CMPs) with relatively simple (lightweight) cores. While these designs provide high throughput, single-thread performance has stagnated or even worsened. Asymmetric CMPs offer some relief by providing a small number of high-performance (aggressive) cores that can accelerate specific threads. However, threads are only accelerated when they can be mapped to an aggressive core, which are restricted in number due to power and thermal budgets of the chip. Rather than using the aggressive cores to accelerate threads, this paper argues that the aggressive cores can have a multiplicative impact on single-thread performance by accelerating a large number of lightweight cores and providing an illusion of a chip full of aggressive cores. Specifically, we propose an adaptive asymmetric CMP, Illusionist, that can dynamically boost the system throughput and get a higher single-thread performance across the chip. To accelerate the performance of many lightweight cores, those few aggressive cores run all the threads that are running on the lightweight cores and generate execution hints. These hints are then used to accelerate the execution of the lightweight cores. However, the hardware resources of the aggressive core are not large enough to allow the simultaneous execution of a large number of threads. To overcome this hurdle, Illusionist performs aggressive dynamic program distillation to execute small, critical segments of each lightweight-core thread. A combination of dynamic code removal and phase-based pruning distill programs to a tiny fraction of their original contents. Experiments demonstrate that Illusionist achieves 35% higher single thread performance for all the threads running on the system, compared to a CMP with all lightweight cores, while achieving almost 2X higher system throughput compared to a CMP with all aggressive cores.
Archive | 1995
Shantanu R. Gupta; Thomas D. Fletcher
Archive | 1994
David B. Papworth; Andrew F. Glew; Glenn J. Hinton; Robert P. Colwell; Michael A. Fetterman; Shantanu R. Gupta; James S. Griffith
Archive | 1994
James S. Griffth; Shantanu R. Gupta; Narayan Hegde
Archive | 1996
David B. Papworth; Andrew F. Glew; Michael A. Fetterman; Glenn J. Hinton; Robert P. Colwell; Steven J. Griffith; Shantanu R. Gupta; Narayan Hegde
Archive | 1994
Shantanu R. Gupta; James S. Griffith; Glenn J. Hinton
Archive | 1995
David B. Papworth; Andrew F. Glew; Michael A. Fetterman; Glenn J. Hinton; Robert P. Colwell; Steven J. Griffith; Shantanu R. Gupta; Narayan Hegde
Archive | 1996
Shantanu R. Gupta; James S. Griffith
Archive | 1993
Shantanu R. Gupta; James S. Griffith
Archive | 1995
James S. Griffith; Shantanu R. Gupta; Glenn J. Hinton