Publication


Featured research published by Trung A. Diep.


memory performance dealing with applications systems and architecture | 2004

A case for shared instruction cache on chip multiprocessors running OLTP

Partha Kundu; Murali Annavaram; Trung A. Diep; John Paul Shen

Due to their large code footprint, OLTP workloads suffer from significant I-cache miss rates on contemporary microprocessors. This paper analyzes the I-stream behavior of an OLTP workload, called the Oracle Database Benchmark (ODB), on Chip-Multiprocessors (CMP). Our results show that, although the overall code footprint of ODB is large, multiple ODB threads running concurrently on multiple processors tend to access common code segments frequently, thus exhibiting significant constructive sharing. In fact, in a CMP system, an I-cache shared between multiple processors incurs a miss rate similar to that of a dedicated I-cache per processor, where each per-processor I-cache has the same capacity as the shared I-cache. Based on these observations, this paper makes the case for a shared I-cache organization in a CMP, instead of the traditional approach of using a dedicated I-cache per processor. Furthermore, this paper shows that the OLTP code stream exhibits good spatial locality. Adding a simple dedicated Line Buffer per processor can exploit this spatial locality effectively, reducing latency and bandwidth requirements on the shared cache. The proposed shared I-cache organization results in an improvement of at least 5X in miss rate over a dedicated cache organization, for the same total capacity.
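The constructive-sharing argument in this abstract can be illustrated with a toy model. The sketch below is not the paper's methodology and does not use ODB traces. It assumes fully associative LRU caches, a synthetic trace in which every core loops over the same code lines, and made-up sizes (`num_cores`, `total_capacity`, `footprint`, and `passes` are all hypothetical). The per-processor Line Buffer is not modeled.

```python
from collections import OrderedDict


class LRUCache:
    """Fully associative cache holding `capacity` lines, LRU replacement."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.lines = OrderedDict()
        self.accesses = 0
        self.misses = 0

    def access(self, line):
        self.accesses += 1
        if line in self.lines:
            self.lines.move_to_end(line)  # mark as most recently used
            return
        self.misses += 1
        self.lines[line] = None
        if len(self.lines) > self.capacity:
            self.lines.popitem(last=False)  # evict least recently used


def simulate(num_cores=4, total_capacity=64, footprint=48, passes=10):
    """Return (shared_miss_rate, private_miss_rate) for equal total capacity.

    Assumption: all cores fetch the same `footprint` code lines in a loop,
    interleaved round-robin. Real OLTP code streams are far less regular.
    """
    shared = LRUCache(total_capacity)
    private = [LRUCache(total_capacity // num_cores) for _ in range(num_cores)]
    for _ in range(passes):
        for line in range(footprint):
            for core in range(num_cores):
                shared.access(line)
                private[core].access(line)
    private_misses = sum(c.misses for c in private)
    private_accesses = sum(c.accesses for c in private)
    return shared.misses / shared.accesses, private_misses / private_accesses
```

With these synthetic parameters the common footprint fits in the shared cache but not in any single private cache, so the private caches thrash while the shared cache sees only cold misses. The size of the gap is an artifact of the toy trace and should not be read as a reproduction of the reported 5X figure.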


international conference on computer design | 2002

Branch behavior of a commercial OLTP workload on Intel IA32 processors

Murali Annavaram; Trung A. Diep; John Paul Shen

This paper presents a detailed branch characterization of an Oracle-based commercial on-line transaction processing workload, Oracle Database Benchmark (ODB), running on an IA32 processor. We ran a well-tuned ODB on Simics, a full system simulator, to collect the instruction traces used in this study. We compare the branch behavior of ODB with the branch behaviors of gcc, gzip and mcf from the SPECINT 2000 benchmark suite. Contrary to the popular belief that databases have unpredictable branches, we show that, with larger predictors that capture enough branch history information and branch prediction schemes that reduce aliasing, conditional branches in ODB are more predictable than in gcc, gzip and mcf. Due to frequent context switching in ODB, a hardware return address stack is ineffective in predicting return addresses for ODB. Based on further analysis, we propose and evaluate an enhanced return address predictor, which reduces return address mispredictions in ODB by 40%.
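The effect of context switching on a return address stack can be shown with a minimal sketch. It assumes a single stack shared by all software threads, pushed on every call and popped on every return. The traces, addresses, and names (`ReturnAddressStack`, `count_mispredictions`) are hypothetical. The enhanced predictor proposed in the paper is not described in this abstract and is not modeled here.

```python
class ReturnAddressStack:
    """Fixed-depth hardware-style return address stack (RAS)."""

    def __init__(self, depth=8):
        self.depth = depth
        self.stack = []

    def push(self, return_addr):
        if len(self.stack) == self.depth:
            self.stack.pop(0)  # overflow: drop the oldest entry
        self.stack.append(return_addr)

    def pop(self):
        # Returns None when empty, which never matches a real address.
        return self.stack.pop() if self.stack else None


def count_mispredictions(events, depth=8):
    """events: list of ("call", return_addr) or ("ret", actual_target)."""
    ras = ReturnAddressStack(depth)
    wrong = 0
    for kind, addr in events:
        if kind == "call":
            ras.push(addr)
        elif ras.pop() != addr:
            wrong += 1
    return wrong


# Two threads run back to back: calls and returns nest properly.
SEQUENTIAL = [("call", 800), ("ret", 800),
              ("call", 100), ("call", 200), ("ret", 200), ("ret", 100)]

# Same work, but context switches interleave the threads mid-call.
INTERLEAVED = [("call", 800),                 # thread B
               ("call", 100), ("call", 200),  # thread A
               ("ret", 800),                  # thread B resumes
               ("ret", 200), ("ret", 100)]    # thread A resumes
```

Calling `count_mispredictions(SEQUENTIAL)` and `count_mispredictions(INTERLEAVED)` shows the difference: once the interleaving breaks call/return nesting, every return in the second trace pops an address that belongs to a different call.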


international symposium on microarchitecture | 2003

Scaling and Characterizing Database Workloads: Bridging the Gap between Research and Practice

Richard A. Hankins; Trung A. Diep; Murali Annavaram; Brian Hirano; Harald Eri; Hubert Nueckel; John Paul Shen


Archive | 2011

Mechanism for monitoring instruction set based thread execution on a plurality of instruction sequencers

Richard A. Hankins; Gautham N. Chinya; Hong Wang; Shivnandan D. Kaushik; Bryant Bigbee; John Paul Shen; Trung A. Diep; Xiang Zou; Baiju V. Patel; Paul M. Petersen; Sanjiv Shah; Ryan N. Rakvic; Prashant Sethi


Archive | 2005

Scheduling optimizations for user-level threads

Ryan N. Rakvic; Richard A. Hankins; Hong Wang; Trung A. Diep; Xinmin Tian; Paul M. Petersen; Sanjiv Shah; John Paul Shen; Gautham N. Chinya; Shivnandan D. Kaushik; Bryant Bigbee; Baiju V. Patel; Douglas R. Armstrong


Archive | 2005

Mechanism to emulate user-level multithreading on an os-sequestered sequencer

Gautham N. Chinya; Hong Wang; Xiang Zou; James P. Held; Prashant Sethi; Trung A. Diep; Anil Aggarwal; Baiju V. Patel; Shiv Kaushik; Bryant Bigbee; John Paul Shen; Richard A. Hankins; John L. Reid


Archive | 2004

Mechanism to schedule threads on OS-sequestered sequencers without operating system intervention

Richard A. Hankins; Hong Wang; Gautham N. Chinya; Trung A. Diep; Shivnandan D. Kaushik; Bryant Bigbee; John Paul Shen; Asit Mallick; Baiju V. Patel; James P. Held; Milind Girkar; Prashant Sethi; Xinmin Tian


Archive | 1999

Arbitration mechanism for a computer system having a unified memory architecture

Steve J. Clohset; Trung A. Diep; Wishwesh Gandhi; Thomas A. Piazza; Aditya Sreenivas; Tuong Trieu


Archive | 2005

Compiler-based scheduling optimization hints for user-level threads

Shih-Wei Liao; Ryan N. Rakvic; Richard A. Hankins; Hong Wang; Gansha Wu; Guei-Yuan Lueh; Xinmin Tian; Paul M. Petersen; Sanjiv Shah; Trung A. Diep; John Paul Shen; Gautham N. Chinya


Archive | 2005

Mechanism to schedule threads on OS-sequestered sequencers without operating system intervention

Bryant Bigbee; Richard A. Hankins; Hong Wang; Trung A. Diep; Xinmin Tian; Shiv Kaushik; John Paul Shen; Asit Mallick; Milind Girkar; Prashant Sethi; Gautham N. Chinya; Baiju V. Patel; James P. Held
