Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Duc Quang Bui is active.

Publication


Featured researches published by Duc Quang Bui.


international solid-state circuits conference | 2002

A 600-MHz VLIW DSP

Sanjive Agarwala; P. Koeppen; Timothy D. Anderson; Anthony M. Hill; M. Ales; Raguram Damodaran; Lewis Nardini; P. Wiley; Steven Mullinnix; J. Leach; Anthony J. Lell; Manzur Gill; J. Golston; D. Hoyle; Arjun Rajagopal; Abhijeet Ashok Chachad; M. Agarwala; R. Castille; N. Common; John Apostol; H. Mahmood; Manjeri Krishnan; Duc Quang Bui; Quang-Dieu An; Peter Groves; Luong Nguyen; N.S. Nagaraj; R. Simar

A 600 MHz VLIW DSP, which implements the C64x VelociTI.2/spl trade/ architecture delivers 4800 MIPS, 2400 (16 b) or 4800 (8 b) million multiply accumulates at 0.3 mW/MMAC (16 b). The chip has 64 M transistors and dissipates 718 mW at 600 MHz and 1.2 V, and 200 mW at 300 MHz and 0.9 V. It has an 8-way VLIW DSP core, a 2-level memory system, and 2.4 GB/s I/O bandwidth. The DSP chip is implemented in 0.13 μm CMOS technology with 6-layer copper metalization.


international conference on vlsi design | 2012

A 1.25GHz 0.8W C66x DSP Core in 40nm CMOS

Raguram Damodaran; Timothy D. Anderson; Sanjive Agarwala; Rama Venkatasubramanian; Michael Gill; Dhileep Gopalakrishnan; Anthony M. Hill; Abhijeet Ashok Chachad; Dheera Balasubramanian; Naveen Bhoria; Jonathan (Son) Hung Tran; Duc Quang Bui; Mujibur Rahman; Shriram D. Moharil; Matthew D. Pierson; Steven Mullinnix; Hung Ong; David Thompson; Krishna Chaithanya Gurram; Oluleye Olorode; Nuruddin Mahmood; Jose Luis Flores; Arjun Rajagopal; Soujanya Narnur; Daniel Wu; Alan Hales; Kyle Peavy; Robert Sussman

The next-generation C66x DSP integrated fixed and floating-point DSP implemented in TSMC 40nm process is presented in this paper. The DSP core runs at 1.25GHz at 0.9V and has a standby power consumption of 800mW. The core transistor count is 21.5 million. The DSP core features 8-way VLIW floating point Data path and a two level memory system and delivers 40 GMACS or 10 GFLOPS floating point MAC performance at 1.25GHz.


symposium on computer arithmetic | 2011

A 1.5 Ghz VLIW DSP CPU with Integrated Floating Point and Fixed Point Instructions in 40 nm CMOS

Timothy D. Anderson; Duc Quang Bui; Shriram D. Moharil; Soujanya Narnur; Mujibur Rahman; Anthony J. Lell; Eric Biscondi; Ashish Rai Shrivastava; Peter Dent; Mingjian Yan; Hasan Mahmood

A next generation VLIW DSP Central Processing Unit (CPU) which has an integrated fixed point and floating point Instruction Set Architecture (ISA) is presented. It is designed to meet a 1.5 GHz core clock frequency in a 40nm process with aggressive area and power goals. In this paper, the benchmarking process and benefits of newly defined instructions such as complex matrix multiply is explained. Also, the CPU data path is described in detail, highlighting several novel micro-architecture features. Finally, our design methodology as well as verification methodology to ensure functional correctness utilizing formal equivalent verification is described.


high performance interconnects | 2013

Heterogeneous Multi-processor Coherent Interconnect

Kai Chirca; Matthew D. Pierson; Joe Zbiciak; David Thompson; Daniel Wu; Shankar Myilswamy; Roger Griesmer; Kedar Basavaraj; Thomas Huynh; Akshit Dayal; Junbok You; Patrick Eyres; Yusuf Ghadiali; Todd Beck; Anthony M. Hill; Naveen Bhoria; Duc Quang Bui; Jonathan (Son) Hung Tran; Mujibur Rahman; Hong Fei; Shoban Srikrishna Jagathesan; Timothy D. Anderson

The rapid increase in processor and memory integration onto a single die continues to place increasingly complex demands on the interconnect network. In addition to providing low latency, high speed and high bandwidth access from all processors to all shared resources, the burdens of hardware cache coherence and resource virtualization are being placed upon the interconnect as well. This paper describes a multi-core shared memory controller interconnect (MSMC) which supports up to 12 processors, 8 independent banks of IO-coherent on-chip shared RAM, an IO-coherent external memory controller, and high bandwidth IO connections to the SoC infrastructure. MSMC also provides basic IO address translation and memory protection for the on-chip shared SRAM and external memory as well as soft error protection with hardware scrubbing for the on-chip memory. MSMC formed the heart of the compute cluster for a 28-nm CMOS device including 8 Texas Instruments C66x DSP processors and 4 cache-coherent ARM A15 processors sharing 6 MB of on-chip SRAM running at 1.3 Ghz. At this speed MSMC provides all connected masters a combined read/write bandwidth of nearly 1TB/s to access a combined read/write bandwidth of 457.6 GB/s to all shared resources @ 16 mm2.


Archive | 1997

Single carry/borrow propagate adder/decrementer for generating register stack addresses in a microprocessor

Tuan Q. Dao; Debjit Das Sarma; Duc Quang Bui


Archive | 2014

Highly integrated scalable, flexible DSP megamodule architecture

Timothy D. Anderson; Joseph Zbiciak; Duc Quang Bui; Abnijeet A. Chachad; Kai Chirca; Naveen Bhoria; Matthew D. Pierson; Daniel Wu


Archive | 2014

Compiler-control Method for Load Speculation In a Statically Scheduled Microprocessor

Timothy D. Anderson; Joseph Zbiciak; Duc Quang Bui; Mel Alan Phipps; Todd T. Hahn


Archive | 2014

Vector SIMD VLIW Data Path Architecture

Timothy D. Anderson; Duc Quang Bui; Mujibur Rahman; Joseph Zbiciak; Eric Biscondi; Peter Dent; Jelena Milanovic; Ashish Rai Shrivastava


Archive | 2010

Reduced-level two's complement arithmetic unit

Duc Quang Bui; Timothy D. Anderson


Archive | 2014

Vector Load and Duplicate Operations

Timothy D. Anderson; Duc Quang Bui; Peter Dent

Collaboration


Dive into the Duc Quang Bui's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge