Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Mujibur Rahman is active.

Publication


Featured researches published by Mujibur Rahman.


international conference on vlsi design | 2012

A 1.25GHz 0.8W C66x DSP Core in 40nm CMOS

Raguram Damodaran; Timothy D. Anderson; Sanjive Agarwala; Rama Venkatasubramanian; Michael Gill; Dhileep Gopalakrishnan; Anthony M. Hill; Abhijeet Ashok Chachad; Dheera Balasubramanian; Naveen Bhoria; Jonathan (Son) Hung Tran; Duc Quang Bui; Mujibur Rahman; Shriram D. Moharil; Matthew D. Pierson; Steven Mullinnix; Hung Ong; David Thompson; Krishna Chaithanya Gurram; Oluleye Olorode; Nuruddin Mahmood; Jose Luis Flores; Arjun Rajagopal; Soujanya Narnur; Daniel Wu; Alan Hales; Kyle Peavy; Robert Sussman

The next-generation C66x DSP integrated fixed and floating-point DSP implemented in TSMC 40nm process is presented in this paper. The DSP core runs at 1.25GHz at 0.9V and has a standby power consumption of 800mW. The core transistor count is 21.5 million. The DSP core features 8-way VLIW floating point Data path and a two level memory system and delivers 40 GMACS or 10 GFLOPS floating point MAC performance at 1.25GHz.


symposium on computer arithmetic | 2011

A 1.5 Ghz VLIW DSP CPU with Integrated Floating Point and Fixed Point Instructions in 40 nm CMOS

Timothy D. Anderson; Duc Quang Bui; Shriram D. Moharil; Soujanya Narnur; Mujibur Rahman; Anthony J. Lell; Eric Biscondi; Ashish Rai Shrivastava; Peter Dent; Mingjian Yan; Hasan Mahmood

A next generation VLIW DSP Central Processing Unit (CPU) which has an integrated fixed point and floating point Instruction Set Architecture (ISA) is presented. It is designed to meet a 1.5 GHz core clock frequency in a 40nm process with aggressive area and power goals. In this paper, the benchmarking process and benefits of newly defined instructions such as complex matrix multiply is explained. Also, the CPU data path is described in detail, highlighting several novel micro-architecture features. Finally, our design methodology as well as verification methodology to ensure functional correctness utilizing formal equivalent verification is described.


high performance interconnects | 2013

Heterogeneous Multi-processor Coherent Interconnect

Kai Chirca; Matthew D. Pierson; Joe Zbiciak; David Thompson; Daniel Wu; Shankar Myilswamy; Roger Griesmer; Kedar Basavaraj; Thomas Huynh; Akshit Dayal; Junbok You; Patrick Eyres; Yusuf Ghadiali; Todd Beck; Anthony M. Hill; Naveen Bhoria; Duc Quang Bui; Jonathan (Son) Hung Tran; Mujibur Rahman; Hong Fei; Shoban Srikrishna Jagathesan; Timothy D. Anderson

The rapid increase in processor and memory integration onto a single die continues to place increasingly complex demands on the interconnect network. In addition to providing low latency, high speed and high bandwidth access from all processors to all shared resources, the burdens of hardware cache coherence and resource virtualization are being placed upon the interconnect as well. This paper describes a multi-core shared memory controller interconnect (MSMC) which supports up to 12 processors, 8 independent banks of IO-coherent on-chip shared RAM, an IO-coherent external memory controller, and high bandwidth IO connections to the SoC infrastructure. MSMC also provides basic IO address translation and memory protection for the on-chip shared SRAM and external memory as well as soft error protection with hardware scrubbing for the on-chip memory. MSMC formed the heart of the compute cluster for a 28-nm CMOS device including 8 Texas Instruments C66x DSP processors and 4 cache-coherent ARM A15 processors sharing 6 MB of on-chip SRAM running at 1.3 Ghz. At this speed MSMC provides all connected masters a combined read/write bandwidth of nearly 1TB/s to access a combined read/write bandwidth of 457.6 GB/s to all shared resources @ 16 mm2.


Archive | 2014

Vector SIMD VLIW Data Path Architecture

Timothy D. Anderson; Duc Quang Bui; Mujibur Rahman; Joseph Zbiciak; Eric Biscondi; Peter Dent; Jelena Milanovic; Ashish Rai Shrivastava


Archive | 2011

Circuit which Performs Split Precision, Signed/Unsigned, Fixed and Floating Point, Real and Complex Multiplication

Timothy D. Anderson; Mujibur Rahman


Archive | 2014

Three-term predictive adder and/or subtracter

Timothy D. Anderson; Mujibur Rahman; Kai Chirca


Archive | 2011

Dual register data path architecture with registers in a data file divided into groups and sub-groups

Timothy D. Anderson; Duc Quang Bui; Eric Biscondi; Shriram D. Moharil; Mujibur Rahman; Soujanya Narnur; Peter Dent; Ashish Rai Shrivastava


Archive | 2010

Family of Multiplexer/Flip-Flops with Enhanced Testability

Mujibur Rahman; Timothy D. Anderson; Alan Hales


Archive | 2010

Test Chain Testability In a System for Testing Tri-State Functionality

Mujibur Rahman; Timothy D. Anderson; Alan Hales


Archive | 2017

Faster and More Efficient Different Precision Sum of Absolute Differences for Dynamically Configurable Block Searches for Motion Estimation

Mujibur Rahman; Djordje Senicic; Timothy D. Anderson

Collaboration


Dive into the Mujibur Rahman's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge