Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Aiqun Cao is active.

Publication


Featured researches published by Aiqun Cao.


IEEE Transactions on Very Large Scale Integration Systems | 2007

SAMBA-Bus: A High Performance Bus Architecture for System-on-Chips

Ruibing Lu; Aiqun Cao; Cheng-Kok Koh

A high performance communication architecture, SAMBA-bus, is proposed in this paper. In SAMBA-bus architecture, multiple compatible bus transactions can be performed simultaneously with only a single bus access grant from the bus arbiter. Experimental results show that, compared with a traditional bus architecture, the SAMBA-bus architecture can have up to 3.5 times improvement in the effective bandwidth, and up to 15 times reduction in the average communication latency. In addition, the performance of SAMBA-bus architecture is affected only slightly by arbitration latency, because bus transactions can be performed without waiting for the bus access grant from the arbiter. This feature is desirable in SoC designs with large numbers of modules and long communication delay between modules and the bus arbiter


international symposium on low power electronics and design | 2001

Selectively clocked skewed logic (SCSL): a robust low-power logic style for high-performance applications

Naran Sirisantana; Aiqun Cao; Shawn Davidson; Cheng-Kok Koh; Kaushik Roy

In very high performance designs, dynamic circuits, such as domino logic, are used because of their high speed. Skewed logic circuits can be used to achieve designs having performance comparable to that of domino but with better scalability. Moreover, a selective clocking scheme may be applied to enhance the power savings for skewed logic circuits. This paper proposes Selectively Clocked Skewed Logic (SCSL), a new circuit style based on skewed logic aiming for low clock power consumption. The results on ISCAS benchmark circuits implemented with this circuit design style show that the total power consumption can be reduced by 52.05% when compared to that of domino circuit with comparable performance.


international conference on computer design | 2003

Non-crossing OBDDs for mapping to regular circuit structures

Aiqun Cao; Cheng-Kok Koh

We propose a novel compact BDD structure, called noncrossing ordered BDD (NCOBDD), that can be mapped directly to a regular circuit structure. Compared with other BDD-based regular structures, NCOBDD-mapped circuits reduce the costs of area, power and latency, while preserving the regularity of the structures. We also present an algorithm that uses a top-down level-by-level sweep to construct minimal NCOBDDs. Experimental results show that for asymmetric benchmark circuits, the average reduction on area, power and latency are 61.6%, 53.1% and 69.2%, respectively, compared with yet another decision diagram (YADD) [A. Mukherjee et al., (1999)].


international symposium on quality electronic design | 2002

Synthesis of selectively clocked skewed logic circuits

Aiqun Cao; Naran Sirisantana; Cheng-Kok Koh; Kaushik Roy

Skewed logic circuits with a selective clocking scheme have performance comparable to that of Domino logic, but consume much lower power. Unlike Domino, the reconvergent path problem in skewed logic circuits may be overcome without logic duplication due to the static nature of skewed logic. In this paper, we propose a novel approach that alleviates the need for logic duplication when dealing with reconvergent paths in skewed logic circuits. We also propose a dynamic programming-based heuristic to determine a low-power clocking scheme for skewed logic circuits. Experimental results show that 32% of gates in a skewed logic circuit are duplicated, whereas 69% of gates in a Domino logic circuit are duplicated. The total power saving of skewed logic over Domino logic is 32.6% on average.


asia and south pacific design automation conference | 2005

Improving the scalability of SAMBA bus architecture

Ruibing Lu; Aiqun Cao; Cheng-Kok Koh

SAMBA bus (Lu and Koh, 2003) is a high performance bus architecture that can deliver multiple transactions in one bus cycle under single-winner bus arbitration. The bus architecture displays several advantages such as, high bandwidth, low latency, and low performance penalty from arbitration delay, all of which make it more scalable than traditional buses. However, its scalability may be limited by the bus access logic delay. As a module is connected to the bus through its interface unit, which is connected in series on the bus, the bus logic delay increases linearly as the bus size increases. In this paper, we propose to increase the scalability of SAMBA buses through two methods: control signal lookahead and module clustering. The control signal lookahead technique can determine the bus access control signal in advance, thereby reducing the effective delay of each interface unit. Module clustering, on the other hand, can reduce the number of interface units attached to a bus. Experimental results show that combining these two methods can effectively reduce the bus logic delay, and thus increase the scalability of SAMBA buses.


asia and south pacific design automation conference | 2005

Post-layout logic duplication for synthesis of domino circuits with complex gates

Aiqun Cao; Ruibing Lu; Cheng-Kok Koh

Logic duplication to resolve the logic reconvergent paths problem encountered in Domino logic synthesis is expensive in terms of area and power. In this paper, we propose a combined logic duplication minimization and technology mapping scheme for Domino circuits with complex gates. The logic duplication is performed as a post-layout step as the duplication cost is minimized based on accurate timing information. Experimental results show significant improvements in area, power, and delay.


design automation conference | 2004

Post-layout logic optimization of domino circuits

Aiqun Cao; Cheng-Kok Koh

Logic duplication, a commonly used synthesis technique to remove trapped inverters in reconvergent paths of Domino circuits, incurs high area and power penalties. In this paper, we propose a synthesis scheme to reduce the duplication cost by allowing inverters in Domino logic under certain timing constraints. In order to guarantee the robustness of such Domino circuits, we perform the reduction of logic duplication at the physical level. Experimental results show significant reduction in duplication cost, which translates into significant improvements in area, power, and/or delay.


asia and south pacific design automation conference | 2003

Integer linear programming-based synthesis of skewed logic circuits

Aiqun Cao; Naran Sirisantana; Cheng-Kok Koh; Kaushik Roy

We present an integer linear programming-based approach for solving the logic reconvergence problem in skewed logic circuits with minimal logic duplication cost. A simplification technique is applied to reduce the complexity of the ILP problem greatly so that the run time is more affordable. Experimental results show that an average of 18% of original gates are duplicated in skewed logic circuits, whereas 65% in Domino logic circuits are duplicated. The average power saving over Domino logic circuits is 40.9%.


ACM Transactions on Design Automation of Electronic Systems | 2006

Postlayout optimization for synthesis of Domino circuits

Aiqun Cao; Ruibing Lu; Chen Li; Cheng-Kok Koh

Logic duplication, a commonly used synthesis technique to remove trapped inverters in reconvergent paths of Domino circuits, incurs high area and power penalties. In this article, we propose a synthesis scheme to reduce the duplication cost by allowing inverters in Domino logic under certain timing constraints for both simple and complex gates. Moreover, we can include the logic duplication minimization during technology mapping for synthesis of Domino circuits with complex gates. In order to guarantee the robustness of such Domino circuits, we perform the logic optimization as a postlayout step. Experimental results show significant reduction in duplication cost, which translates into significant improvements in area and power. As a byproduct, the timing performance is also improved owing to smaller layout area and/or logic depth.


ACM Transactions on Design Automation of Electronic Systems | 2005

Synthesis of skewed logic circuits

Aiqun Cao; Naran Sirisantana; Cheng-Kok Koh; Kaushik Roy

Skewed logic circuits belong to a noise-tolerant high-performance static circuit family. Skewed logic circuits can achieve performance comparable to that of Domino logic circuits but with much lower power consumption. Two factors contribute to the reduction in power. First, by exploiting the static nature of skewed logic circuits, we can alleviate the cost of logic duplication which is typically required to overcome the logic reconvergence problem in both Domino logic and skewed logic circuits. Second, a selective clocking scheme can be applied to a skewed logic circuit to reduce the clock load and hence, clock power. In this article, we propose a two-step synthesis scheme of skewed logic circuits. In the first step, an integer linear programming-based approach is presented to overcome the logic reconvergence problem in skewed logic circuits with minimal logic duplication cost. In the second step, a dynamic programming-based heuristic is applied to achieve an optimal selective clocking scheme. Experimental results show that the average power saving of skewed logic circuits over Domino logic circuits is 41.1&percent;.

Collaboration


Dive into the Aiqun Cao's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge