Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ming-Hwa Sheu is active.

Publication


Featured researches published by Ming-Hwa Sheu.


IEEE Transactions on Circuits and Systems | 2007

A Novel High-Speed and Energy Efficient 10-Transistor Full Adder Design

Jin-Fa Lin; Yin-Tsung Hwang; Ming-Hwa Sheu; Cheng-Che Ho

In this paper, we propose a novel full adder design using as few as ten transistors per bit. Compared with other low-gate-count full adder designs using pass transistor logic, the proposed design features lower operating voltage, higher computing speed and lower energy (power delay product) operation. The design adopts inverter buffered xor/xnor designs to alleviate the threshold voltage loss problem commonly encountered in pass transistor logic design. This problem usually prevents the full adder design from operating in low supply voltage or cascading directly without extra buffering. The proposed design successfully embeds the buffering circuit in the full adder design and the transistor count is minimized. The improved buffering helps the design operate under lower supply voltage compared with existing works. It also enhances the speed performance of the cascaded operation significantly while maintaining the performance edge in energy consumption. For performance comparison, both dc andperformances of the proposed design against various full adder designs are evaluated via extensive HSPICE simulations. The simulation results, based on TSMC 2P4M 0.35-mum process models, indicate that the proposed design has the lowest working Vdd and highest working frequency among all designs using ten transistors. It also features the lowest energy consumption per addition among these designs. In addition, the performance edge of the proposed design in both speed and energy consumption becomes even more significant as the word length of the adder increases


IEEE Transactions on Circuits and Systems Ii-express Briefs | 2004

An efficient VLSI design for a residue to binary converter for general balance moduli (2/sup n/-3,2/sup n/+1,2/sup n/-1,2/sup n/+3)

Ming-Hwa Sheu; Su-Hon Lin; Chichyang Chen; Shyue-Wen Yang

In this paper, we present a new four-moduli set (2/sup n/-3,2/sup n/+1,2/sup n/-1,2/sup n/+3) and an efficient residue to binary (R/B) converter design. The merits of the proposed four-moduli set include 1) larger dynamic range; 2) higher degree of parallelism for conversion; 3) balanced bit-width for internal RNS arithmetic operations; and 4) flexible moduli set selection. According to the relation between the proposed moduli, the divide-and-conquer technique is used to design a two-level converter architecture which has lower hardware cost and shorter critical delay. For the R/B converter designed with 12-b (n=3), our architecture has about 47% saving in hardware cost and 40% saving in critical delay compared to the last work.


IEEE Transactions on Very Large Scale Integration Systems | 2012

Low-Power Pulse-Triggered Flip-Flop Design With Conditional Pulse-Enhancement Scheme

Yin-Tsung Hwang; Jin-Fa Lin; Ming-Hwa Sheu

In this paper, a novel low-power pulse-triggered flip-flop (FF) design is presented. First, the pulse generation control logic, an and function, is removed from the critical path to facilitate a faster discharge operation. A simple two-transistor and gate design is used to reduce the circuit complexity. Second, a conditional pulse-enhancement technique is devised to speed up the discharge along the critical path only when needed. As a result, transistor sizes in delay inverter and pulse-generation circuit can be reduced for power saving. Various postlayout simulation results based on UMC CMOS 90-nm technology reveal that the proposed design features the best power-delay-product performance in seven FF designs under comparison. Its maximum power saving against rival designs is up to 38.4%. Compared with the conventional transmission gate-based FF design, the average leakage power consumption is also reduced by a factor of 3.52.


international symposium on circuits and systems | 2008

The efficient VLSI design of BI-CUBIC convolution interpolation for digital image processing

Chung-chi Lin; Ming-Hwa Sheu; Huann-keng Chiang; Chishyan Liaw; Zeng-chuan Wu

This paper presents an efficient VLSI design of bicubic convolution interpolation for digital image processing. The architecture of reducing the computational complexity of generating coefficients as well as decreasing number of memory access times is proposed. Our proposed method provides a simple hardware architecture design, low computation cost and is easy to implement. Based on our technique, the high-speed VLSI architecture has been successfully designed and implemented with TSMC 0.13 mum standard cell library. The simulation results demonstrate that the high performance architecture of bi-cubic convolution interpolation at 279 MHz with 30643 gates in a 498times498 mum chip is able to process digital image scaling for HDTV in real-time.


IEEE Transactions on Circuits and Systems Ii-express Briefs | 2008

VLSI Design of Diminished-One Modulo

Su-Hon Lin; Ming-Hwa Sheu

The diminished-one modulo 2n+1 addition is an important arithmetic operation for a high-performance residue number system. In this paper, we propose a new circular-carry-selection (CCS) technique for modulo 2n+1 addition in the diminished-one number domain. The architecture design of CCS modular adder is simple and regular for various bit-width inputs. For actual VLSI implementation, the proposed modular adder can demonstrate its superiority of savings up to 39.5% in AreaxTime and 46.3% in TimexPower performances over those of the previous existing solutions under 180-nm CMOS technology. Finally, the chip area and the clock rate of CCS diminished-one modulo 216+1 adder are 26746 mum2 and 476 MHz, respectively.


field-programmable technology | 2008

2^{n}+1

Chung-chi Lin; Ming-Hwa Sheu; Huann-keng Chiang; Wen-Kai Tsai; Zeng-chuan Wu

This paper presents a novel image interpolation method, extended linear interpolation, which is a low-cost architecture with the interpolation quality compatible to that of bi-cubic convolution interpolation. The architecture of reducing the computational complexity of generating weighting coefficients is proposed. Our proposed method provides a simple hardware architecture design, low computation cost and is easy to implement. Compared to the latest bi-cubic hardware design work, the architecture saves about 60% of hardware cost. The presented architecture is implemented on the Virtex-II FPGA has been successfully designed and implemented. The simulation results demonstrate that the high performance architecture of extended linear interpolation at 104 MHz with 379 LBs is able to process digital image scaling.


IEEE Journal of Solid-state Circuits | 2002

Adder Using Circular Carry Selection

Ming-Hwa Sheu; Su-Hon Lin

In this paper, a systematic compensation approach is presented to efficiently design the approximate squaring function with a simple combinational logic circuit. Also, a set of recursive Boolean equations for general outputs is derived such that the logic circuit can be rapidly designed and reused for various bit-width inputs. In logic implementation, our design approach possesses less circuit cost and lower critical delay. Moreover, in error analysis, the maximum relative error (MRE) and average relative error (ARE) of squaring approximation are significantly improved by at least 26.95% and 61.59%, respectively, as compared with the existing approaches. Finally, a 7-bit approximate squaring function chip is accomplished to verify the circuit performance based on 0.6-/spl mu/m CMOS technology. The chip layout occupies 127/spl times/135 /spl mu/m/sup 2/ and the total number of transistors is 186.


international symposium on circuits and systems | 1993

Real-time FPGA architecture of extended linear convolution for digital image scaling

Ming-Hwa Sheu; Jau-Yien Lee; Jhing-Fa Wang; An-Nan Suen; Liang-Ying Liu

A new architecture for VLSI implementation of an 8 /spl times/ 8 2D discrete cosine transform (DCT) is proposed. The main merits of this architecture are: (1) the multipliers are replaced by memory look-up tables; (2) no input registers are required to save a column of input data; (3) the chip performance is independent of data width; and (4) the latency (the largest delay path) is short.<<ETX>>


international symposium on circuits and systems | 2005

Fast compensative design approach for the approximate squaring function

Shyue-Wen Yang; Ming-Hwa Sheu; Hsien-Huang P. Wu; Hung-En Chien; Ping-Kuo Weng; Ying-Yih Wu

We propose a new connected component labeling method based on a 3/spl times/4 window for a binary image. This method possesses parallel processing to achieve high speed operation. According to the proposed method, we design an efficient VLSI architecture which only uses two process elements and a class storage array to complete the whole label assignment after two raster scans. From experimental results, our design has better performance in terms of hardware cost and speed. This architecture has been verified on FPGA. It takes 6120 logic elements and its working frequency is about 80 MHz.


IEEE Transactions on Circuits and Systems Ii: Analog and Digital Signal Processing | 1992

A high throughput-rate architecture for 8*8 2D DCT

Ming-Hwa Sheu; Jhing-Fa Wang; Jer-Sheng Chen; An-Nan Suen; Yuan-Long Jeang; Jau-Yien Lee

Presents an efficient pipeline architecture to perform gray-scale morphologic operations. The features of the architecture are 1) lower hardware cost, 2) faster operation time in processing an image, 3) lower data access times from the image memory, 4) shorter latency, 5) suitability for VLSI implementation, and 6) adaptability for N*N morphologic operations. >

Collaboration


Dive into the Ming-Hwa Sheu's collaboration.

Top Co-Authors

Avatar

Wen-Kai Tsai

National Yunlin University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Ming-Der Shieh

National Cheng Kung University

View shared research outputs
Top Co-Authors

Avatar

Su-Hon Lin

National Yunlin University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Yin-Tsung Hwang

National Chung Hsing University

View shared research outputs
Top Co-Authors

Avatar

Chung-chi Lin

National Yunlin University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Jin-Fa Lin

Chaoyang University of Technology

View shared research outputs
Top Co-Authors

Avatar

Huann-keng Chiang

National Yunlin University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar

Shyue-Wen Yang

National Yunlin University of Science and Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Siang-Min Siao

National Yunlin University of Science and Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge