Yu-Fai Fung
Hong Kong Polytechnic University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Yu-Fai Fung.
European Journal of Operational Research | 2003
Ceyda Oguz; M. Fikret Ercan; T.C. Edwin Cheng; Yu-Fai Fung
Abstract The aim of this paper is to propose heuristic algorithms for a two-stage flow-shop scheduling problem with multiprocessor tasks to minimize the makespan. The heuristic algorithms are constructive in nature. They schedule the tasks one at a time from a given sequence. The rules considered for sequencing the tasks are based on simple priority rules from the literature. Some lower bounds for the problem are also derived to be used in the performance analysis of the heuristic algorithms. Next, the average performance of the proposed heuristic algorithms is analyzed by a computational experiment using randomly generated problem instances. The results suggest that these heuristic algorithms are both efficient and effective. The paper concludes with a discussion of the insights obtained from the experimental analysis about this type of scheduling problems.
Microprocessors and Microsystems | 2000
M.F. Ercan; Yu-Fai Fung
Abstract This paper introduces a multi-layer MIMD system for computer vision applications. The system is provided with three hierarchical processing layers, each of which is dedicated to one level of processing to create a pipelining effect. Unlike the traditional approach, where a 2D mesh connected array is used, layers in this system are linear arrays. Simplicity and expandability are the main advantages. A prototype of the system is implemented using off-the-shelf components and the performance of the various vision operations is analyzed.
Microprocessors and Microsystems | 2002
Yu-Fai Fung; M.F. Ercan; T.K. Ho; W. L. Cheung
Streaming SIMD Extensions (SSE) is a unique feature embedded in the Pentium III and IV classes of microprocessors. By fully exploiting SSE, parallel algorithms can be implemented on a standard personal computer and a theoretical speedup of four can be achieved. In this paper, we demonstrate the implementation of a parallel LU matrix decomposition algorithm for solving linear systems with SSE and discuss advantages and disadvantages of this approach based on our experimental study.
pacific rim conference on communications, computers and signal processing | 2003
Wai-leung Cheung; Yu-Fai Fung; Weizhao Wang; T.M. Chan
The introduction of personal computing and wireless communication technology provides an option for on site device software updating and data retrieving. This is especially true for any devices sitting in a remote site where computing network is not accessible. In many advanced computing systems, frequent software updating and configuration profiles refreshing are required. This is clumsy and error prone procedures when users are not familiar with the operating systems. Suppose all the necessary files and programs are predefined in a mobile computing device such as notebooks, PDAs, or even mobile phones. All necessary files and software can be transferred to the corresponding computing devices and PCs at remote sites through wireless communication links such as Bluetooth, infrared, general packet radio service (GPRS). This idea helps solve the initial installation cost of a communication network to a remote site.
Computer Physics Communications | 2001
Zeke S.H. Chan; H.W. Ngan; Yu-Fai Fung; A.B. Rad
Abstract In this work we design an advanced Evolutionary Algorithm (EA) for optimizing the discrete Kalman filter (KF) model. The EA employs parallel architecture and an advanced mutation operator called the “Selection Follower”. Its performance is benchmarked with that of the Expectation-Maximization algorithm (EM) in minimizing the mean-square-error of the KF prediction. Experimental results show that the EA consistently outperforms the EM and runs significantly faster under the same number of function evaluations.
international conference on computational science and its applications | 2007
M. Fikret Ercan; Yu-Fai Fung
In many industrial and computing applications, proper scheduling of tasks can determine the overall efficiency of the system. The algorithm, presented in this paper, tackles the scheduling problem in a multi-layer multiprocessor environment, which exists in many computing and industrial applications. Based on the scheduling terminology, the problem can be defined as multiprocessor task scheduling in hybrid flow-shops. This paper presents a particle swarm optimization algorithm for the solution and reports its performance. The results are compared with other well known meta-heuristic techniques proposed for the solution of the same problem. Our results show that particle swarm optimization has merits in solving multiprocessor task scheduling in a hybrid flow-shop environment.
pacific rim conference on communications, computers and signal processing | 2003
Yu-Fai Fung; Wai-leung Cheung; M.G. Singh; M.F. Ercan
Linear equations are used in the mathematical models of many engineering problems, including load-flow in power engineering and electrical railway simulation. A common approach for solving the linear equations is by LU decomposition, which is then followed by forward and backward substitution. The LU decomposition operation is a computation intensive algorithm and in this paper, we present a cost-effective SIMD parallel algorithm for the LU decomposition of sparse matrices. The algorithm can be implemented using a common personal computer and does not require other hardware support.
Computers & Electrical Engineering | 2001
M. Fikret Ercan; Yu-Fai Fung; Ceyda Oǧuz
Abstract Multilayer multiprocessor systems are generally employed in real-time applications such as robotics and computer vision. This paper introduces three heuristic algorithms for multiprocessor task scheduling in such systems. In our model, tasks with arbitrary processing times and arbitrary processor requirements are considered. The scheduling aims at minimising completion time of processes in a two-layer system. We employed an effective lower bound (LB) for the problem. Then, we analysed the average performance of the heuristic algorithms by computing the average percentage deviation of each heuristic solution from the LB on a set of randomly generated problems. We have also applied these algorithms for scheduling computer vision tasks running on prototype multilayer architecture. Our computational and empirical results showed that the proposed heuristic algorithms perform well.
Future Generation Computer Systems | 2000
M. Fikret Ercan; Yu-Fai Fung; M. Suleyman Demokan
Abstract This paper describes a study of parallel image processing algorithms implemented on a one-dimensional DSP array. DSPs are developed for computationally intensive signal processing operations. Recently introduced parallel DSPs can be used for all levels of image processing operations and they provide easy development of a parallel system. In addition, due to the computing power delivered by these processors, we can employ coarse grain parallelism instead of the traditional fine-grain parallelism. Modularity, expandability and easy programming are other advantages of parallel DSPs. In this paper, parallel implementation of some selected image processing algorithms is described and performance results are presented.
ieee international conference on teaching assessment and learning for engineering | 2012
Yu-Fai Fung; Shuyun Ren; M. Fikret Ercan
When implementing final year projects, students may need to program a microprocessor in order to achieve the project objectives. Therefore, a system is designed in order to assist students who have limited background knowledge in computer engineering to learn the basics of microprocessor programming primarily using the C programming language. In addition to the software, various hardware components are provided so that students can develop simple systems so the learning outcomes can be enhanced by practice. From feedback collected, users found that they are able to learn simple C language programming for a microprocessor with the help of the system.