Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Serge Vernalde is active.

Publication


Featured researches published by Serge Vernalde.


field-programmable logic and applications | 2003

ADRES: an architecture with tightly coupled VLIW processor and coarse-grained reconfigurable Matrix

Bingfeng Mei; Serge Vernalde; Diederik Verkest; Hugo De Man; Rudy Lauwereins

The coarse-grained reconfigurable architectures have advantages over the traditional FPGAs in terms of delay, area and configuration time. To execute entire applications, most of them combine an instruction set processor(ISP) and a reconfigurable matrix. However, not much attention is paid to the integration of these two parts, which results in high communication overhead and programming difficulty. To address this problem, we propose a novel architecture with tightly coupled very long instruction word (VLIW) processor and coarse-grained reconfigurable matrix. The advantages include simplified programming model, shared resource costs, and reduced communication overhead. To exploit this architecture, our previously developed compiler framework is adapted to the new architecture. The results show that the new architecture has good performance and is very compiler-friendly.


IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | 1999

A new algorithm for elimination of common subexpressions

Robert Pasko; Patrick Schaumont; Veerle Derudder; Serge Vernalde; Daniela Durackova

The problem of an efficient hardware implementation of multiplications with one or more constants is encountered in many different digital signal-processing areas, such as image processing or digital filter optimization. In a more general form, this is a problem of common subexpression elimination, and as such it also occurs in compiler optimization and many high-level synthesis tasks. An efficient solution of this problem can yield significant improvements in important design parameters like implementation area or power consumption. In this paper, a new solution of the multiple constant multiplication problem based on the common subexpression elimination technique is presented. The performance of our method is demonstrated primarily on a finite-duration impulse response filter design. The idea is to implement a set of constant multiplications as a set of add-shift operations and to optimize these with respect to the common subexpressions afterwards. We show that the number of add/subtract operations can be reduced significantly this way. The applicability of the presented algorithm to the different high-level synthesis tasks is also indicated. Benchmarks demonstrating the algorithms efficiency are included as well.


field programmable logic and applications | 2002

Interconnection Networks Enable Fine-Grain Dynamic Multi-tasking on FPGAs

Théodore Marescaux; Andrei Bartic; Diederik Verkest; Serge Vernalde; Rudy Lauwereins

Multimedia support appears on embedded platforms, such as WAP for mobile phones. However, true multimedia applications require both the computation power that only dedicated hardware can provide and the flexibility of software implementations. To this end, we are investigating reconfigurable architectures, composed of an instruction-set processor running software processes and coupled to an FPGA on which hardware tasks are spawned by dynamic partial reconfiguration. This paper focuses on two main aspects. It explains how separating communication from computation enables hardware multi-tasking and it describes our implementation of a fixed communication-layer that decouples the computation elements, allowing them to be dynamically reconfigured. This communication layer is an interconnection network, implemented on a Virtex FPGA, allowing fast synchronous communication between hardware tasks implemented on the same matrix. The network is a 2D torus and uses wormhole routing. It achieves transfer rates up to 77.6 MB/s between two adjacent routers, when clocked at 40 MHz. Interconnection networks on FPGAs allow fine-grain dynamic partial reconfiguration and make hardware multi-tasking a reality.


field-programmable technology | 2002

DRESC: a retargetable compiler for coarse-grained reconfigurable architectures

Bingfeng Mei; Serge Vernalde; Diederik Verkest; H. De Man; Rudy Lauwereins

Coarse-grained reconfigurable architectures have become increasingly important in recent years. Automatic design or compiling tools are essential to their success. In this paper, we present a retargetable compiler for a family of coarse-grained reconfigurable architectures. Several key issues are addressed. Program analysis and transformation prepare dataflow for scheduling. Architecture abstraction generates an internal graph representation from a concrete architecture description. A modulo scheduling algorithm is key to exploit parallelism and achieve high performance. The experimental results show up to 28.7 instructions per cycle (IPC) over tested kernels.


design, automation, and test in europe | 2004

Design methodology for a tightly coupled VLIW/reconfigurable matrix architecture: a case study

Bingfeng Mei; Serge Vernalde; Diederik Verkest; Rudy Lauwereins

Coarse-grained reconfigurable architectures have seen growing importance recently. Design tools and methodology are essential to their success. Based on our previous work on modulo scheduling algorithms and a novel architecture with tightly coupled VLIW/reconfigurable matrix, we present a C-based design flow using an MPEG-2 decoder as a design example. The application is mapped to the architecture in less than one person-week starting from a software implementation. The kernel and overall speedup over the reference VLIW are 4.84 and 3.05 respectively. The case study shows that our methodology and architecture can deliver a competitive package in terms of design efforts and performance over other programmable architectures.


design, automation, and test in europe | 2003

Infrastructure for Design and Management of Relocatable Tasks in a Heterogeneous Reconfigurable System-on-Chip

Jean-Yves Mignolet; Vincent Nollet; Paul Coene; Diederik Verkest; Serge Vernalde; Rudy Lauwereins

The ability to (re)schedule a task either in hardware or software will be an important asset in a reconfigurable systems-on-chip. To support this feature we have developed an infrastructure that, combined with a suitable design environment permits the implementation and management of hardware/software relocatable tasks. This paper presents the general scope of our research, and details the communication scheme, the design environment and the hardware/software context switching issues. The infrastructure proved its feasibility by allowing us to design a relocatable video decoder. When implemented on an embedded platform, the decoder performs at 23 frames/s (320/spl times/240 pixels, 16 bits per pixel) in reconfigurable hardware and 6 frames/s in software.


design, automation, and test in europe | 1999

A methodology and design environment for DSP ASIC fixed point refinement

Radim Cmar; Luc Rijnders; Patrick Schaumont; Serge Vernalde; Ivo Bolsens

Complex signal processing algorithms are specified in floating point precision. When their hardware implementation requires fixed point precision, type refinement is needed. The paper presents a methodology and design environment for this quantization process. The method uses independent strategies for fixing MSB and LSB weights of fixed point signals. It enables short design cycles by combining the strengths of both analytical and simulation based methods.


international parallel and distributed processing symposium | 2003

Designing an operating system for a heterogeneous reconfigurable SoC

Vincent Nollet; Paul Coene; Diederik Verkest; Serge Vernalde; Rudy Lauwereins

The emerging need for large amounts of flexible computational power on embedded devices motivated many researchers to incorporate reconfigurable logic together with an instruction-set-processor (ISP) into their architectures. This implies that tomorrows applications will make use of both the ISP and the reconfigurable logic in order to provide the user with maximum performance. Today, however, a few stumbling blocks prevent these kinds of heterogeneous architectures from becoming mainstream. The technology still lacks a form of run-time management infrastructure. This infrastructure should ease application development by shielding the programmer from the complexity of the system and by providing a clear application development API. This paper presents a novel approach for designing an operating system for reconfigurable systems (OS4RS). Creating such an operating system is an integral part of our ongoing research regarding reconfigurable computing. An initial version of our operating system was used to manage a reconfigurable systems demonstrator.


design automation conference | 2004

Operating-system controlled network on chip

Vincent Nollet; Théodore Marescaux; Diederik Verkest; Jean-Yves Mignolet; Serge Vernalde

Managing a Network-on-Chip (NoC) in an efficient way is a challenging task. To succeed, the operating system (OS) needs to be tuned to the capabilities and the needs of the NoC. Only by creating a tight interaction can we combine the necessary flexibility with the required efficiency. This paper illustrates such an interaction by detailing the management of communication resources in a system containing a packet-switched NoC and a closely integrated OS. Our NoC system is emulated by linking an FPGA to a PDA. We show that, with the right NoC support, the OS is able to optimize communication resource usage. Additionally, the OS is able to diminish or remove the interference between independent applications sharing a common NoC communication resource.


design automation conference | 1998

A programming environment for the design of complex high speed ASICs

Patrick Schaumont; Serge Vernalde; Luc Rijnders; Marc Engels; Ivo Bolsens

A C++ based programming environment for the design of complex high speed ASICs is presented. The design of a 75 Kgate DECT transceiver is used as a driver example. Compact descriptions, combined with efficient simulation and synthesis strategies are essential for the design of such a complex system. It is shown how a C++ programming approach outperforms traditional HDL-based methods.

Collaboration


Dive into the Serge Vernalde's collaboration.

Top Co-Authors

Avatar

Diederik Verkest

Katholieke Universiteit Leuven

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Rudy Lauwereins

Katholieke Universiteit Leuven

View shared research outputs
Top Co-Authors

Avatar

Marc Engels

Katholieke Universiteit Leuven

View shared research outputs
Top Co-Authors

Avatar

Jean-Yves Mignolet

Katholieke Universiteit Leuven

View shared research outputs
Top Co-Authors

Avatar

Ivo Bolsens

Katholieke Universiteit Leuven

View shared research outputs
Top Co-Authors

Avatar

Théodore Marescaux

Katholieke Universiteit Leuven

View shared research outputs
Researchain Logo
Decentralizing Knowledge