Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Elias Vansteenkiste is active.

Publication


Featured researches published by Elias Vansteenkiste.


IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | 2014

TPaR: Place and Route Tools for the Dynamic Reconfiguration of the FPGA's Interconnect Network

Elias Vansteenkiste; Brahim Al Farisi; Karel Bruneel; Dirk Stroobandt

Dynamic partial reconfiguration of FPGAs enables the dynamic specialization of the circuit for the runtime needs of the application. Previously a tool flow, called the TLUT tool flow, was developed to aid the designer in applying dynamic circuit specialization (DCS) for their designs. The TLUT tool flow generates an implementation in which the lookup tables (LUTs) can be specialized during runtime. In this paper, place and route algorithms are described for the TCON tool flow. The TCON tool flow generates implementations in which not only the logic infrastructure (LUTs) is dynamically specialized, but also the routing infrastructure of the FPGA. Exploiting the reconfigurability of the FPGA interconnection network further improves area (50% to 92% less LUTs and 36% to 81% less wiring), logic depth (a 63% to 80% reduction) and power consumption. To achieve this, major changes were needed, not only in the mapping, but also in the place and route steps. This work describes the altered place and route algorithms, called TPlace and Troute.


field-programmable logic and applications | 2013

Efficient implementation of Virtual Coarse Grained Reconfigurable Arrays on FPGAS

Karel Heyse; Tom Davidson; Elias Vansteenkiste; Karel Bruneel; Dirk Stroobandt

Fine grained Field Programmable Gate Arrays (FPGA) are complex to program and therefore suffer from high development costs. To solve this problem, Virtual Coarse Grained Reconfigurable Arrays (Virtual CGRA), or CGRAs implemented on FPGAs, have been proposed. Conventional implementations of VCGRAs use functional FPGA resources, such as LookUp Tables, to implement the virtual switch blocks, registers and other components that make the VCGRA configurable. We show that this is a large overhead that can often be avoided by mapping these components directly on lower level FPGA resources such as physical switch blocks and configuration memory. We show how this can be achieved using the tool flow for parameterised FPGA configurations and illustrate the advantages of this method by showing that an area reduction of 50% is attainable for a VCGRA aimed at regular expression matching.


field programmable logic and applications | 2012

Maximizing the reuse of routing resources in a reconfiguration-aware connection router

Elias Vansteenkiste; Karel Bruneel; Dirk Stroobandt

Parameterised configurations for FPGAs are configuration bitstreams of which some of the bits are defined as Boolean functions of parameters. By evaluating these Boolean functions using different parameter values, it is possible to quickly and efficiently derive specialised configuration bitstreams with different properties. Generating and using parameterized configurations requires a new tool flow. In this paper we propose a novel algorithm for the routing step of this tool flow. This new router, called the connection bundle router, is able to route a circuit with parameterized interconnections. It produces routing solutions in less time (up to a factor 5,2) and with a better quality in terms of number of wires (up to 38%) and minimum track width (up to 25%) than its predecessors. The connection bundle router is fully automated and uses a scalable connection-based representation for the parameterized interconnections in a tunable circuit.


applied reconfigurable computing | 2012

A connection router for the dynamic reconfiguration of FPGAs

Elias Vansteenkiste; Karel Bruneel; Dirk Stroobandt

Dynamic Circuit Specialization (DCS) is a new FPGA CAD tool flow that uses Run-Time Reconfiguration to automatically specialize the FPGA configuration for a whole range of specific data values. DCS implementations are a factor 5 faster and need a factor 8 less luts compared to conventional implementations. We propose a novel routing algorithm for reconfigurable routing, called the Connection router. In contrast to troute, another reconfiguration-aware router, our new router is fully automated and far more scalable.


ieee computer society annual symposium on vlsi | 2013

A novel tool flow for increased routing configuration similarity in multi-mode circuits

Brahim Al Farisi; Elias Vansteenkiste; Karel Bruneel; Dirk Stroobandt

A multi-mode circuit implements the functionality of a limited number of circuits, called modes, of which at any given time only one needs to be realised. Using run-time reconfiguration (RTR) of an FPGA, all the modes can be time-multiplexed on the same reconfigurable region, requiring only an area that can contain the biggest mode. Typically, conventional run-time reconfiguration techniques generate a configuration of the reconfigurable region for every mode separately. This results in configurations that are bit-wise very different. Thus, in this case, many bits need to be changed in the configuration memory to switch between modes, leading to long reconfiguration times. In this paper we present a novel tool flow that retains the placement of the conventional RTR flow, but uses TRoute, a reconfiguration-aware connection router, to implement the connections of all modes simultaneously. DRoute stimulates the sharing of routing resources between connections of different modes. This results in a significant increase in the similarity between the routing configurations of the modes. In the experimental results it is shown that the number of routing configuration bits that needs to be rewritten is reduced with a factor between 2 and 4 compared to conventional techniques.


reconfigurable communication centric systems on chip | 2016

EXTRA: Towards the exploitation of eXascale technology for reconfigurable architectures

Dirk Stroobandt; Ana Lucia Varbanescu; Cătălin Bogdan Ciobanu; Muhammed Al Kadi; Andreas Brokalakis; George Charitopoulos; Tim Todman; Xinyu Niu; Dionisios N. Pnevmatikatos; Amit Kulkarni; Elias Vansteenkiste; Wayne Luk; Marco D. Santambrogio; Donatella Sciuto; Michael Huebner; Tobias Becker; Georgi Gaydadjiev; Antonis Nikitakis; Alex J. W. Thom

To handle the stringent performance requirements of future exascale-class applications, High Performance Computing (HPC) systems need ultra-efficient heterogeneous compute nodes. To reduce power and increase performance, such compute nodes will require hardware accelerators with a high degree of specialization. Ideally, dynamic reconfiguration will be an intrinsic feature, so that specific HPC application features can be optimally accelerated, even if they regularly change over time. In the EXTRA project, we create a new and flexible exploration platform for developing reconfigurable architectures, design tools and HPC applications with run-time reconfiguration built-in as a core fundamental feature instead of an add-on. EXTRA covers the entire stack from architecture up to the application, focusing on the fundamental building blocks for run-time reconfigurable exascale HPC systems: new chip architectures with very low reconfiguration overhead, new tools that truly take reconfiguration as a central design concept, and applications that are tuned to maximally benefit from the proposed run-time reconfiguration techniques. Ultimately, this open platform will improve Europes competitive advantage and leadership in the field.


field programmable logic and applications | 2015

Estimating circuit delays in FPGAs after technology mapping

Berg Severens; Elias Vansteenkiste; Karel Heyse; Dirk Stroobandt

An FPGA implementation requires a significant effort of the hardware designer, who optimizes FPGA designs by going through many time-consuming CAD flow iterations. These iterations provide two types of feedback: (1) the FPGA performance and (2) the identification of the parts having the highest impact on the FPGA performance. Both depend on the wirelength behavior. Studies have been dedicated to the estimation of local [5] and global [4] wirelengths, but to our knowledge both performance estimations and identification of the critical zone are not present in literature. Therefore this paper, firstly, presents a comparison of three performance estimation techniques: logic depth, Monte Carlo simulation and fast placement (ordered from low to high accuracy and runtime). Secondly, four methods identifying the critical zone are compared. Results show that Monte Carlo simulations provide a good identification of the parts having the highest impact on the performance. We conclude that Monte Carlo simulations provide useful feedback within a short runtime (about 30 times faster than placement), reducing the time-to-market of FPGA implementations.


field-programmable technology | 2015

Analyzing the divide between FPGA academic and commercial results

Elias Vansteenkiste; Alireza S. Kaviani; Henri Fraisse

The pinnacle of success for academic work is often achieved by having impact on commercial products. In order to have a successful transfer bridge, academic evaluation flows need to provide representative results of similar quality to commercial flows. A majority of publications in FPGA research use the same set of known academic CAD tools and benchmarks to evaluate new architecture and tool ideas. However, it is not clear whether the claims in academic publications based on these tools and benchmarks translate to real benefits in commercial products. In this work we compare the latest Xilinx commercial tools and products with these well-known academic tools to identify the gap in the major figures of merit. Our results show that there is a significant 2.2X gap in speed-performance for similar process technology. We have also identified the area-efficiency and runtime divide between commercial and academic tools to be 5% and 2.2X, respectively. We show that it is possible to improve portions of the academic flow such as ABC logic optimization to match the quality of commercial tools at the expense of additional runtime. Our results also show that depth reduction, which is often used as the main figure of merit for logic optimization papers does not translate to post-routing timing improvements. We finally discuss the differences between academic and commercial benchmark designs. We explain the main differences and trends that may influence the topic choice and conclusions of academic research. This work emphasizes how difficult it is to identify the relevant FPGA academic work that can provide meaningful benefits for commercial products.


computational science and engineering | 2015

EXTRA: Towards an Efficient Open Platform for Reconfigurable High Performance Computing

Catalin Bogdan Ciobanu; Ana Lucia Varbanescu; Dionisios N. Pnevmatikatos; George Charitopoulos; Xinyu Niu; Wayne Luk; Marco D. Santambrogio; Donatella Sciuto; Muhammed Al Kadi; Michael Huebner; Tobias Becker; Georgi Gaydadjiev; Andreas Brokalakis; Antonis Nikitakis; Alex J. W. Thom; Elias Vansteenkiste; Dirk Stroobandt

To handle the stringent performance requirements of future exascale-class applications, High Performance Computing (HPC) systems need ultra-efficient heterogeneous compute nodes. To reduce power and increase performance, such compute nodes will require hardware accelerators with a high degree of specialization. Ideally, dynamic reconfiguration will be an intrinsic feature, so that specific HPC application features can be optimally accelerated, even if they regularly change over time. In the EXTRA project, we create a new and flexible exploration platform for developing reconfigurable architectures, design tools and HPC applications with run-time reconfiguration built-in as a core fundamental feature instead of an add-on. EXTRA covers the entire stack from architecture up to the application, focusing on the fundamental building blocks for run-time reconfigurable exascale HPC systems: new chip architectures with very low reconfiguration overhead, new tools that truly take reconfiguration as a central design concept, and applications that are tuned to maximally benefit from the proposed run-time reconfiguration techniques. Ultimately, this open platform will improve Europes competitive advantage and leadership in the field.


reconfigurable computing and fpgas | 2014

Parameterised FPGA reconfigurations for efficient test set generation

Alexandra Kourfali; Elias Vansteenkiste; Dirk Stroobandt

This paper proposes the use of parameterised FPGA configurations for a new test set generation approach. The time-consuming problem of test set generation aims at finding the right input values to fully test an ASIC design. Since well-known methods for test set generation such as fault simulation techniques have become impractical to use due to their speed limitations, FPGAs have been used in order to facilitate fault injection techniques. However, the development of previous FPGA fault injection techniques demonstrate either area or time overhead. This paper proposes a post-synthesis fault injection method that combines fault emulation with the parameterised configuration technique. The new fault-injected design is mapped with different mapping solutions based on dynamic specialisation of the logic and routing infrastructure of the FPGA during runtime. The experimental results for the proposed technique indicate a significant reduction of the logic depth and an area reduction up to a factor 10 compared to conventional tools.

Collaboration


Dive into the Elias Vansteenkiste's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Georgi Gaydadjiev

Chalmers University of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Wayne Luk

Imperial College London

View shared research outputs
Top Co-Authors

Avatar

Xinyu Niu

Imperial College London

View shared research outputs
Researchain Logo
Decentralizing Knowledge