Andreas Brokalakis | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Andreas Brokalakis is active.

Explore More

Publication

Featured researches published by Andreas Brokalakis.

digital systems design | 2012

FASTER: Facilitating Analysis and Synthesis Technologies for Effective Reconfiguration

Dionisios N. Pnevmatikatos; Tobias Becker; Andreas Brokalakis; Karel Bruneel; Georgi Gaydadjiev; Wayne Luk; Kyprianos Papadimitriou; Ioannis Papaefstathiou; Oliver Pell; Christiano Pilato; Matthieu Robart; Marco D. Santambrogio; Donatella Sciuto; Dirk Stroobandt; Tim Todman

The FASTER project aims to ease the definition, implementation and use of dynamically changing hardware systems. Our motivation stems from the promise reconfigurable systems hold for achieving better performance and extending product functionality and lifetime via the addition of new features that work at hardware speed. This is a clear advantage over the more straightforward software component adaptivity. However, designing a changing hardware system is both challenging and time consuming. The FASTER project will facilitate the use of reconfigurable technology by providing a complete methodology that enables designers to easily specify, analyse, implement and verify applications on platforms with general-purpose processors and acceleration modules implemented in the latest reconfigurable technology. To better adapt to different application requirements, the tool-chain will support both region-based and micro-reconfiguration and provide a flexible run-time system that will efficiently manage the reconfigurable resources. We will use applications from the embedded, high performance computing, and desktop domains to demonstrate the potential benefits of the FASTER tools on metrics such as performance, power consumption and total ownership cost.

international parallel and distributed processing symposium | 2016

A Fully Parameterized Virtual Coarse Grained Reconfigurable Array for High Performance Computing Applications

Amit Kulkarni; Elias Vasteenkiste; Dirk Stroobandt; Andreas Brokalakis; Antonios Nikitakis

Field Programmable Gate Arrays (FPGAs) have proven their potential in accelerating High Performance Computing (HPC) Applications. Conventionally such accelerators predominantly use, FPGAs that contain fine-grained elements such as LookUp Tables (LUTs), Switch Blocks (SB) and Connection Blocks (CB) as basic programmable logic blocks. However, the conventional implementation suffers from high reconfiguration and development costs. In order to solve this problem, programmable logic components are defined at a virtual higher abstraction level. These components are called Processing Elements (PEs) and the group of PEs along with the inter-connection network form an architecture called a Virtual Coarse-Grained Reconfigurable Array (VCGRA). The abstraction helps to reconfigure the PEs faster at the intermediate level than at the lower-level of an FPGA. Conventional VCGRA implementations (built on top of the lower levels of the FPGA) use functional resources such as LUTs to establish required connections (intra-connect) within a PE. In this paper, we propose to use the parameterized reconfiguration technique to implement the intra-connections of each PE with the aim to reduce the FPGA resource utilization (LUTs). The technique is used to parameterize the intra-connections with parameters that only change their value infrequently (whenever a new VCGRA function has to be reconfigured) and that are implemented as constants. Since the design is optimized for these constants at every moment in time, this reduces the resource utilization. Further, inter-connections (network between the multiple PEs) of the VCGRA grid can also be parameterized so that both the inter-and intra-connect network of the VCGRA grid can be mapped onto the physical switch blocks of the FPGA. For every change in parameter values a specialized bitstream is generated on the fly and the FPGA is reconfigured using the parameterized run-time reconfiguration technique. Our results show a drastic reduction in FPGA LUT resource utilization in the PE by at least 30% and in the intra-network of the PE by 31% when implementing an HPC application.

Microprocessors and Microsystems | 2013

HEAP: A Highly Efficient Adaptive multi-Processor framework

Luciano Lavagno; Mihai Teodor Lazarescu; Ioannis Papaefstathiou; Andreas Brokalakis; Johan Walters; Bart Kienhuis; Florian Schäfer

Writing parallel code is difficult, especially when starting from a sequential reference implementation. Our research efforts, as demonstrated in this paper, face this challenge directly by providing an innovative toolset that helps software developers profile and parallelize an existing sequential implementation, by exploiting top-level pipeline-style parallelism. The innovation of our approach is based on the facts that a) we use both automatic and profiling-driven estimates of the available parallelism, b) we refine those estimates using metric-driven verification techniques, and c) we support dynamic recovery of excessively optimistic parallelization. The proposed toolset has been utilized to find an efficient parallel code organization for a number of real-world representative applications, and a version of the toolset is provided in an open-source manner.

reconfigurable communication centric systems on chip | 2016

EXTRA: Towards the exploitation of eXascale technology for reconfigurable architectures

Dirk Stroobandt; Ana Lucia Varbanescu; Cătălin Bogdan Ciobanu; Muhammed Al Kadi; Andreas Brokalakis; George Charitopoulos; Tim Todman; Xinyu Niu; Dionisios N. Pnevmatikatos; Amit Kulkarni; Elias Vansteenkiste; Wayne Luk; Marco D. Santambrogio; Donatella Sciuto; Michael Huebner; Tobias Becker; Georgi Gaydadjiev; Antonis Nikitakis; Alex J. W. Thom

To handle the stringent performance requirements of future exascale-class applications, High Performance Computing (HPC) systems need ultra-efficient heterogeneous compute nodes. To reduce power and increase performance, such compute nodes will require hardware accelerators with a high degree of specialization. Ideally, dynamic reconfiguration will be an intrinsic feature, so that specific HPC application features can be optimally accelerated, even if they regularly change over time. In the EXTRA project, we create a new and flexible exploration platform for developing reconfigurable architectures, design tools and HPC applications with run-time reconfiguration built-in as a core fundamental feature instead of an add-on. EXTRA covers the entire stack from architecture up to the application, focusing on the fundamental building blocks for run-time reconfigurable exascale HPC systems: new chip architectures with very low reconfiguration overhead, new tools that truly take reconfiguration as a central design concept, and applications that are tuned to maximally benefit from the proposed run-time reconfiguration techniques. Ultimately, this open platform will improve Europes competitive advantage and leadership in the field.

digital systems design | 2012

HEAP: A Highly Efficient Adaptive Multi-processor Framework

Luciano Lavagno; Mihai Teodor Lazarescu; Johan Walters; Bart Kienhuis; Ioannis Papaefstathiou; Andreas Brokalakis; F. Schaefer

computational science and engineering | 2015

EXTRA: Towards an Efficient Open Platform for Reconfigurable High Performance Computing

Catalin Bogdan Ciobanu; Ana Lucia Varbanescu; Dionisios N. Pnevmatikatos; George Charitopoulos; Xinyu Niu; Wayne Luk; Marco D. Santambrogio; Donatella Sciuto; Muhammed Al Kadi; Michael Huebner; Tobias Becker; Georgi Gaydadjiev; Andreas Brokalakis; Antonis Nikitakis; Alex J. W. Thom; Elias Vansteenkiste; Dirk Stroobandt

ieee computer society annual symposium on vlsi | 2017

A CAD Open Platform for High Performance Reconfigurable Systems in the EXTRA Project

Marco Rabozzi; Rolando Brondolin; Giuseppe Natale; Emanuele Del Sozzo; Michael Huebner; Andreas Brokalakis; Catalin Bogdan Ciobanu; Dirk Stroobandt; Marco D. Santambrogio

As the power wall has become one of the main limiting factors for the performance of general purpose processors, the trend in High Performance Computing (HPC) is moving towards application-specific accelerators in order to meet the stringent performance requirements for exascale computing while still satisfying power budget constraints. Within this context, reconfigurable devices, and more specifically FPGA-based systems, represent a promising solution able to achieve highly energy efficient computations without jeopardizing performance. Nevertheless, the exploitation of reconfigurable hardware is still limited due to the hardware-software co-design challenges that it poses, the time consuming design space exploration process and the programming complexity. To overcome these challenges, the EXTRA European project addresses the reconfigurability of such devices as a first-class feature, covering the entire stack from the system architecture up to the application. Within this paper, we present the effort of the EXTRA project towards the definition of an adaptive open platform for the optimization and implementation of applications on high performance reconfigurable architectures. The underlying infrastructure of the platform is here presented, highlighting its capability to integrate modules from different developers in order to stimulate external contributions and open research.

design, automation, and test in europe | 2017

An open reconfigurable research platform as stepping stone to exascale high-performance computing

Dirk Stroobandt; Cătălin Bogdan Ciobanu; Marco D. Santambrogio; Gabriël Figueiredo; Andreas Brokalakis; Dionisios N. Pnevmatikatos; Michael Huebner; Tobias Becker; Alex J. W. Thom

To handle the stringent performance and power requirements of future exascale-class applications, High Performance Computing (HPC) systems need ultra-efficient heterogeneous compute nodes and hardware accelerators with a high degree of specialization. Ideally, dynamic reconfiguration will be an intrinsic feature, so that specific HPC application features can be optimally accelerated, even if they regularly change over time. We create a new and flexible exploration platform for developing reconfigurable architectures, design tools and HPC applications with run-time reconfiguration built-in as a core fundamental feature instead of an add-on. Our project proposes an open research platform that covers the entire stack from architecture up to the application, focusing on the fundamental building blocks for run-time reconfigurable exascale HPC systems: new chip architectures with very low reconfiguration overhead, new tools that truly take reconfiguration as a central design concept, and applications that are tuned to maximally benefit from the proposed run-time reconfiguration techniques. Ultimately, this open platform will enable groundbreaking research towards new exascale computing platforms.

international symposium on parallel and distributed processing and applications | 2014

FPGA-Based Design Using the FASTER Toolchain: The Case of STM Spear Development Board

Fabrizio Spada; Alberto Scolari; Gianluca Durelli; Riccardo Cattaneo; Marco D. Santambrogio; Donatella Sciuto; Dionisios N. Pnevmatikatos; Georgi Gaydadjiev; Oliver Pell; Andreas Brokalakis; Wayne Luk; Dirk Stroobandt; Danilo Pau

Even though FPGAs are becoming more and more popular as they are used in many different scenarios like communications and HPC, the steep learning curve needed to work with this technology is still the major limiting factor to their full success. Many works proposed to mitigate this problem by creating a companion of tools to support the designer during the development phase for this technology. The EU FASTER Project aims at realizing an integrated toolchain that assists the designer in the steps of the design flow that are necessary to port a given application onto an FPGA device. The novelty of the framework relies in the fact that the partial dynamic reconfiguration, which FPGA devices can exploit, is seen as a first class citizen throughout the whole design flow. This work reports a case study in which the FASTER toolchain has been used to port a raytracer application onto the STM Spear prototyping embedded platform. The paper discusses the steps done for the realization of the prototype and the results obtained on the target device. It finally reports some improvements that can be exploited to improve the performance of the hardware implementation that has been realized.

distributed computing in sensor systems | 2017