Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Zdravko Popovic is active.

Publication


Featured researches published by Zdravko Popovic.


symposium on computer architecture and high performance computing | 2007

DTA-C: A Decoupled multi-Threaded Architecture for CMP Systems

Roberto Giorgi; Zdravko Popovic; Nikola Puzovic

One way to exploit Thread Level Parallelism (TLP) is to use architectures that implement novel multithreaded execution models, like Scheduled Data- Flow (SDF). This latter model promises an elegant decoupled and non-blocking execution of threads. Here we extend that model in order to be used in future scalable CMP systems where wire delay imposes to partition the design. In this paper we describe our approach and experiment with different distributed schedulers, different number of clusters and processors per cluster to show good scalability of our architecture. We describe our approach and present initial results on system scalability and performance. We suggest design choices to improve the scalability of the basic design.


international conference / workshop on embedded computer systems: architectures, modeling and simulation | 2009

Implementing Fine/Medium Grained TLP Support in a Many-Core Architecture

Roberto Giorgi; Zdravko Popovic; Nikola Puzovic

We believe that future many-core architectures should support a simple and scalable way to execute many threads that are generated by parallel programs. A good candidate to implement an efficient and scalable execution of threads is the DTA (Decoupled Threaded Architecture), which is designed to exploit fine/medium grained Thread Level Parallelism (TLP) by using a hardware scheduling unit and relying on existing simple cores. In this paper, we present an initial implementation of DTA concept in a many-core architecture where it interacts with other architectural components designed from scratch in order to address the problem of scalability. We present initial results that show the scalability of the solution that were obtained using a many-core simulator written in SARCSim (a variant of UNISIM) with DTA support.


international parallel and distributed processing symposium | 2009

Exploiting DMA to enable non-blocking execution in Decoupled Threaded Architecture

Roberto Giorgi; Zdravko Popovic; Nikola Puzovic

DTA (Decoupled Threaded Architecture) is designed to exploit fine/medium grained Thread Level Parallelism (TLP) by using a distributed hardware scheduling unit and relying on existing simple cores (in-order pipelines, no branch predictors, no ROBs).


international symposium on parallel and distributed computing | 2009

Programming Abstractions and Toolchain for Dataflow Multithreading Architectures

Kyriakos Stavrou; Demos Pavlou; Marios Nikolaides; Panayiotis Petrides; Paraskevas Evripidou; Pedro Trancoso; Zdravko Popovic; Roberto Giorgi

The need to exploit multi-core systems for parallel processing has revived the concept of dataflow. In particular, the Dataflow Multithreading architectures have proven to be good candidates for these systems. In this work we propose an abstraction layer that enables compiling and running a program written for an abstract Dataflow Multithreading architecture on different implementations. More specifically, we present a set of compiler directives that provide the programmer with the means to express most types of dependencies between code segments. In addition, we present the corresponding toolchain that transforms this code into a form that can be compiled for different implementations of the model. As a case study for this work, we present the usage of the toolchain for the TFlux and DTA architectures.


complex, intelligent and software intensive systems | 2009

Introducing Hardware TLP Support in the Cell Processor

Roberto Giorgi; Zdravko Popovic; Nikola Puzovic

The focus of our study is the support for fine/medium grained Thread Level Parallelism (TLP) by using a hardware scheduling unit and relying on existing simple cores. Simple cores are grouped into clusters in order to provide a scalable solution. As a proof of concept, we use an implementation based on the Cell Broadband Engine (CBE). Cell is a multiprocessor on a chip developed by Sony, Toshiba and IBM that contains one general purpose core and eight coprocessor elements that accelerate the multimedia and vector processing. The aim of this paper is to present a possible implementation of DTA (Decoupled Threaded Architecture) that is based on the Cell processor, while keeping the scalability of the original DTA.


digital systems design | 2008

Analyzing Scalability of Deblocking Filter of H.264 via TLP Exploitation in a New Many-Core Architecture

Roberto Giorgi; Zdravko Popovic; Nikola Puzovic; Arnaldo Azevedo; Ben H. H. Juurlink

In this paper we present results of parallelization of Deblocking Filter (DF) of H.264 video codec on decoupled threaded architecture (DTA). We parallelized the code trying to exploit all available thread level parallelism and to make it suitable for DTA architecture. Experimental results show that significant speed up can be achieved and that DTA architecture can efficiently exploit available parallelism. We also show comparison with parallelized version of DF for Cell architecture.


ACACES-2008 | 2008

Implementing DTA support in CellSim

Roberto Giorgi; Zdravko Popovic; Nikola Puzovic


Archive | 2005

Recent Proposals for Tiled Architectures

Sandro Bartolini; Roberto Giorgi; Enrico Martinelli; Zdravko Popovic


ACACES-2008 | 2008

Exploiting Parallelism of Deblocking Filter of H.264 on DTA architecture

Roberto Giorgi; Zdravko Popovic; Nikola Puzovic; Arnaldo Azevedo; Ben H. H. Juurlink


Archive | 2007

Memory access decoupling in a multithreaded architecture

Roberto Giorgi; Zdravko Popovic; Nikola Puzovic

Collaboration


Dive into the Zdravko Popovic's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Arnaldo Azevedo

Delft University of Technology

View shared research outputs
Top Co-Authors

Avatar

Ben H. H. Juurlink

Technical University of Berlin

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge