Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Muhammad Yasir Qadri is active.

Publication


Featured researches published by Muhammad Yasir Qadri.


Eurasip Journal on Embedded Systems | 2009

Data cache-energy and throughput models: design exploration for embedded processors

Muhammad Yasir Qadri; Klaus D. McDonald-Maier

Most modern 16-bit and 32-bit embedded processors contain cache memories to further increase instruction throughput of the device. Embedded processors that contain cache memories open an opportunity for the low-power research community to model the impact of cache energy consumption and throughput gains. For optimal cache memory configuration mathematical models have been proposed in the past. Most of these models are complex enough to be adapted for modern applications like run-time cache reconfiguration. This paper improves and validates previously proposed energy and throughput models for a data cache, which could be used for overhead analysis for various cache types with relatively small amount of inputs. These models analyze the energy and throughput of a data cache on an application basis, thus providing the hardware and software designer with the feedback vital to tune the cache or application for a given energy budget. The models are suitable for use at design time in the cache optimization process for embedded processors considering time and energy overhead or could be employed at runtime for reconfigurable architectures.


international conference on computer modelling and simulation | 2010

Analytical Evaluation of Energy and Throughput for Multilevel Caches

Muhammad Yasir Qadri; Klaus D. McDonald-Maier

With the increase of processor-memory performance gap, it has become important to gauge the performance of cache architectures so as to evaluate their impact on energy requirement and throughput of the system. Multilevel caches are found to be increasingly prevalent in the high-end processors. Additionally, the recent drive towards multicore systems has necessitated the use of multilevel cache hierarchies for shared memory architectures. This paper presents simplified and accurate mathematical models to estimate the energy consumption and the impact on throughput for multilevel caches for single core systems.


automation, robotics and control systems | 2010

JetBench: an open source real-time multiprocessor benchmark

Muhammad Yasir Qadri; Dorian Matichard; Klaus D. McDonald Maier

Performance comparison among various architectures is generally attained by using standard benchmark tools. This paper presents JetBench, an Open Source OpenMP based multicore benchmark application that could be used to analyse real time performance of a specific target platform. The application is designed to be platform independent by avoiding target specific libraries and hardware counters and timers. JetBench uses jet engine parameters and thermodynamic equations presented in the NASAs EngineSim program, and emulates a real-time jet engine performance calculator. The user is allowed to determine a flight profile with timing constraints, and adjust the number of threads. This paper discusses the structure of the application, thread distribution and its scalability on a custom symmetric multicore platform based on a cycle accurate full system simulator.


Journal of Intelligent and Fuzzy Systems | 2014

Energy and throughput aware fuzzy logic based reconfiguration for MPSoCs

Muhammad Yasir Qadri; Klaus D. McDonald Maier; Nadia N. Qadri

Multicore architectures offer an amount of parallelism that is often underutilized, as a result these underutilized resources become a liability instead of advantage. Inefficient resource sharing on the chip can have a negative impact on the performance of an application and may result in greater energy consumption. A large body of research now focuses on reconfigurable multicore architectures in order to support algorithms to find optimal solutions for improved energy and throughput balance. An ideal system would be able to optimize such reconfigurable systems to a level that optimum resources are allocated to a particular workload and all the other underutilized resources remain inactive for greater energy savings. This paper presents a fuzzy logic based reconfiguration engine targeted to optimize a multicore architecture according to the workload requirements for optimum balance between power and performance of the system. The proposed fuzzy logic reconfiguration engine is designed around a 16-core SCMP architecture comprising of reconfigurable cache memories, power gated cores and adaptive on-chip network routers for minimizing leakage energy effects for inactive components. A coarse grained architecture was selected for being able to reconfigure faster, thus making it feasible to be used for runtime adaptation schemes. The presented architecture is analyzed over a set of OpenMP based parallel benchmarks and results show significant energy savings in all cases.


complex, intelligent and software intensive systems | 2010

A Fuzzy Logic Reconfiguration Engine for Symmetric Chip Multiprocessors

Muhammad Yasir Qadri; Klaus D. McDonald-Maier

Recent developments in reconfigurable multiprocessor system on chip (MPSoC) have offered system designers a great amount of flexibility to exploit task concurrency with higher throughput and less energy consumption. This paper presents a novel fuzzy logic reconfiguration engine (FLRE) for coarse grain MPSoC reconfiguration that facilitates to identify an optimum balance between power and performance of the system. The FLRE is composed on two levels of abstraction layers. The system selects an optimal configuration of Level 1 / Level 2 cache size and Associativity, processor operating frequency and voltage, the number of cores based on miss rate, and energy and throughput information of the system both at core and SoC level. An 8-core symmetric chip multiprocessor has been used to evaluate the proposed scheme. The results show an overall decrease of energy consumption with not more than 30% decrease in the throughput.


adaptive hardware and systems | 2010

A fuzzy logic based dynamic reconfiguration scheme for optimal energy and throughput in symmetric chip multiprocessors

Muhammad Yasir Qadri; Klaus D. McDonald-Maier

Embedded systems architectures have traditionally often been investigated and designed in order to achieve a greater throughput combined with minimum energy consumption. With the advent of reconfigurable architectures it is now possible to support algorithms to find optimal solutions for an improved energy and throughput balance. As a result of ongoing research several online and offline techniques and algorithm have been proposed for hardware adaptation. This paper presents a novel coarse-grained reconfigurable symmetric chip multiprocessor (SCMP) architecture managed by a fuzzy logic engine that balances performance and energy consumption. The architecture incorporates reconfigurable level 1 (L1) caches, power gated cores and adaptive on-chip network routers to allow minimizing leakage energy effects for inactive components. A coarse grained architecture was selected as to be a focus for this study as it typically allows for fast reconfiguration as compared to the finegrained architectures, thus making it more feasible to be used for runtime adaption schemes. The presented architecture is analyzed using a set of OpenMP based parallel benchmarks and the results show significant improvements in performance while maintaining minimum energy consumption.


Journal of Circuits, Systems, and Computers | 2018

AC-DSE: Approximate Computing for the Design Space Exploration of Reconfigurable MPSoCs

Arsalan Shahid; Muhammad Yasir Qadri; Martin Fleury; Hira Waris; Ayaz Ahmad; Nadia N. Qadri

This paper concerns the design space exploration (DSE) of Reconfigurable Multi- Processor System-on- Chip (MPSoC) architectures. Reconfiguration allows users to allocate optimum system resources fo...


Microprocessors and Microsystems | 2016

Fuzzy logic based energy and throughput aware design space exploration for MPSoCs

Muhammad Yasir Qadri; Nadia N. Qadri; Klaus D. McDonald-Maier

Multicore architectures were introduced to mitigate the issue of increase in power dissipation with clock frequency. Introduction of deeper pipelines, speculative threading etc. for single core systems were not able to bring much increase in performance as compared to their associated power overhead. However for multicore architectures performance scaling with number of cores has always been a challenge. The Amdahls law shows that the theoretical maximum speedup of a multicore architecture is not even close to the multiple of number of cores. With less amount of code in parallel having more number of cores for an application might just contribute in greater power dissipation instead of bringing some performance advantage. Therefore there is a need of an adaptive multicore architecture that can be tailored for the application in use for higher energy efficiency. In this paper a fuzzy logic based design space exploration technique is presented that is targeted to optimize a multicore architecture according to the workload requirements in order to achieve optimum balance between throughput and energy of the system.


Journal of Circuits, Systems, and Computers | 2015

Software-Controlled Instruction Prefetch Buffering for Low-End Processors

Muhammad Yasir Qadri; Nadia N. Qadri; Martin Fleury; Klaus D. McDonald-Maier

This paper proposes a method of buffering instructions by software-based prefetching. The method allows low-end processors to improve their instruction throughput with a minimum of additional logic and power consumption. Low-end embedded processors do not employ caches for mainly two reasons. The first reason is that the overhead of cache implementation in terms of energy and area is considerable. The second reason is that, because a caches performance primarily depends on the number of hits, an increasing number of misses could cause a processor to remain in stall mode for a longer duration. As a result, a cache may become more of a liability than an advantage. In contrast, the benchmarked results for the proposed software-based prefetch buffering without a cache show a 5–10% improvement in execution time. They also show a 4% or more reduction in the energy-delay-square-product (ED2P) with a maximum reduction of 40%. The results additionally demonstrate that the performance and efficiency of the proposed architecture scales with the number of multicycle instructions. The benchmarked routines tested to arrive at these results are widely deployed components of embedded applications.


Archive | 2018

Internet of Things Shaping Smart Cities: A Survey

Arsalan Shahid; Bilal Khalid; Shahtaj Shaukat; Hashim Ali; Muhammad Yasir Qadri

Driven by the advances in hardware and software technologies, the term Internet of things has emerged as a worldwide framework of ‘smart’ internet-based interconnected electronic devices through web having a significant impact in the betterment of our traditional living style. The use of these web connected embedded devices, as Information and communication technologies for re-shaping modern cities, lead to the concept of smart cities. This chapter surveys the most important domains of smart cities and illustrates the recent research and development in them. After identifying critical areas, the chapter also highlights and discusses the issues and research gaps in recent technologies. Finally, it presents the opportunities and research directions in future advancements of intelligent cities.

Collaboration


Dive into the Muhammad Yasir Qadri's collaboration.

Top Co-Authors

Avatar

Nadia N. Qadri

COMSATS Institute of Information Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ayaz Ahmad

COMSATS Institute of Information Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Hira Waris

COMSATS Institute of Information Technology

View shared research outputs
Top Co-Authors

Avatar

Mashal Ahmad

COMSATS Institute of Information Technology

View shared research outputs
Top Co-Authors

Avatar

Muhammad Iqbal

University of Central Punjab

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge