Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ahmad Yasin is active.

Publication


Featured researches published by Ahmad Yasin.


international symposium on performance analysis of systems and software | 2014

A Top-Down method for performance analysis and counters architecture

Ahmad Yasin

Optimizing an applications performance for a given microarchitecture has become painfully difficult. Increasing microarchitecture complexity, workload diversity, and the unmanageable volume of data produced by performance tools increase the optimization challenges. At the same time resource and time constraints get tougher with recently emerged segments. This further calls for accurate and prompt analysis methods. The insights from this method guide a proposal for a novel performance counters architecture that can determine the true bottlenecks of a general out-of-order processor. Unlike other approaches, our analysis method is low-cost and already featured in in-production systems - it requires just eight simple new performance events to be added to a traditional PMU. It is comprehensive - no restriction to predefined set of performance issues. It accounts for granular bottlenecks in super-scalar cores, missed by earlier approaches.


ACM Transactions on Architecture and Code Optimization | 2015

Compiler-Directed Power Management for Superscalars

Jawad Haj-Yihia; Yosi Ben Asher; Efraim Rotem; Ahmad Yasin; Ran Ginosar

Modern superscalar CPUs contain large complex structures and diverse execution units, consuming wide dynamic power range. Building a power delivery network for the worst-case power consumption is not energy efficient and often is impossible to fit in small systems. Instantaneous power excursions can cause voltage droops. Power management algorithms are too slow to respond to instantaneous events. In this article, we propose a novel compiler-directed framework to address this problem. The framework is validated on a 4th Generation Intel® Core™ processor and with simulator on output trace. Up to 16% performance speedup is measured over baseline for the SPEC CPU2006 benchmarks.


ACM Transactions on Architecture and Code Optimization | 2016

Fine-Grain Power Breakdown of Modern Out-of-Order Cores and Its Implications on Skylake-Based Systems

Jawad Haj-Yihia; Ahmad Yasin; Yosi Ben Asher; Avi Mendelson

A detailed analysis of power consumption at low system levels becomes important as a means for reducing the overall power consumption of a system and its thermal hot spots. This work presents a new power estimation method that allows understanding the power breakdown of an application when running on modern processor architecture such as the newly released Intel Skylake processor. This work also provides a detailed power and performance characterization report for the SPEC CPU2006 benchmarks, analysis of the data using side-by-side power and performance breakdowns, as well as few interesting case studies.


data management on new hardware | 2017

A methodology for OLTP micro-architectural analysis

Utku Sirin; Ahmad Yasin; Anastasia Ailamaki

Micro-architectural analysis is critical to investigate the interaction between workloads and processors. While todays aggressive out-of-order processors provide a rich set of performance events for deep execution cycle analysis, OLTP characterization studies usually use a cache-miss-based method (CMBM). In this work, we investigate the validity and the functionality of CMBM by comparing it with Intels state-of-the-art Top-down Micro-architecture Analysis Method (TMAM) for OLTP workloads. We show that, while CMBM and TMAM provide a similar high-level micro-architectural behavior, it is inadequate for a fine-grained micro-architectural analysis. We further show that TMAM underestimates memory stalls. We optimize TMAMs execution cycle breakdown, and improve its estimation of memory stalls up to 50%.


ieee hot chips symposium | 2016

Inside 6th gen Intel ® Core™: New microarchitecture code named skylake

Ittai Anati; David Blythe; Jack Doweck; Hong Jiang; Wen-fu Kao; Julius Mandelblat; Lihu Rappoport; Efraim Rotem; Ahmad Yasin

•Skylake delivers record levels of performance and battery life in many personal computing use cases and form factors •Intel® Speed Shift Technology provides higher performance, responsiveness and efficiency at power constrained form factors •Skylake Processor Graphics delivers scalable performance, >1TFLOPS compute, enhanced low power media engines, flexible power management, and end-to-end 4K experience •Skylake family of products allows developers to: •Choose from wide range of platform capabilities •Innovate with products for wide range of thermal envelopes and I/O solutions •Optimize the system performance using the advanced PMU capabilities •Skylake introduces Intel® SGX: a revolutionary game changer to trusted application security in the main stream SW environment


ieee international symposium on workload characterization | 2014

Deep-dive analysis of the data analytics workload in CloudSuite

Ahmad Yasin; Yosi Ben-Asher; Avi Mendelson


IEEE Micro | 2017

Inside 6th-Generation Intel Core: New Microarchitecture Code-Named Skylake

Jack Doweck; Wen-fu Kao; Allen Lu; Julius Mandelblat; Anirudha Rahatekar; Lihu Rappoport; Efraim Rotem; Ahmad Yasin; Adi Yoaz


Archive | 2011

Causing an interrupt based on event count

Ahmad Yasin; Peggy J. Irelan; Ofer Levy; Emile Ziedan; Grant G. Zhou


usenix annual technical conference | 2015

Establishing a base of trust with performance counters for enterprise workloads

Andrzej Nowak; Ahmad Yasin; Avi Mendelson; Willy Zwaenepoel


Archive | 2014

PERFORMANCE SCALABILITY PREDICTION

Ahmad Yasin; Nir Rosenzweig; Eliezer Weissmann; Efraim Rotem

Collaboration


Dive into the Ahmad Yasin's collaboration.

Researchain Logo
Decentralizing Knowledge