David Morano
Northeastern University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by David Morano.
european conference on parallel processing | 2002
Augustus K. Uht; Alireza Khalafi; David Morano; Marcos de Alba; David R. Kaeli
In this paper we present a novel approach to exploiting ILP through the use of resource-flow computing. This model begins by executing instructions independent of data flow and control flow dependencies in a program. The rest of the execution time is spent applying programmatic data flow and control flow constraints to end up with a programmatically-correct execution. We present the design of a machine that uses time tags and Active Stations, realizing a registerless data path.In this contribution we focus our discussion on the Execution Window elements of our machine, present Instruction Per Cycle (IPC) speedups for SPECint95 and SPECint2000 programs, and discuss the scalability of our design to hundreds of processing elements.
ACM Sigarch Computer Architecture News | 2003
David Morano; Alireza Khalafi; David R. Kaeli; Augustus K. Uht
A microarchitecture is described that achieves high performance on conventional single-threaded program codes without compiler assistance. To obtain high instructions per clock (IPC) for inherently sequential (e.g., SpecInt-2000 programs), a large number of instructions must be in flight simultaneously. However, several problems are associated with such microarchitectures, including scalability, issues related to control flow, and memory latency.Our design investigates how to utilize a large mesh of processing elements in order to execute a singlethreaded program. We present a basic overview of our microarchitecture and discuss how it addresses scalability as we attempt to execute many instructions in parallel. The microarchitecture makes use of control and value speculative execution, multipath execution, and a high degree of out-of-order execution to help extract instruction level parallelism. Execution-time predication and time-tags for operands are used for maintaining program order. We provide simulation results for several geometries of our microarchitecture illustrating a range of design tradeoffs. Results are also presented that show the small performance impact over a range of memory system latencies.
Journal of Instruction-level Parallelism | 2003
Augustus K. Uht; David Morano; Alireza Khalafi; David R. Kaeli
Archive | 2001
Augustus K. Uht; David Morano; David R. Kaeli
Archive | 2001
Kelley Hall; Augustus K. Uht; David Morano; Alireza Khalafi; Marcos de Alba; Thomas F. Wenisch; Maryam Ashouei; David R. Kaeli
Archive | 2006
Augustus K. Uht; David Morano; David R. Kaeli
Archive | 2001
Augustus K. Uht; David Morano; Alireza Khalafi; Marcos de Alba; Thomas F. Wenisch; Maryam Ashouei; David R. Kaeli
Archive | 2008
Augustus K. Uht; David Morano; David R. Kaeli
Archive | 2008
Augustus K. Uht; David Morano; David R. Kaeli
Archive | 2011
Augustus K. Uht; David Morano; David R. Kaeli