Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ethan Schuchman is active.

Publication


Featured researches published by Ethan Schuchman.


field programmable gate arrays | 2009

Intel nehalem processor core made FPGA synthesizable

Graham Schelle; Jamison D. Collins; Ethan Schuchman; Perry H. Wang; Xiang Zou; Gautham N. Chinya; Ralf Plate; Thorsten Mattner; Franz Olbrich; Per Hammarlund; Ronak Singhal; Jim Brayton; Sebastian Steibl; Hong Wang

We present an FPGA-synthesizable version of the Intel Atom processor core, synthesized to a Virtex-5 based FPGA emulation system. To make the production Atom design in SystemVerilog synthesizable through industry standard EDA tool flow, we transformed and mapped latches in the design, converted clock gating, and replaced nonsynthesizable constructs with FPGA-synthesizable counterparts. Additionally, as the target FPGA emulator is hosted on a PC platform with the Pentium-based CPU socket that supports a significantly different front side bus (FSB) protocol from that of the Atom processor, we replaced the existing bus control logic in the Atom core with an alternate FSB protocol to communicate with the rest of the PC platform. With these efforts, we succeeded in synthesizing the entire Atom processor core to fit within a single Virtex-5 LX330 FPGA. The synthesizable Atom core runs at 50Mhz on the Pentium PC motherboard with fully functional I/O peripherals. It is capable of booting off-the-shelf MS-DOS, Windows XP and Linux operating systems, and executing standard x86 workloads.


international conference on parallel architectures and compilation techniques | 2008

Pangaea: a tightly-coupled IA32 heterogeneous chip multiprocessor

Henry Wong; Anne Bracy; Ethan Schuchman; Tor M. Aamodt; Jamison D. Collins; Perry H. Wang; Gautham N. Chinya; Ankur Khandelwal Groen; Hong Jiang; Hong Wang

Moores Law and the drive towards performance efficiency have led to the on-chip integration of general-purpose cores with special-purpose accelerators. Pangaea is a heterogeneous CMP design for non-rendering workloads that integrates IA32 CPU cores with non-IA32 GPU-class multi-cores, extending the current state-of-the-art CPU-GPU integration that physically “fuses” existing CPU and GPU designs. Pangaea introduces (1) a resource repartitioning of the GPU, where the hardware budget dedicated for 3D-specific graphics processing is used to build more general-purpose GPU cores, and (2) a 3-instruction extension to the IA32 ISA that supports tighter architectural integration and fine-grain shared memory collaborative multithreading between the IA32 CPU cores and the non-IA32 GPU cores. We implement Pangaea and the current CPU-GPU designs in fully-functional synthesizable RTL based on the production quality RTL of an IA32 CPU and an Intel GMA X4500 GPU. On a 65 nm ASIC process technology, the legacy graphics-specific fixed-function hardware has the area of 9 GPU cores and total power consumption of 5 GPU cores. With the ISA extensions, the latency from the time an IA32 core spawns a GPU thread to the time the thread begins execution is reduced from thousands of cycles to fewer than 30 cycles. Pangaea is synthesized on a FPGA-based prototype and runs off-the-shelf IA32 OSes. A set of general-purpose non-graphics workloads demonstrate speedups of up to 8.8×.


computing frontiers | 2011

AstroLIT: enabling simulation-based microarchitecture comparison between Intel® and Transmeta designs

Guilherme Ottoni; Gautham N. Chinya; Gerolf F. Hoflehner; Jamison D. Collins; Amit Kumar; Ethan Schuchman; David R. Ditzel; Ronak Singhal; Hong Wang

While the out-of-order engine has been the mainstream micro-architecture-design paradigm to achieve high performance, Transmeta took a different approach using dynamic binary translation (BT). To enable detailed comparison of these two radically different processor-design approaches, it is natural to leverage well-established simulation-based methodologies. However, BT-based processor designs pose new challenges to standard sampling-based simulation methodologies. This paper describes these challenges, and it also introduces the AstroLIT methodology to address them.


Archive | 2009

Providing Hardware Support For Shared Virtual Memory Between Local And Remote Physical Memory

Gautham N. Chinya; Hong Wang; Deepak A. Mathaikutty; Jamison D. Collins; Ethan Schuchman; James P. Held; Ajay V. Bhatt; Prashant Sethi; Stephen F. Whalley


Archive | 2007

Accelerating software lookups by using buffered or ephemeral stores

Bratin Saha; Ali-Reza Adl-Tabatabai; Ethan Schuchman


Archive | 2007

Methods and apparatuses for reducing power consumption of processor switch operations

Ethan Schuchman; Hong Wang; Christopher T. Weaver; Belliappa Kuttanna; Asit Mallick; Vivek De; Per Hammarlund


Archive | 2014

Single instruction for specifying and saving a subset of registers, specifying a pointer to a work-monitoring function to be executed after waking, and entering a low-power mode

Ethan Schuchman; Hong Wang; Christopher T. Weaver; Belliappa Kuttanna; Asit Mallick; Vivek De; Per Hammarlund


Archive | 2010

virtual platform for prototyping system-on-chip designs

Gautham N. Chinya; Hong Wang; Ethan Schuchman


Archive | 2016

Instruction and logic for scheduling instructions

Sebastian Winkel; Ethan Schuchman; Tyler N. Sondag; Girish Venkatasubramanian


Archive | 2013

Methods and apparatus to validate translated guest code in a dynamic binary translator

Girish Venkatasubramanian; Chaitanya Mangla; Gerolf F. Hoflehner; Ethan Schuchman

Researchain Logo
Decentralizing Knowledge