Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sai Luo is active.

Publication


Featured researches published by Sai Luo.


Operating Systems Review | 2011

Optimizing a shared virtual memory system for a heterogeneous CPU-accelerator platform

Shoumeng Yan; Xiaocheng Zhou; Ying Gao; Hu Chen; Gansha Wu; Sai Luo; Bratin Saha

The client computing platform is moving towards a heterogeneous architecture that combines scalar-oriented CPU cores and throughput-oriented accelerator cores. Recognizing that existing programming models for such heterogeneous platforms are still difficult for most programmers, we advocate a shared virtual memory programming model to improve programmability. In this paper, we focus on performance, and demonstrate that users need not sacrifice performance for programmability. We describe our approaches, experiences, and results in optimizing MYO on a heterogeneous platform consisting of a CPU and an Aubrey Isle accelerator. Our efforts involve the whole system software stack including the OS, runtime, and application.


ieee international conference on high performance computing, data, and analytics | 2009

Terascale chip multiprocessor memory hierarchy and programming model

Shoumeng Yan; Xiaocheng Zhou; Ying Gao; Hu Chen; Sai Luo; Peinan Zhang; Naveen Cherukuri; Ronny Ronen; Bratin Saha

Small scale chip multiprocessors are being shipped in volume by all microprocessor vendors. Many of these vendors are also investigating large scale chip multiprocessors targeted towards highly parallel workloads in media, graphics, and others. One of the most challenging aspects of architecting terascale processors is the design of a scalable memory hierarchy. Current proposals for providing coherent shared memory in terascale systems require a sophisticated coherence protocol and memory hierarchy. In this paper we propose an alternate memory configuration along with a programming model that significantly simplifies the terascale memory hierarchy. Our proposal still provides fully coherent shared memory but eliminates the hardware coherence protocol. Our programming model enables the programmer to better express the memory characteristic of terascale workloads. Finally, our proposed memory hierarchy performs better and is more scalable than conventional designs.


Archive | 2010

Sharing virtual functions in a shared virtual memory between heterogeneous processors of a computing platform

Shoumeng Yan; Sai Luo; Xiaocheng Zhou; Ying Gao; Hu Chen; Bratin Saha


Archive | 2009

SHARING VIRTUAL MEMORY-BASED MULTI-VERSION DATA BETWEEN THE HETEROGENOUS PROCESSORS OF A COMPUTER PLATFORM

Ying Gao; Hu Chen; Shoumeng Yan; Xiaocheng Zhou; Sai Luo; Bratin Saha


Archive | 2009

Two way communication support for heterogenous processors of a computer platform

Shoumeng Yan; Xiaocheng Zhou; Ying Gao; Mohan Rajagopalan; Rajiv Deodhar; David M. Putzolu; Clark Nelson; Milind Girkar; Robert Geva; Tiger Chen; Sai Luo; Stephen Junkins; Bratin Saha; Ravi Narayanaswamy; Patrick Xi


Archive | 2011

Passing data from a CPU to a graphics processor by writing multiple versions of the data in a shared memory

Ying Gao; Hu Chen; Shoumeng Yan; Xiaocheng Zhou; Sai Luo; Bratin Saha


usenix conference on hot topics in parallelism | 2010

A balanced programming model for emerging heterogeneous multicore systems

Wei Liu; Brian T. Lewis; Xiaocheng Zhou; Hu Chen; Ying Gao; Shoumeng Yan; Sai Luo; Bratin Saha


Archive | 2010

Recording Dirty Information in Software Distributed Shared Memory Systems

Shoumeng Yan; Ying Gao; Xiaocheng Zhou; Hu Chen; Sai Luo; Bratin Saha


Archive | 2017

MULTICORE MEMORY DATA RECORDER FOR KERNEL MODULE

Sai Luo; Tin-Fook Ngai; Hu Chen; Xiaocheng Zhou; Chunxiao Lin; Kang Zhao


Archive | 2010

The common use of virtual memory-based multiple-version data between the various processors of a computer platform

Hu Chen; Ying Gao; Sai Luo; Bratin Calif. Saha; Shoumeng Yan; Xiaocheng Zhou

Collaboration


Dive into the Sai Luo's collaboration.

Researchain Logo
Decentralizing Knowledge