Steve Shih-wei Liao | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Steve Shih-wei Liao is active.

Explore More

Publication

Featured researches published by Steve Shih-wei Liao.

programming language design and implementation | 2002

Post-pass binary adaptation for software-based speculative precomputation

Steve Shih-wei Liao; Perry H. Wang; Hong Wang; Gerolf F. Hoflehner; Daniel M. Lavery; John Paul Shen

Recently, a number of thread-based prefetching techniques have been proposed. These techniques aim at improving the latency of single-threaded applications by leveraging multithreading resources to perform memory prefetching via speculative prefetch threads. Software-based speculative precomputation (SSP) is one such technique, proposed for multithreaded Itanium models. SSP does not require expensive hardware support-instead it relies on the compiler to adapt binaries to perform prefetching on otherwise idle hardware thread contexts at run time. This paper presents a post-pass compilation tool for generating SSP-enhanced binaries. The tool is able to: (1) analyze a single-threaded application to generate prefetch threads; (2) identify and embed trigger points in the original binary; and (3) produce a new binary that has the prefetch threads attached. The execution of the new binary spawns the speculative prefetch threads, which are executed concurrently with the main thread. Our results indicate that for a set of pointer-intensive benchmarks, the prefetching performed by the speculative threads achieves an average of 87% speedup on an in-order processor and 5% speedup on an out-of-order processor.

symposium on code generation and optimization | 2004

Physical experimentation with prefetching helper threads on Intel's hyper-threaded processors

Dongkeun Kim; Steve Shih-wei Liao; Perry H. Wang; J. del Cuvillo; Xinmin Tian; Xiang Zou; Hong Wang; Donald Yeung; Milind Girkar; John Paul Shen

Pre-execution techniques have received much attention as an effective way of prefetching cache blocks to tolerate the ever-increasing memory latency. A number of pre-execution techniques based on hardware, compiler, or both have been proposed and studied extensively by researchers. They report promising results on simulators that model a simultaneous multithreading (SMT) processor. We apply the helper threading idea on a real multithreaded machine, i.e., Intel Pentium 4 processor with hyper-threading technology, and show that indeed it can provide wall-clock speedup on real silicon. To achieve further performance improvements via helper threads, we investigate three helper threading scenarios that are driven by automated compiler infrastructure, and identify several key challenges and opportunities for novel hardware and software optimizations. Our study shows a program behavior changes dynamically during execution. In addition, the organizations of certain critical hardware structures in the hyper-threaded processors are either shared or partitioned in the multithreading mode and thus, the tradeoffs regarding resource contention can be intricate. Therefore, it is essential to judiciously invoke helper threads by adapting to the dynamic program behavior so that we can alleviate potential performance degradation due to resource contention. Moreover, since adapting to the dynamic behavior requires frequent thread synchronization, having light-weight thread synchronization mechanisms is important.

Archive | 2003

Speculative multi-threading for instruction prefetch and/or trace pre-build

Hong Wang; Tor M. Aamodt; Pedro Marcuello; Jared Stark; John Paul Shen; Antonio González; Per Hammarlund; Gerolf F. Hoflehner; Perry H. Wang; Steve Shih-wei Liao

Archive | 2003

Method and apparatus for affinity-guided speculative helper threads in chip multiprocessors

Hong Wang; Perry H. Wang; Jeffery A. Brown; Per Hammarlund; George Z. Chrysos; Doron Orenstein; Steve Shih-wei Liao; John Paul Shen

Archive | 2003

User-programmable low-overhead multithreading

Perry H. Wang; Hong Wang; John Paul Shen; Ashok N. Seshadri; Anthony N. Mah; William R. Greene; Ravi K. Chandran; Piyush Desai; Steve Shih-wei Liao

Archive | 2003

Method and apparatus for efficient utilization for prescient instruction prefetch

Tor M. Aamodt; Hong Wang; Per Hammarlund; John Paul Shen; Steve Shih-wei Liao; Perry H. Wang

Archive | 2003

Safe store for speculative helper threads

Hong Wang; Tor M. Aamodt; Per Hammarlund; John Paul Shen; Xinmin Tian; Milind Girkar; Perry H. Wang; Steve Shih-wei Liao

Archive | 2003

Control-quasi-independent-points guided speculative multithreading

Pedro Marcuello; Antonio González; Hong Wang; John Paul Shen; Per Hammarlund; Gerolf F. Hoflehner; Perry H. Wang; Steve Shih-wei Liao

Archive | 2002

Transformation of single-threaded code to speculative precomputation enabled code

Hong Wang; Perry H. Wang; Ross David Weldon; Scott M. Ettinger; Hideki Saito; Milind Girkar; Steve Shih-wei Liao; Mohammad R. Haghighat; Xinmin Tian; John Paul Shen; Oren Gershon

Archive | 2003