Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Mark J. Charney is active.

Publication


Featured researches published by Mark J. Charney.


high performance computer architecture | 2001

Branch history guided instruction prefetching

Viji Srinivasan; Edward S. Davidson; Gary S. Tyson; Mark J. Charney; Thomas R. Puzak

Instruction cache misses stall the fetch stage of the processor pipeline and hence affect instruction supply to the processor. Instruction prefetching has been proposed as a mechanism to reduce instruction cache (I-cache) misses. However, a prefetch is effective only if accurate and initiated sufficiently early to cover the miss penalty. This paper presents a new hardware-based instruction prefetching mechanism, Branch History Guided Prefetching (BHGP), to improve the timeliness of instruction prefetches. BHGP correlates the execution of a branch instruction with I-cache misses and uses branch instructions to trigger prefetches of instructions that occur (N-1) branches later in the program execution, for a given N>1. Evaluations on commercial applications, windows-NT applications, and some CPU2000 applications show an average reduction of 66% in miss rate over all applications. BHGP improved the IPC bp 12 to 14% for the CPU2000 applications studied; on average 80% of the BHGP prefetches arrived in cache before their next use, even on a 4-wide issue machine with a 15 cycle L2 access penalty.


Ibm Journal of Research and Development | 1997

Profetching and memory system behavior of the SPEC95 benchmark suite

Mark J. Charney; Thomas R. Puzak

This paper presents instruction and data cache miss rates for the SPEC95™ benchmark suite. We have simulated the instruction and data traffic resulting from 500 million instructions of each of the 18 programs. Simulation results show that only a few of the applications place more than modest demands on the memory system. This was noticed for instruction caches, where only a few workloads required more than a 32Kb cache to achieve miss rates of less than one miss every 1000 instructions. We also analyze two prefetching algorithms using the SPEC95 workload: next-sequential prefetching and shadow-directory prefetching. Each prefetching algorithm is evaluated using three performance metrics: coverage, accuracy, and traffic. Variations in each prefetching algorithm involve the use of a confirmation mechanism that receives feedback information about the quality of each prefetch. With confirmation, the prefetching algorithm is able to enhance the accuracy of prefetching decisions. The results show that shadow-directory prefetching averages miss coverage about ten percent higher than next-sequential prefetching when used in prefetching instructions (about 60 percent coverage for next-sequential prefetching versus 70 percent for shadow-directory prefetching). The prefetching accuracy for both algorithms is more than 90 percent when a confirmation mechanism is used. In general, data prefetching is shown to be less accurate and to provide less coverage than instruction prefetching. Shadow-directory prefetching averaged about a 40 percent miss coverage versus a 25 percent miss coverage for next-sequential prefetching. Prefetching accuracy is over 70 percent when confirmation is applied.


international conference on parallel architectures and compilation techniques | 1999

Dynamic linking on a shared-memory multiprocessor

Bowen Alpern; Mark J. Charney; Jong-Deok Choi; Anthony Cocchi; Derek Lieber

This paper presents a technique for back-patching instructions in an SMP environment. This technique is used by the Jalapeno virtual machine to support dynamic class loading in Java. There is a small runtime overhead the first time a back-patch site is executed. Thereafter, it executes at the same speed as equivalent sites not requiring back-patching.


Archive | 2000

Memory Behavior of the SPEC2000 Benchmark Suite

Suleyman Sair; Mark J. Charney


Archive | 1996

Automatic cache prefetch timing with dynamic trigger migration

Mark J. Charney; Pradeep Dubey; Thomas Robert Puzak; William J. Starke


Archive | 1999

Method and apparatus for reducing latency in set-associative caches using set prediction

Mark J. Charney; Philip G. Emma; Daniel A. Prener; Thomas R. Puzak


Archive | 1999

Prefetching using future branch path information derived from branch prediction

Thomas R. Puzak; Allan M. Hartstein; Mark J. Charney; Daniel A. Prener; Peter Howland Oden


Archive | 2014

Vector friendly instruction format and execution thereof

Robert Valentine; Jesus Corbal San Adrian; Roger Espasa Sans; Robert D. Cavin; Bret L. Toll; Santiago Galan Duran; Jeffrey G. Wiedemeier; Sridhar Samudrala; Milind Girkar; Edward T. Grochowski; Jonathan C. Hall; Dennis R. Bradford; Elmoustapha Ould-Ahmed-Vall; James C. Abel; Mark J. Charney; Seth Abraham; Suleyman Sair; Andrew T. Forsyth; Lisa Wu; Charles R. Yount


Archive | 1999

Methods for caching cache tags

Albert Chang; Mark J. Charney; Robert K. Montoye; Thomas R. Puzak


Archive | 2014

METHODS AND APPARATUS FOR FUSING INSTRUCTIONS TO PROVIDE OR-TEST AND AND-TEST FUNCTIONALITY ON MULTIPLE TEST SOURCES

Maxim Loktyukhin; Robert Valentine; Julian C. Horn; Mark J. Charney

Researchain Logo
Decentralizing Knowledge