Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ashish Jha is active.

Publication


Featured researches published by Ashish Jha.


arXiv: High Energy Physics - Lattice | 2016

MILC staggered conjugate gradient performance on Intel KNL

Ruizi Li; Carleton DeTar; Douglas W. Doerfler; Steven Gottlieb; Ashish Jha; Dhiraj D. Kalamkar; D. Toussaint

We review our work done to optimize the staggered conjugate gradient (CG) algorithm in the MILC code for use with the Intel Knights Landing (KNL) architecture. KNL is the second gener- ation Intel Xeon Phi processor. It is capable of massive thread parallelism, data parallelism, and high on-board memory bandwidth and is being adopted in supercomputing centers for scientific research. The CG solver consumes the majority of time in production running, so we have spent most of our effort on it. We compare performance of an MPI+OpenMP baseline version of the MILC code with a version incorporating the QPhiX staggered CG solver, for both one-node and multi-node runs.


High Performance Parallelism Pearls#R##N#Volume 2: Multicore and Many-core Programming Approaches | 2016

Numerical Weather Prediction Optimization

Tom Henderson; John Michalakes; Indraneil M. Gokhale; Ashish Jha

This chapter describes the efforts to improve performance of the widely used WSM6 microphysics package used in numerical weather prediction via a variety of techniques including threading, vectorization, array alignment, improving data locality, and optional use of compile-time constants for loop and array index bounds. Code examples are included to illustrate each technique along with its performance benefits on Intel Xeon processors and Intel Xeon Phi coprocessors. Use of tools such as the Intel Thread Inspector to speed the performance tuning process are also described.


Archive | 2008

Technique for allocating objects in a managed run time environment

Ashish Jha


Archive | 2017

METHOD AND APPARATUS FOR VECTOR INDEX LOAD AND STORE

Ashish Jha; Robert Valentine; Elmoustapha Ould-Ahmed-Vall


Archive | 2014

PROCESSORS, METHODS, SYSTEMS, AND INSTRUCTIONS TO CONSOLIDATE UNMASKED ELEMENTS OF OPERATION MASKS

Ashish Jha


Archive | 2018

ARCHITECTURAL REGISTER REPLACEMENT FOR INSTRUCTIONS THAT USE MULTIPLE ARCHITECTURAL REGISTERS

Mark J. Charney; Robert Valentine; Milind Girkar; Ashish Jha; Bret L. Toll; Elmoustapha Ould-Ahmed-Vall; Jesus Corbal San Adrian; Jason W. Brandt


Archive | 2017

AGGREGATE SCATTER INSTRUCTIONS

Ashish Jha; Elmoustapha Ould-Ahmed-Vall; Robert Valentine; Mark J. Charney; Milind Girkar


Archive | 2017

Systems, Apparatuses, and Methods for Strided Loads

Robert Valentine; Elmoustapha Ould-Ahmed-Vall; Jason W. Brandt; Mark J. Charney; Ashish Jha; Milind Girkar; Bret L. Toll; Evgeny V. Stupachenko; Sergey Y. Ostanevich


Archive | 2017

Systems, Apparatuses, and Methods for Getting Even and Odd Data Elements

Robert Valentine; Elmoustapha Ould-Ahmed-Vall; Jason W. Brandt; Mark J. Charney; Ashish Jha; Milind Girkar; Bret L. Toll; Evgeny V. Stupachenko; Sergey Y. Ostanevich


Archive | 2017

VECTOR STORE/LOAD INSTRUCTIONS FOR ARRAY OF STRUCTURES

Ashish Jha; Elmoustapha Ould-Ahmed-Vall; Robert Valentine; Mark J. Charney; Milind Girkar

Collaboration


Dive into the Ashish Jha's collaboration.

Researchain Logo
Decentralizing Knowledge