Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sridhar Samudrala is active.

Publication


Featured researches published by Sridhar Samudrala.


architectural support for programming languages and operating systems | 2014

Speculative hardware/software co-designed floating-point multiply-add fusion

Marc Lupon; Enric Gibert; Grigorios Magklis; Sridhar Samudrala; Raúl Martínez; Kyriakos Stavrou; David R. Ditzel

A Fused Multiply-Add (FMA) instruction is currently available in many general-purpose processors. It increases performance by reducing latency of dependent operations and increases precision by computing the result as an indivisible operation with no intermediate rounding. However, since the arithmetic behavior of a single-rounding FMA operation is different than independent FP multiply followed by FP add instructions, some algorithms require significant revalidation and rewriting efforts to work as expected when they are compiled to operate with FMA--a cost that developers may not be willing to pay. Because of that, abundant legacy applications are not able to utilize FMA instructions. In this paper we propose a novel HW/SW collaborative technique that is able to efficiently execute workloads with increased utilization of FMA, by adding the option to get the same numerical result as separate FP multiply and FP add pairs. In particular, we extended the host ISA of a HW/SW co-designed processor with a new Combined Multiply-Add (CMA) instruction that performs an FMA operation with an intermediate rounding. This new instruction is used by a transparent dynamic translation software layer that uses a speculative instruction-fusion optimization to transform FP multiply and FP add sequences into CMA instructions. The FMA unit has been slightly modified to support both single-rounding and double-rounding fused instructions without increasing their latency and to provide a conservative fall-back path in case of mispeculation. Evaluation on a cycle-accurate timing simulator showed that CMA improved SPECfp performance by 6.3% and reduced executed instructions by 4.7%.


Archive | 2011

Systems, apparatuses, and methods for expanding a memory source into a destination register and compressing a source register into a destination memory location

Jesus Corbal San Adrian; Roger Espasa Sans; Robert Valentine; Santiago Galan Duran; Jeffrey G. Wiedemeier; Sridhar Samudrala; Milind Girkar; Andrew T. Forsyth; Victor W. Lee


Archive | 2014

Vector friendly instruction format and execution thereof

Robert Valentine; Jesus Corbal San Adrian; Roger Espasa Sans; Robert D. Cavin; Bret L. Toll; Santiago Galan Duran; Jeffrey G. Wiedemeier; Sridhar Samudrala; Milind Girkar; Edward T. Grochowski; Jonathan C. Hall; Dennis R. Bradford; Elmoustapha Ould-Ahmed-Vall; James C. Abel; Mark J. Charney; Seth Abraham; Suleyman Sair; Andrew T. Forsyth; Lisa Wu; Charles R. Yount


Archive | 2011

INSTRUCTION AND LOGIC TO PROVIDE VECTOR BLEND AND PERMUTE FUNCTIONALITY

Robert Valentine; Bret L. Toll; Jeff Wiedemeier; Sridhar Samudrala; Jesus Corbal


Archive | 2010

Functional unit for vector integer multiply add instruction

Jeff Wiedemeier; Sridhar Samudrala; Roger A. Golliver


Archive | 2011

SYSTEMS, APPARATUSES, AND METHODS FOR BLENDING TWO SOURCE OPERANDS INTO A SINGLE DESTINATION USING A WRITEMASK

Jesus Corbal San Adrian; Bret L. Toll; Robert Valentine; Jeffrey G. Wiedemeier; Sridhar Samudrala; Milind Girkar; Andrew T. Forsyth; Elmoustapha Ould-Ahmed-Vall; Dennis R. Bradford; Lisa K. Wu


Archive | 2015

DOUBLE ROUNDED COMBINED FLOATING-POINT MULTIPLY AND ADD

Sridhar Samudrala; Grigorios Magklis; Marc Lupon; David R. Ditzel


Archive | 2011

FUNCTIONAL UNIT FOR VECTOR LEADING ZEROES, VECTOR TRAILING ZEROES, VECTOR OPERAND 1s COUNT AND VECTOR PARITY CALCULATION

Jeff Wiedemeier; Sridhar Samudrala; Roger A. Golliver; Eric W. Mahurin


Archive | 2013

MECHANISM FOR FACILITATING DYNAMIC AND EFFICIENT FUSION OF COMPUTING INSTRUCTIONS IN SOFTWARE PROGRAMS

Marc Lupon; Raúl Martínez; Enric Gibert Codina; Kyriakos Stavrou; Grigorios Magklis; Sridhar Samudrala


Archive | 2017

MXCSR control method and apparatus

Grigorios Magklis; Josep M. Codina; Craig B. Zilles; Michael Neilly; Sridhar Samudrala; Alejandro Martinez Vicente; Polychronis Xekalakis; F. Jesús Sánchez; Marc Lupon; Georgios Tournavitis; Enric Gibert Codina; Crispin Gomez Requena; Antonio González; Mirem Hyuseinova; Christos E. Kotselidis; Fernando Latorre; Pedro Lopez; Carlos Madriles Gimeno; Pedro Marcuello; Raúl Martínez; Daniel Ortega; Demos Pavlou; Kyriakos Stavrou

Researchain Logo
Decentralizing Knowledge