Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where George Ostrouchov is active.

Publication


Featured researches published by George Ostrouchov.


Big Data Research | 2017

Programming with BIG Data in R: Scaling Analytics from One to Thousands of Nodes ☆ ☆☆

Drew Schmidt; Wei Chen Chen; Michael A. Matheson; George Ostrouchov

Abstract We present a tutorial overview showing how one can achieve scalable performance with R. We do so by utilizing several package extensions, including those from the pbdR project. These packages consist of high performance, high-level interfaces to and extensions of MPI, PBLAS, ScaLAPACK, I/O libraries, profiling libraries, and more. While these libraries shine brightest on large distributed platforms, they also work rather well on small clusters and often, surprisingly, even on a laptop with only two cores. Our tutorial begins with recommendations on how to get more performance out of your R code before considering parallel implementations. Because R is a high-level language, a function can have a deep hierarchy of operations. For big data, this can easily lead to inefficiency. Profiling is an important tool to understand the performance of an R code for both serial and parallel improvements. The pbdR packages provide a highly scalable capability for the development of novel distributed data analysis algorithms. This level of scalability is unmatched in other analysis software. Interactive speeds (seconds) are achieved for complex analysis algorithms on data 100 GB and more. This is possible because the interfaces add little overhead to the scalable libraries and their extensions. Furthermore, this is often achieved with little or no change to serial R codes. Our overview includes codes of varying complexity, illustrating reading data in parallel, the process of changing a serial code to a distributed parallel code, and how to engage distributed matrix computation from within R.


Archive | 2016

Programming with Big Data – Interface to MPI

Wei-Chen Chen; George Ostrouchov; Drew Schmidt; Pragneshkumar Patel; Hao Yu


Archive | 2017

Parallel Statistical Computing with R: An Illustration on Two Architectures

George Ostrouchov; Wei-Chen Chen; Drew Schmidt


Archive | 2016

Programming with Big Data – Scalable Linear Algebra Packages

Wei-Chen Chen; Drew Schmidt; George Ostrouchov; Pragneshkumar Patel


Archive | 2016

Programming with Big Data – Demonstrations and Examples Using'pbdR' Packages

Drew Schmidt; Wei-Chen Chen; George Ostrouchov; Pragneshkumar Patel


Archive | 2016

Programming with Big Data – Interface to ZeroMQ

Wei-Chen Chen; Drew Schmidt; Christian Heckendorf; George Ostrouchov


Archive | 2014

Programming with Big Data – Interface to Parallel UnidataNetCDF4 Format Data Files

Pragneshkumar Patel; George Ostrouchov; Wei-Chen Chen; Drew Schmidt; David Pierce


Archive | 2014

Programming with Big Data – Demonstrations of pbd Packages

Drew Schmidt; Wei-Chen Chen; George Ostrouchov; Pragneshkumar Patel


Archive | 2014

Programming with Big Data — MPI Profiling Tools

Wei-Chen Chen; Drew Schmidt; Gaurav Sehrawat; Pragneshkumar Patel; George Ostrouchov


Archive | 2013

Programming with Big Data – Distributed Matrix Methods

Drew Schmidt; Wei-Chen Chen; George Ostrouchov; Pragneshkumar Patel

Collaboration


Dive into the George Ostrouchov's collaboration.

Top Co-Authors

Avatar

Drew Schmidt

University of Tennessee

View shared research outputs
Top Co-Authors

Avatar

Wei-Chen Chen

Oak Ridge National Laboratory

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Michael A. Matheson

Oak Ridge National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Wei Chen Chen

Food and Drug Administration

View shared research outputs
Researchain Logo
Decentralizing Knowledge