Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Ilie Gabriel Tanase.
ieee international conference on high performance computing data and analytics | 2015
Lifeng Nai; Yinglong Xia; Ilie Gabriel Tanase; Hyesoon Kim; Ching-Yung Lin
With the emergence of data science, graph computing is becoming a crucial tool for processing big connected data. Although efficient implementations of specific graph applications exist, the behavior of full-spectrum graph computing remains unknown. To understand graph computing, we must consider multiple graph computation types, graph frameworks, data representations, and various data sources in a holistic way. In this paper, we present GraphBIG, a benchmark suite inspired by IBM System G project. To cover major graph computation types and data sources, GraphBIG selects representative datastructures, workloads and data sets from 21 real-world use cases of multiple application domains. We characterized GraphBIG on real machines and observed extremely irregular memory patterns and significant diverse behavior across different computations. GraphBIG helps users understand the impact of modern graph computing on the hardware architecture and enables future architecture and system research.
Proceedings of Workshop on GRAph Data management Experiences and Systems | 2014
Ilie Gabriel Tanase; Yinglong Xia; Lifeng Nai; Yanbin Liu; Wei Tan; Jason Crawford; Ching-Yung Lin
Graph analytics on big data is currently a very active area of research in both industry and academia. To support graph analytics efficiently a large number of graph processing systems have emerged targeting various perspectives of a graph application such as in memory and on disk representations, persistent storage, database capability, runtimes and execution models for exploiting parallelism, etc. In this paper we discuss a novel graph processing system called System G Native Store which allows for efficient graph data organization and processing on modern computing architectures. In particular we describe a runtime designed to exploit multiple levels of parallelism and a generic infrastructure that allows users to express graphs with various in memory and persistent storage properties. We experimentally show the efficiency of System G Native Store for processing graph queries on state-of-the-art platforms.
international conference on big data | 2014
Yinglong Xia; Ilie Gabriel Tanase; Lifeng Nai; Wei Tan; Yanbin Liu; Jason Crawford; Ching-Yung Lin
Many Big Data analytics essentially explore the relationship among interconnected entities, which are naturally represented as graphs. However, due to the irregular data access patterns in the graph computations, it remains a fundamental challenge to deliver highly efficient solutions for large scale graph analytics. Such inefficiency restricts the utilization of many graph algorithms in Big Data scenarios. To address the performance issues in large scale graph analytics, we develop a graph processing system called System G, which explores efficient graph data organization for parallel computing architectures. We discuss various graph data organizations and their impact on data locality during graph traversals, which results in various cache performance behavior on processor side. In addition, we analyze data parallelism from architectures perspective and experimentally show the efficiency for System G based graph analytics. We present experimental results for commodity multicore clusters and IBM PERCS supercomputers to illustrate the performance of System G for large scale graph analytics.
acm sigplan symposium on principles and practice of parallel programming | 2014
Haichuan Wang; Peng Wu; Ilie Gabriel Tanase; Mauricio J. Serrano; José E. Moreira
Using SIMD (Single Instruction Multiple Data) is a cost-effective way to explore data parallelism on modern processors. Most processor vendors today provide SIMD engines, such as Altivec/VSX for POWER, SSE/AVX for Intel processors, and NEON for ARM. While high-level SIMD programming models are rapidly evolving, for many SIMD developers, the most effective way to get the performance out of SIMD is still by programming directly via vendor-provided SIMD intrinsics. However, intrinsics programming is both tedious and error-prone, and worst of all, introduces non-portable codes. This paper presents the Generic SIMD Library (https://github.com/genericsimd/generic_simd/), an open-source, portable C++ interface that provides an abstraction of short vectors and overloads most C/C++ operators for short vectors. The library provides several mappings from platform-specific intrinsics to the generic SIMD intrinsic interface so that codes developed based on the library are portable across different SIMD platforms. We have evaluated the library with several applications from the multimedia, data analytics and math domains. Compared with platform-specific intrinsics codes, using Generic SIMD Library results in less line-of-code, a 22% reduction on average, and achieves similar performance as platform-specific intrinsics versions.
Journal of Parallel and Distributed Computing | 2017
Lifeng Nai; Yinglong Xia; Ilie Gabriel Tanase; Hyesoon Kim
Abstract Graph computing is widely applied in a large number of big data applications. Despite its importance, high performance graph computing remains a challenge, especially for large-scale graphs. In this paper, by analyzing from the architectural perspective, we study computational behaviors of graph computing in real-world use cases. We benchmark a set of representative graph algorithms implemented on a unified framework and conduct experiments to analyze comprehensive performance characteristics. In the characterization, we observed multiple insights, including irregular memory patterns, significant diverse behavior across different computations, highly data dependent behaviors, etc., using large-scale synthetic and real-world graphs. To the best of our knowledge, this is the first comprehensive architectural study on the full-scope of graph computing. It can improve our understanding on graph computing and help high performance computing research for graph-based big data applications.
european conference on parallel processing | 2015
Guojing Cong; Ilie Gabriel Tanase; Yinglong Xia
We propose new approaches for accelerating minimum spanning forest algorithms on shared-memory platforms. Our approaches improve cache performance and reduce synchronization overhead of the base algorithms. On our target platform these optimizations achieve up to an order of magnitude speedup over the best prior parallel \({Bor{\mathring{u}}vka}\) implementation.
Archive | 2015
Kattamuri Ekanadham; William P. Horn; Joefon Jann; Manoj Kumar; José E. Moreira; Pratap Pattnaik; Mauricio J. Serrano; Ilie Gabriel Tanase; Hao Yu
Archive | 2013
Tsai-Yang Jea; Serban C. Maerean; Ilie Gabriel Tanase; Hanhong Xue
Archive | 2013
Gheorghe Almasi; Barnaby Dalton; Ilie Gabriel Tanase; Ettore Tiotto
Archive | 2015
Kattamuri Ekanadham; William P. Horn; Joefon Jann; Manoj Kumar; José E. Moreira; Pratap Pattnaik; Mauricio J. Serrano; Ilie Gabriel Tanase; Hao Yu