IEEE Transactions on Sustainable Computing | 2019

SPFC: An Effective Optimization for Vertex-Centric Graph Processing Systems

 
 
 
 
 

Abstract


The real-world demands of mining big data and smart data of graph structure have led to an active research of distributed graph processing. Many distributed graph processing systems <xref ref-type= bibr rid= ref19 >[19]</xref> , <xref ref-type= bibr rid= ref22 >[22]</xref> , <xref ref-type= bibr rid= ref23 >[23]</xref> adopt a vertex-centric programming paradigm. In these systems, messages are passed between vertices to propagate the latest states. The communication efficiency and the high overhead of synchronization are two key considerations of these systems <xref ref-type= bibr rid= ref8 >[8]</xref> , <xref ref-type= bibr rid= ref12 >[12]</xref> . In this paper, we propose a <bold>S</bold>low <bold>P</bold>assing <bold>F</bold>ast <bold>C</bold>onsuming (<bold>SPFC</bold>) approach which can effectively improve the overall performance of vertex-centric graph processing systems. In our approach, the message passing is slow but the consuming is fast. More specifically, at the message sender side, priority is given to those smart messages which contribute more to the algorithm convergence, and at the message receiver side, messages are consumed right after arriving without any delay and intermediate buffer. Besides, by using a two-phase termination check protocol, the global synchronous barrier can be completely eliminated. In addition, based on the slow message passing strategy, further performance improvement can be achieved with some accuracy loss by eliminating those messages which are less useful for algorithm convergence. We implement our approach based on Apache Giraph <xref ref-type= bibr rid= ref1 >[1]</xref> and evaluate it on a 12-machine cluster. The experimental results show that our method can effectively reduce the amount of message traffic and achieve up to an order of magnitude performance improvement compared with Giraph and GraphLab <xref ref-type= bibr rid= ref3 >[3]</xref> .

Volume 4
Pages 118-131
DOI 10.1109/TSUSC.2017.2780320
Language English
Journal IEEE Transactions on Sustainable Computing

Full Text