IEEE Transactions on Network Science and Engineering | 2021

On Batch-Processing Based Coded Computing for Heterogeneous Distributed Computing Systems

 
 
 
 
 

Abstract


In recent years, coded distributed computing (CDC) has attracted significant attention, because it can efficiently facilitate many delay-sensitive computation tasks against unexpected latencies in distributed computing systems. Despite such a salient feature, many design challenges and opportunities remain. In this paper, we focus on practical computing systems with heterogeneous computing resources, and design a novel CDC approach, called batch-processing based coded computing (BPCC), which exploits the fact that every computing node can obtain some coded results before it completes the whole task. To this end, we first describe the main idea of the BPCC framework, and then formulate an optimization problem for BPCC to minimize the task completion time by configuring the computation load. Through formal theoretical analyses, extensive simulation studies, and comprehensive real experiments on the Amazon EC2 computing clusters, we demonstrate promising performance of the proposed BPCC scheme, in terms of high computational efficiency and robustness to uncertain disturbances.

Volume 8
Pages 2438-2454
DOI 10.1109/tnse.2021.3095040
Language English
Journal IEEE Transactions on Network Science and Engineering

Full Text