Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Dimitris S. Papailiopoulos is active.

Publication


Featured researches published by Dimitris S. Papailiopoulos.


very large data bases | 2013

XORing elephants: novel erasure codes for big data

Maheswaran Sathiamoorthy; Megasthenis Asteris; Dimitris S. Papailiopoulos; Alexandros G. Dimakis; Ramkumar Venkat Vadali; Scott Shaobing Chen; Dhruba Borthakur

Distributed storage systems for large clusters typically use replication to provide reliability. Recently, erasure codes have been used to reduce the large storage overhead of three-replicated systems. Reed-Solomon codes are the standard design choice and their high repair cost is often considered an unavoidable price to pay for high storage efficiency and high reliability. This paper shows how to overcome this limitation. We present a novel family of erasure codes that are efficiently repairable and offer higher reliability compared to Reed-Solomon codes. We show analytically that our codes are optimal on a recently identified tradeoff between locality and minimum distance. We implement our new codes in Hadoop HDFS and compare to a currently deployed HDFS module that uses Reed-Solomon codes. Our modified HDFS implementation shows a reduction of approximately 2× on the repair disk I/O and repair network traffic. The disadvantage of the new coding scheme is that it requires 14% more storage compared to Reed-Solomon codes, an overhead shown to be information theoretically optimal to obtain locality. Because the new codes repair failures faster, this provides higher reliability, which is orders of magnitude higher compared to replication.


international symposium on information theory | 2012

Locally repairable codes

Dimitris S. Papailiopoulos; Alexandros G. Dimakis

Distributed storage systems for large-scale applications typically use replication for reliability. Recently, erasure codes were used to reduce the large storage overhead, while increasing data reliability. A main limitation of off-the-shelf erasure codes is their high-repair cost during single node failure events. A major open problem in this area has been the design of codes that: 1) are repair efficient and 2) achieve arbitrarily high data rates. In this paper, we explore the repair metric of locality, which corresponds to the number of disk accesses required during a single node repair. Under this metric, we characterize an information theoretic tradeoff that binds together the locality, code distance, and storage capacity of each node. We show the existence of optimal locally repairable codes (LRCs) that achieve this tradeoff. The achievability proof uses a locality aware flow-graph gadget, which leads to a randomized code construction. Finally, we present an optimal and explicit LRC that achieves arbitrarily high data rates. Our locality optimal construction is based on simple combinations of Reed-Solomon blocks.


international conference on computer communications | 2012

Simple regenerating codes: Network coding for cloud storage

Dimitris S. Papailiopoulos; Jianqiang Luo; Alexandros G. Dimakis; Cheng Huang; Jin Li

Network codes designed specifically for distributed storage systems have the potential to provide dramatically higher storage efficiency for the same availability. One main challenge in the design of such codes is the exact repair problem: if a node storing encoded information fails, in order to maintain the same level of reliability we need to create encoded information at a new node. One of the main open problems in this emerging area has been the design of simple coding schemes that allow exact and low cost repair of failed nodes and have high data rates. In particular, all prior known explicit constructions have data rates bounded by 1/2. In this paper we introduce the first family of distributed storage codes that have simple look-up repair and can achieve rates up to 2/3. Our constructions are very simple to implement and perform exact repair by simple XORing of packets. We experimentally evaluate the proposed codes in a realistic cloud storage simulator and show significant benefits in both performance and reliability compared to replication and standard Reed-Solomon codes.


international symposium on information theory | 2013

Optimal locally repairable codes and connections to matroid theory

Itzhak Tamo; Dimitris S. Papailiopoulos; Alexandros G. Dimakis

Petabyte-scale distributed storage systems are currently transitioning to erasure codes to achieve higher storage efficiency. Classical codes like Reed-Solomon are highly suboptimal for distributed environments due to their high overhead in single-failure events. Locally Repairable Codes (LRCs) form a new family of codes that are repair efficient. In particular, LRCs minimize the number of nodes participating in single node repairs during which they generate small network traffic. Two large-scale distributed storage systems have already implemented different types of LRCs: Windows Azure Storage and the Hadoop Distributed File System RAID used by Facebook. The fundamental bounds for LRCs, namely the best possible distance for a given code locality, were recently discovered, but few explicit constructions exist. In this work, we present an explicit and simple to implement construction of optimal LRCs, for code parameters previously established by existence results. For the analysis of the optimality of our code, we derive a new result on the matroid represented by the codes generator matrix.


IEEE Transactions on Information Theory | 2014

Locally Repairable Codes

Dimitris S. Papailiopoulos; Alexandros G. Dimakis

Distributed storage systems for large-scale applications typically use replication for reliability. Recently, erasure codes were used to reduce the large storage overhead, while increasing data reliability. A main limitation of off-the-shelf erasure codes is their high-repair cost during single node failure events. A major open problem in this area has been the design of codes that: 1) are repair efficient and 2) achieve arbitrarily high data rates. In this paper, we explore the repair metric of locality, which corresponds to the number of disk accesses required during a single node repair. Under this metric, we characterize an information theoretic tradeoff that binds together the locality, code distance, and storage capacity of each node. We show the existence of optimal locally repairable codes (LRCs) that achieve this tradeoff. The achievability proof uses a locality aware flow-graph gadget, which leads to a randomized code construction. Finally, we present an optimal and explicit LRC that achieves arbitrarily high data rates. Our locality optimal construction is based on simple combinations of Reed-Solomon blocks.


IEEE Transactions on Information Theory | 2016

Locality and Availability in Distributed Storage

Ankit Singh Rawat; Dimitris S. Papailiopoulos; Alexandros G. Dimakis; Sriram Vishwanath

This paper studies the problem of information symbol availability in codes: we refer to a systematic code as code with


allerton conference on communication, control, and computing | 2011

Repair optimal erasure codes through hadamard designs

Dimitris S. Papailiopoulos; Alexandros G. Dimakis; Viveck R. Cadambe

(r, t)


global communications conference | 2010

Interference Alignment as a Rank Constrained Rank Minimization

Dimitris S. Papailiopoulos; Alexandros G. Dimakis

-availability if every information (systematic) symbol can be reconstructed from


international symposium on information theory | 2014

Locality and availability in distributed storage

Ankit Singh Rawat; Dimitris S. Papailiopoulos; Alexandros G. Dimakis; Sriram Vishwanath

t


Siam Journal on Optimization | 2017

Perturbed Iterate Analysis for Asynchronous Stochastic Optimization

Horia Mania; Xinghao Pan; Dimitris S. Papailiopoulos; Benjamin Recht; Kannan Ramchandran; Michael I. Jordan

disjoint groups of other code symbols, each of the sizes at most

Collaboration


Dive into the Dimitris S. Papailiopoulos's collaboration.

Top Co-Authors

Avatar

Alexandros G. Dimakis

University of Texas at Austin

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Megasthenis Asteris

University of Texas at Austin

View shared research outputs
Top Co-Authors

Avatar

Zachary B. Charles

University of Wisconsin-Madison

View shared research outputs
Top Co-Authors

Avatar

Maximilian Lam

University of California

View shared research outputs
Top Co-Authors

Avatar

George N. Karystinos

Technical University of Crete

View shared research outputs
Top Co-Authors

Avatar

Ankit Singh Rawat

University of Texas at Austin

View shared research outputs
Top Co-Authors

Avatar

Hongyi Wang

University of Wisconsin-Madison

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Benjamin Recht

University of California

View shared research outputs
Researchain Logo
Decentralizing Knowledge