Publication


Featured research published by Brian Cho.


symposium on cloud computing | 2010

Making cloud intermediate data fault-tolerant

Steven Y. Ko; Imranul Hoque; Brian Cho; Indranil Gupta

Parallel dataflow programs generate enormous amounts of distributed data that are short-lived, yet critical for completion of the job and for good run-time performance. We call this class of data intermediate data. This paper is the first to treat intermediate data as a first-class citizen, specifically targeting and minimizing the effect of run-time server failures on the availability of intermediate data, and thus on performance metrics such as job completion time. We propose new design techniques for a new storage system called ISS (Intermediate Storage System), implement these techniques within Hadoop, and experimentally evaluate the resulting system. With no failures, the performance of Hadoop augmented with ISS (i.e., job completion time) is comparable to base Hadoop. Under a failure, Hadoop with ISS outperforms base Hadoop and incurs as little as 18% overhead compared to base no-failure Hadoop, depending on the testbed setup.


symposium on cloud computing | 2013

Natjam: design and evaluation of eviction policies for supporting priorities and deadlines in mapreduce clusters

Brian Cho; Muntasir Raihan Rahman; Tej Chajed; Indranil Gupta; Cristina L. Abad; Nathan Roberts; Philbert Lin

This paper presents Natjam, a system that supports arbitrary job priorities, hard real-time scheduling, and efficient preemption for resource-constrained Mapreduce clusters. Our contributions include: i) exploration and evaluation of smart eviction policies for jobs and for tasks, based on resource usage, task runtime, and job deadlines; and ii) a work-conserving task preemption mechanism for Mapreduce. We incorporated Natjam into the Hadoop YARN scheduler framework (in Hadoop 0.23). We present experiments from deployments on a test cluster, Emulab, and a Yahoo! Inc. commercial cluster, using both synthetic workloads and Hadoop cluster traces from Yahoo!. Our results reveal that Natjam incurs overheads as low as 7%, and is preferable to existing approaches.
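To make the idea of an eviction policy concrete, here is a minimal sketch of one plausible policy shape suggested by the abstract's criteria (job deadlines and task runtimes): preempt from the job whose deadline is furthest away, and within it the task with the least remaining work, so the evicted work is cheap to redo. The class names and the specific rule are illustrative assumptions, not the paper's actual policies.

```python
from dataclasses import dataclass

@dataclass
class Task:
    task_id: str
    remaining_secs: float  # estimated remaining runtime of this task

@dataclass
class Job:
    job_id: str
    deadline: float  # absolute deadline (e.g., epoch seconds)
    tasks: list

def pick_victim(running_jobs):
    """Pick one task to preempt: choose the job whose deadline is
    furthest in the future, then its task with the least remaining
    work, so urgent jobs keep resources and little work is lost."""
    job = max(running_jobs, key=lambda j: j.deadline)
    task = min(job.tasks, key=lambda t: t.remaining_secs)
    return job.job_id, task.task_id
```

A real scheduler would also weigh resource usage and re-execution cost, as the paper's evaluated policies do.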


international conference on cluster computing | 2010

Breaking the MapReduce Stage Barrier

Abhishek Verma; Nicolas Zea; Brian Cho; Indranil Gupta; Roy H. Campbell

The MapReduce model uses a barrier between the Map and Reduce stages. This provides simplicity in both programming and implementation. However, in many situations, this barrier hurts performance because it is overly restrictive. Hence, we develop a method to break the barrier in MapReduce in a way that improves efficiency. Careful design of our barrier-less MapReduce framework results in equivalent generality and retains ease of programming. We motivate our case with, and experimentally study our barrier-less techniques in, a wide variety of MapReduce applications divided into seven classes. Our experiments show that our approach can achieve better job completion times than a traditional MapReduce framework. This is due primarily to the interleaving of I/O and computation, and forgoing disk-intensive work. We achieve a reduction in job completion times that is 25% on average and 87% in the best case.
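The core idea of removing the Map/Reduce barrier can be sketched in a few lines: when the reduce function is commutative and associative, a reducer can fold map outputs into a running aggregate as they stream in, rather than waiting for every mapper to finish. This toy word count is an illustration of that interleaving only, under that assumption; the paper's framework handles a much wider range of application classes.

```python
from collections import defaultdict

def map_phase(records):
    # Emit (word, 1) pairs lazily, one at a time, as a generator,
    # simulating map outputs arriving incrementally.
    for line in records:
        for word in line.split():
            yield word, 1

def barrierless_reduce(pairs):
    """Fold each (key, value) pair into a running aggregate as it
    arrives, instead of blocking until all map tasks complete (the
    'barrier'). Correct only for commutative, associative reduces."""
    counts = defaultdict(int)
    for key, value in pairs:
        counts[key] += value
    return dict(counts)

result = barrierless_reduce(map_phase(["a b a", "b c"]))
```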


international conference on autonomic computing | 2011

Budget-constrained bulk data transfer via internet and shipping networks

Brian Cho; Indranil Gupta

Cloud collaborators wish to combine large amounts of data, on the order of terabytes, from multiple distributed locations into a single datacenter. Such groups face the challenge of reducing the latency of the transfer without incurring excessive dollar costs. Pandora is an autonomic system that creates data transfer plans satisfying both latency and cost needs by considering transfers over the Internet as well as disk shipments. Solving the planning problem is a critical step towards a truly autonomic bulk data transfer service. In this paper, we develop techniques to create an optimal transfer plan that minimizes transfer latency subject to a budget constraint. To systematically explore the solution space, we develop efficient binary search methods that find the optimal shipment transfer plan. Our experimental evaluation, driven by Internet bandwidth traces and actual shipment costs queried from FedEx web services, shows that these techniques work well on diverse, realistic networks.
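The binary search the abstract mentions can be sketched generically. Assuming the cheapest plan's cost is non-increasing in the allowed latency (more time permits cheaper shipments), feasibility under a budget is monotone, so the smallest feasible latency can be found by bisection. The function name and interface here are illustrative assumptions, not Pandora's API.

```python
def min_latency_within_budget(cost_for_latency, budget, lo, hi, tol=1e-3):
    """Binary-search the smallest latency whose cheapest plan fits the
    budget. Assumes cost_for_latency(t) is non-increasing in t, so the
    feasible region {t : cost_for_latency(t) <= budget} is an interval
    [t*, hi]; bisection converges to t*."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if cost_for_latency(mid) <= budget:
            hi = mid  # feasible: try to finish sooner
        else:
            lo = mid  # too expensive: allow more time
    return hi
```

In Pandora, each feasibility probe corresponds to solving for the cheapest plan under a latency bound; the toy cost function below stands in for that solver.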


international conference on distributed computing systems | 2010

New Algorithms for Planning Bulk Transfer via Internet and Shipping Networks

Brian Cho; Indranil Gupta

Cloud computing is enabling groups of academic collaborators, groups of business partners, etc., to come together in an ad-hoc manner. This paper focuses on the group-based data transfer problem in such settings. Each participant source site in such a group has a large dataset, which may range in size from gigabytes to terabytes. This data needs to be transferred to a single sink site (e.g., AWS, Google datacenters, etc.) in a manner that reduces both the total dollar cost incurred by the group and the total transfer latency of the collective dataset. This paper is the first to explore the problem of planning a group-based deadline-oriented data transfer in a scenario where data can be sent both: (1) over the Internet, and (2) by shipping storage devices (e.g., external or hot-plug drives, or SSDs) via companies such as FedEx, UPS, USPS, etc. We first formalize the problem and prove its NP-Hardness. Then, we propose novel algorithms and use them to build a planning system called Pandora (People and Networks Moving Data Around). Pandora uses new concepts of time-expanded networks and delta-time-expanded networks, combining them with integer programming techniques and optimizations for both shipping and Internet edges. Our experimental evaluation using real data from FedEx and from PlanetLab indicates that the Pandora planner manages to satisfy deadlines and reduce costs significantly.
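A time-expanded network, the construct the abstract names, can be sketched as follows: each site is copied once per time step, a link with transit delay d becomes an edge from (src, t) to (dst, t + d), and "holdover" edges let data wait in place. This is a generic textbook construction offered for intuition; Pandora additionally prices edges and solves an integer program over such a graph, details this sketch omits.

```python
def time_expanded_network(sites, links, horizon):
    """Build a time-expanded copy of a transfer network.
    sites: list of site names.
    links: list of (src, dst, delay_steps, cost) tuples.
    horizon: number of discrete time steps to expand over."""
    nodes = [(s, t) for s in sites for t in range(horizon + 1)]
    edges = []
    # Holdover edges: data may wait at a site from t to t+1 at no cost.
    for s in sites:
        for t in range(horizon):
            edges.append(((s, t), (s, t + 1), 0.0))
    # Transfer edges: a link with delay d connects time layer t to t+d.
    for src, dst, delay, cost in links:
        for t in range(horizon + 1 - delay):
            edges.append(((src, t), (dst, t + delay), cost))
    return nodes, edges
```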


acm ifip usenix international conference on middleware | 2007

AVMEM: availability-aware overlays for management operations in non-cooperative distributed systems

Ramsés Morales; Brian Cho; Indranil Gupta

Monitoring and management operations that query nodes based on their availability can be extremely useful in a variety of large-scale distributed systems containing hundreds to thousands of hosts, e.g., p2p systems, Grids, and PlanetLab. This paper presents decentralized and scalable solutions to a subset of such availability-based management tasks. Specifically, we propose AVMEM, which is the first availability-aware overlay to date. AVMEM is intended for generic non-cooperative scenarios where nodes may be selfish and may wish to route messages to a large set of other nodes, especially if the selfish node has low availability. Under this setting, our concrete contributions are the following: (1) AVMEM allows arbitrary classes of application-specified predicates to create the membership relationships in the overlay. To prevent selfish nodes from exploiting the system, we focus on predicates that are random and consistent. In other words, whether a given node y is a neighbor of a given node x is decided by a consistent and probabilistic predicate, dependent solely on the identifiers and availabilities of these two nodes, without using any external inputs. (2) AVMEM protocols discover and maintain the overlay spanned by the application-specified AVMEM predicate in a scalable and fast manner. (3) We use AVMEM to execute important availability-based management operations, focusing on range-anycast, range-multicast, threshold-anycast, and threshold-multicast. AVMEM works well in the presence of selfish nodes, scales to thousands of nodes, and executes each of the targeted operations quickly and reliably. Our evaluation is driven by real-life churn traces from the Overnet p2p system, and shows that AVMEM works well in practical settings.
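The "consistent and probabilistic" predicate idea can be sketched with a hash: derive a deterministic pseudo-random value from the two node identifiers alone, so every node computes the same answer, then compare it against a threshold that depends on the two availabilities. The specific threshold formula below is an illustrative assumption; AVMEM's actual predicates are application-specified.

```python
import hashlib

def consistent_coin(x_id, y_id):
    """Deterministic pseudo-random value in [0, 1) derived solely from
    the two node identifiers, so the predicate is consistent: any node
    evaluating it for the same pair gets the same result."""
    h = hashlib.sha256(f"{x_id}|{y_id}".encode()).digest()
    return int.from_bytes(h[:8], "big") / 2**64

def is_neighbor(x_id, x_avail, y_id, y_avail, scale=0.5):
    """One plausible predicate shape (an assumption, not AVMEM's): the
    link probability grows with the product of the two availabilities,
    so a low-availability selfish node cannot force itself onto many
    neighbor lists by lying about external inputs -- there are none."""
    return consistent_coin(x_id, y_id) < scale * x_avail * y_avail
```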


workshop on hot topics in operating systems | 2009

On availability of intermediate data in cloud computations

Steven Y. Ko; Imranul Hoque; Brian Cho; Indranil Gupta


Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining | 2012

Delta-SimRank computing on MapReduce

Liangliang Cao; Brian Cho; Hyun Duk Kim; Zhen Li; Min-Hsuan Tsai; Indranil Gupta


Archive | 2013

Natjam: Eviction Policies For Supporting Priorities and Deadlines in Mapreduce Clusters

Indranil Gupta; Brian Cho; Muntasir Raihan Rahman; Tej Chajed; Cristina L. Abad; Nathan Roberts; Philbert Lin


usenix annual technical conference | 2012

Surviving congestion in geo-distributed storage systems

Brian Cho; Marcos Kawazoe Aguilera

Collaboration


Dive into Brian Cho's collaboration.

Top Co-Authors


Cristina L. Abad

Escuela Superior Politecnica del Litoral
