Roshan Sumbaly | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Roshan Sumbaly is active.

Explore More

Publication

Featured researches published by Roshan Sumbaly.

international conference on data engineering | 2012

Data Infrastructure at LinkedIn

Aditya Auradkar; Chavdar Botev; Shirshanka Das; Dave De Maagd; Alex Feinberg; Phanindra Ganti; Lei Gao; Bhaskar Ghosh; Kishore Gopalakrishna; Brendan Harris; Joel Koshy; Kevin Krawez; Jay Kreps; Shi Lu; Sunil Nagaraj; Neha Narkhede; Sasha Pachev; Igor Perisic; Lin Qiao; Tom Quiggle; Jun Rao; Bob Schulman; Abraham Sebastian; Oliver Seeliger; Adam Silberstein; BBoris Shkolnik; Chinmay Soman; Roshan Sumbaly; Kapil Surlaker; Sajid Topiwala

Linked In is among the largest social networking sites in the world. As the company has grown, our core data sets and request processing requirements have grown as well. In this paper, we describe a few selected data infrastructure projects at Linked In that have helped us accommodate this increasing scale. Most of those projects build on existing open source projects and are themselves available as open source. The projects covered in this paper include: (1) Voldemort: a scalable and fault tolerant key-value store, (2) Data bus: a framework for delivering database changes to downstream applications, (3) Espresso: a distributed data store that supports flexible schemas and secondary indexing, (4) Kafka: a scalable and efficient messaging system for collecting various user activity events and log data.

measurement and modeling of computer systems | 2013

Root cause detection in a service-oriented architecture

Myunghwan Kim; Roshan Sumbaly; Sam Shah

Large-scale websites are predominantly built as a service-oriented architecture. Here, services are specialized for a certain task, run on multiple machines, and communicate with each other to serve a users request. An anomalous change in a metric of one service can propagate to other services during this communication, resulting in overall degradation of the request. As any such degradation is revenue impacting, maintaining correct functionality is of paramount concern: it is important to find the root cause of any anomaly as quickly as possible. This is challenging because there are numerous metrics or sensors for a given service, and a modern website is usually composed of hundreds of services running on thousands of machines in multiple data centers. This paper introduces MonitorRank, an algorithm that can reduce the time, domain knowledge, and human effort required to find the root causes of anomalies in such service-oriented architectures. In the event of an anomaly, MonitorRank provides a ranked order list of possible root causes for monitoring teams to investigate. MonitorRank uses the historical and current time-series metrics of each sensor as its input, along with the call graph generated between sensors to build an unsupervised model for ranking. Experiments on real production outage data from LinkedIn, one of the largest online social networks, shows a 26% to 51% improvement in mean average precision in finding root causes compared to baseline and current state-of-the-art methods.

conference on information and knowledge management | 2014

Hotspot Detection in a Service-Oriented Architecture

Pranay Anchuri; Roshan Sumbaly; Sam Shah

Large-scale websites are predominantly built as a service-oriented architecture. Here, services are specialized for a certain task, run on multiple machines, and communicate with each other to serve a users request. Reducing latency and improving the cost to serve is quite important, but optimizing this service call graph is particularly challenging due to the volume of data and the graphs non-uniform and dynamic nature. In this paper, we present a framework to detect hotspots in a service-oriented architecture. The framework is general, in that it can handle arbitrary objective functions. We show that finding the optimal set of hotspots for a metric, such as latency, is NP-complete and propose a greedy algorithm by relaxing some constraints. We use a pattern mining algorithm to rank hotspots based on the impact and consistency. Experiments on real world service call graphs from LinkedIn, the largest online professional social network, show that our algorithm consistently outperforms baseline methods.

file and storage technologies | 2012