Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Chaitanya Mishra is active.

Publication


Featured researches published by Chaitanya Mishra.


very large data bases | 2010

MRShare: sharing across multiple queries in MapReduce

Tomasz Nykiel; Michalis Potamias; Chaitanya Mishra; George Kollios; Nick Koudas

Large-scale data analysis lies in the core of modern enterprises and scientific research. With the emergence of cloud computing, the use of an analytical query processing infrastructure (e.g., Amazon EC2) can be directly mapped to monetary value. MapReduce has been a popular framework in the context of cloud computing, designed to serve long running queries (jobs) which can be processed in batch mode. Taking into account that different jobs often perform similar work, there are many opportunities for sharing. In principle, sharing similar work reduces the overall amount of work, which can lead to reducing monetary charges incurred while utilizing the processing infrastructure. In this paper we propose a sharing framework tailored to MapReduce. Our framework, MRShare, transforms a batch of queries into a new batch that will be executed more efficiently, by merging jobs into groups and evaluating each group as a single query. Based on our cost model for MapReduce, we define an optimization problem and we provide a solution that derives the optimal grouping of queries. Experiments in our prototype, built on top of Hadoop, demonstrate the overall effectiveness of our approach and substantial savings.


ACM Transactions on Database Systems | 2014

Sharing across Multiple MapReduce Jobs

Tomasz Nykiel; Michalis Potamias; Chaitanya Mishra; George Kollios; Nick Koudas

Large-scale data analysis lies in the core of modern enterprises and scientific research. With the emergence of cloud computing, the use of an analytical query processing infrastructure can be directly associated with monetary cost. MapReduce has been a popular framework in the context of cloud computing, designed to serve long-running queries (jobs) which can be processed in batch mode. Taking into account that different jobs often perform similar work, there are many opportunities for sharing. In principle, sharing similar work reduces the overall amount of work, which can lead to reducing monetary charges for utilizing the processing infrastructure. In this article we present a sharing framework tailored to MapReduce, namely, <tt>MRShare</tt>. Our framework, <tt>MRShare</tt>, transforms a batch of queries into a new batch that will be executed more efficiently, by merging jobs into groups and evaluating each group as a single query. Based on our cost model for MapReduce, we define an optimization problem and we provide a solution that derives the optimal grouping of queries. Given the query grouping, we merge jobs appropriately and submit them to MapReduce for processing. A key property of <tt>MRShare</tt> is that it is independent of the MapReduce implementation. Experiments with our prototype, built on top of Hadoop, demonstrate the overall effectiveness of our approach. <tt>MRShare</tt> is primarily designed for handling I/O-intensive queries. However, with the development of high-level languages operating on top of MapReduce, user queries executed in this model become more complex and CPU intensive. Commonly, executed queries can be modeled as evaluating pipelines of CPU-expensive filters over the input stream. Examples of such filters include, but are not limited to, index probes, or certain types of joins. In this article we adapt some of the standard techniques for filter ordering used in relational and stream databases, propose their extensions, and implement them through <tt>MRAdaptiveFilter</tt>, an extension of <tt>MRShare</tt> for expensive filter ordering tailored to MapReduce, which allows one to handle both single- and batch-query execution modes. We present an experimental evaluation that demonstrates additional benefits of <tt>MRAdaptiveFilter</tt>, when executing CPU-intensive queries in <tt>MRShare</tt>.


Archive | 2010

Facilitating interaction among users of a social network

Spencer G. Ahrens; Cameron Marlow; Lars Backstrom; Chaitanya Mishra


Archive | 2011

Personalizing a web page outside of a social networking system with content from the social networking system

Mark E. Zuckerberg; Ray C. He; Spencer G. Ahrens; Yofay Kari Lee; Chaitanya Mishra; Austin Haugen; Xin Liu; Michael Steven Vernal


Archive | 2012

Filtering Structured Search Queries Based on Privacy Settings

Michael Curtiss; Chaitanya Mishra


Archive | 2012

Personalizing A Web Page Outside Of A Social Networking System With Recommendations for Content From The Social Networking System

Mark E. Zuckerberg; Ray C. He; Spencer G. Ahrens; Yofay Kari Lee; Chaitanya Mishra; Austin Haugen; Xin Liu; Michael Steven Vernal


Archive | 2014

Using Inverse Operators for Queries on Online Social Networks

Rajat Raina; Kihyuk Hong; Sriram Sankar; Kittipat Virochsiri; Michael Curtiss; Chaitanya Mishra


Archive | 2012

Personalizing a web page outside of a social networking system with content from the social networking system that includes user actions

Mark E. Zuckerberg; Ray C. He; Spencer G. Ahrens; Yofay Kari Lee; Chaitanya Mishra; Austin Haugen; Xin Liu; Michael Steven Vernal


Archive | 2012

Personalizing a web page outside of a social networking system with content from the social networking system selected based on global information

Mark E. Zuckerberg; Ray C. He; Spencer G. Ahrens; Yofay Kari Lee; Chaitanya Mishra; Austin Haugen; Xin Liu; Michael Steven Vernal


Archive | 2017

FACILITATION OF INTERACTION BETWEEN USERS OF SOCIAL NETWORK

Spencer G. Ahrens; Cameron Marlow; Lars Backstrom; Chaitanya Mishra

Collaboration


Dive into the Chaitanya Mishra's collaboration.

Researchain Logo
Decentralizing Knowledge