Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Mayur Datar is active.

Publication


Featured researches published by Mayur Datar.


international world wide web conferences | 2007

Google news personalization: scalable online collaborative filtering

Abhinandan S. Das; Mayur Datar; Ashutosh Garg; Shyamsundar Rajaram

Several approaches to collaborative filtering have been studied but seldom have studies been reported for large (several millionusers and items) and dynamic (the underlying item set is continually changing) settings. In this paper we describe our approach to collaborative filtering for generating personalized recommendations for users of Google News. We generate recommendations using three approaches: collaborative filtering using MinHash clustering, Probabilistic Latent Semantic Indexing (PLSI), and covisitation counts. We combine recommendations from different algorithms using a linear model. Our approach is content agnostic and consequently domain independent, making it easily adaptable for other applications and languages with minimal effort. This paper will describe our algorithms and system setup in detail, and report results of running the recommendations engine on Google News.


Data Streams - Models and Algorithms | 2007

Load Shedding in Data Stream Systems

Brian Babcock; Mayur Datar; Rajeev Motwani

Systems for processing continuous monitoring queries over data streams must be adaptive because data streams are often bursty and data characteristics may vary over time. In this chapter, we focus on one particular type of adaptivity: the ability to gracefully degrade performance via “load shedding” (dropping unprocessed tuples to reduce system load) when the demands placed on the system cannot be met in full given available resources. Focusing on aggregation queries, we present algorithms that determine at what points in a query plan should load shedding be performed and what amount of load should be shed at each point in order to minimize the degree of inaccuracy introduced into query answers. We also discuss strategies for load shedding for other types of queries (set-valued queries, join queries, and classification queries).


Data Stream Management | 2007

The Sliding-Window Computation Model and Results

Mayur Datar; Rajeev Motwani

The sliding-window model of computation is motivated by the assumption that, in certain data-stream processing applications, recent data is more useful and pertinent than older data. In such cases, we would like to answer questions about the data only over the last N most recent data elements (N is a parameter). We formalize this model of computation and answer questions about how much space and computation time is required to solve certain problems under the sliding-window model.


national conference on artificial intelligence | 2006

Comparative experiments on sentiment classification for online product reviews

Hang Cui; Vibhu Mittal; Mayur Datar


Archive | 2007

Annotation framework for video

Mayur Datar; Ashutosh Garg; Vibhu Mittal


Archive | 2004

Method and system for finding and aggregating reviews for a product

Jan Matthias Ruhl; Mayur Datar


Archive | 2005

Scalable user clustering based on set similarity

Mayur Datar; Ashutosh Garg


Archive | 2005

Method, system and graphical user interface for providing reviews for a product

Jan Matthias Ruhl; Mayur Datar; Jessica Yoko Wai-min Lee


Archive | 2006

Automatically generating ads and ad-serving index

Mayur Datar; Ashutosh Garg


foundations of computer science | 2004

On the streaming model augmented with a sorting primitive

Gagan Aggarwal; Mayur Datar; Sridhar Rajagopalan; Matthias Ruhl

Researchain Logo
Decentralizing Knowledge