Sergey Brin
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Sergey Brin.
international world wide web conferences | 2003
Monika Rauch Henzinger; Bay-Wei Chang; Brian Milch; Sergey Brin
Many daily activities present information in the form of a stream of text, and often people can benefit from additional information on the topic discussed. TV broadcast news can be treated as one such stream of text; in this paper we discuss finding news articles on the web that are relevant to news currently being broadcast.We evaluated a variety of algorithms for this problem, looking at the impact of inverse document frequency, stemming, compounds, history, and query length on the relevance and coverage of news articles returned in real time during a broadcast. We also evaluated several postprocessing techniques for improving the precision, including reranking using additional terms, reranking by document similarity, and filtering on document similarity. For the best algorithm, 84%-91% of the articles found were relevant, with at least 64% of the articles being on the exact topic of the broadcast. In addition, a relevant article was found for at least 70% of the topics.
IEEE Transactions on Knowledge and Data Engineering | 2003
Sergey Brin; Rajeev Rastogi; Kyuseok Shim
Association rules are useful for determining correlations between attributes of a relation and have applications in the marketing, financial, and retail sectors. Furthermore, optimized association rules are an effective way to focus on the most interesting characteristics involving certain attributes. Optimized association rules are permitted to contain uninstantiated attributes and the problem is to determine instantiations such that either the support, confidence, or gain of the rule is maximized. In this paper, we generalize the optimized gain association rule problem by permitting rules to contain disjunctions over uninstantiated numeric attributes. Our generalized association rules enable us to extract more useful information about seasonal and local patterns involving the uninstantiated attribute. For rules containing a single numeric attribute, we present an algorithm with linear complexity for computing optimized gain rules. Furthermore, we propose a bucketing technique that can result in a significant reduction in input size by coalescing contiguous values without sacrificing optimality. We also present an approximation algorithm based on dynamic programming for two numeric attributes. Using recent results on binary space partitioning trees, we show that the approximations are within a constant factor of the optimal optimized gain rules. Our experimental results with synthetic data sets for a single numeric attribute demonstrate that our algorithm scales up linearly with the attributes domain size as well as the number of disjunctions. In addition, we show that applying our optimized rule framework to a population survey real-life data set enables us to discover interesting underlying correlations among the attributes.
Archive | 2000
Benjamin Thomas Smith; Sergey Brin; Sanjay Ghemawat; Christopher D. Manning
Archive | 2006
Alexander Franz; Monika H. Henzinger; Sergey Brin; Brian Milch
Archive | 2000
Benjamin Thomas Smith; Sergey Brin; Sanjay Ghemawat; John A. Bauer
Archive | 2003
Simon Tong; Mark Pearson; Sergey Brin
Archive | 2011
Sergey Brin
Archive | 2003
Sergey Brin; Benedict A. Gomes; Simon Tong
Archive | 2013
Eric Peeters; Eric Teller; William Graham Patrick; Sergey Brin
Archive | 2011
Sergey Brin; Babak Amirparviz