Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Shyam Kapur is active.

Publication


Featured researches published by Shyam Kapur.


knowledge discovery and data mining | 2004

Document preprocessing for naive Bayes classification and clustering with mixture of multinomials

Dmitry Pavlov; Ramnath Balasubramanyan; Byron Dom; Shyam Kapur; Jignashu Parikh

Naive Bayes classifier has long been used for text categorization tasks. Its sibling from the unsupervised world, the probabilistic mixture of multinomial models, has likewise been successfully applied to text clustering problems. Despite the strong independence assumptions that these models make, their attractiveness come from low computational cost, relatively low memory consumption, ability to handle heterogeneous features and multiple classes, and often competitiveness with the top of the line models. Recently, there has been several attempts to alleviate the problems of Naive Bayes by performing heuristic feature transformations, such as IDF, normalization by the length of the documents and taking the logarithms of the counts. We justify the use of these techniques and apply them to two problems: classification of products in Yahoo! Shopping and clustering the vectors of collocated terms in user queries to Yahoo! Search. The experimental evaluation allows us to draw conclusions about the promise that these transformations carry with regard to alleviating the strong assumptions of the multinomial model.


international acm sigir conference on research and development in information retrieval | 2006

Unity: relevance feedback using user query logs

Jignashu Parikh; Shyam Kapur

The exponential growth of the Web and the increasing ability of web search engines to index data have led to a problem of plenty. The number of results returned per query is typically in the order of millions of documents for many common queries. Although there is the benefit of added coverage for every query, the problem of ranking these documents and giving the best results gets worse. The problem is even more difficult in case of temporal and ambiguous queries. We try to address this problem using feedback from user query logs. We leverage a technology called Units for generating query refinements which are shown as Also try queries on Yahoo! Search. We consider these refinements as sub-concepts which help define user intent and use them to improve search relevance. The results obtained via live testing on Yahoo! Search are encouraging.


International Journal of Services Operations and Informatics | 2009

Online advertisement campaign optimisation

Weiguo Liu; Shi Zhong; Mayank Chaudhary; Shyam Kapur

Like any marketing campaigns, online advertisement campaigns need to be monitored, analysed and optimised. The quantitative methods are more crucial to online campaigns because of their dynamic pricing and highly interactive nature. Not only can marketing effectiveness be measured almost instantly in terms of measures such as click through rate and/or the acquisition/conversion rate, but a rich set of user data can also be collected and used by learning algorithms. The huge sets of dynamic data raise many challenging problems. In order to run a successful campaign, any serious advertiser, publisher or ad exchange network need a system that combines forecasting, data mining and optimisation techniques. In this paper, we propose such a methodology for a systematic analysis of the relevant problems and describe techniques that work on real world data as satisfactory solutions.


international conference on service operations and logistics, and informatics | 2007

Online Advertisement Campaign Optimization

Weiguo Liu; Shi Zhong; Mayank Chaudhary; Shyam Kapur

Like any marketing campaigns, online advertisement campaigns need to be monitored, analyzed and optimized. It is more so for online campaigns because online advertisements are usually sold in auction style. Prices can change very dynamically; the creatives, the landing pages and the targeting profiles can all be changed frequently to improve the effectiveness of a campaign. Also, the effectiveness can be measured almost instantly, usually in terms of click through rate and/or the acquisition/conversion rate. It raises many challenging problems in forecasting, data mining and optimization and entails an optimization system for any serious advertisers, publishers or ad networks.


Archive | 2003

Systems and methods for generating concept units from search queries

Shyam Kapur; Deepa Joshi


Archive | 2003

Systems and methods for search query processing using trend analysis

Shyam Kapur


Archive | 2004

Search processing with automatic categorization of queries

Shyam Kapur; Jignashu Parikh; Deepa Joshi


Archive | 2004

Universal search interface systems and methods

Shyam Kapur


Archive | 2005

Systems and methods for managing and using multiple concept networks for assisted search processing

Shyam Kapur; Jignashu Parikh; Deepa Joshi


Archive | 2014

Search system using search subdomain and hints to subdomains in search query statements and sponsored results on a subdomain-by-subdomain basis

Ali Diab; Scott A. Gatz; Shyam Kapur; David Ku; Chuck Kung; Phu Hoang; Qui Lu; Lynne Pogue; Yuan Kui Shen; Norman Shi; Thai Tran; Eckart Walther; Jeff Weiner

Collaboration


Dive into the Shyam Kapur's collaboration.

Researchain Logo
Decentralizing Knowledge