Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Emre Velipasaoglu is active.

Publication


Featured researches published by Emre Velipasaoglu.


Information Retrieval | 2011

Intent-based diversification of web search results: metrics and algorithms

Olivier Chapelle; Shihao Ji; Ciya Liao; Emre Velipasaoglu; Larry Lai; Su-Lin Wu

We study the problem of web search result diversification in the case where intent based relevance scores are available. A diversified search result will hopefully satisfy the information need of user-L.s who may have different intents. In this context, we first analyze the properties of an intent-based metric, ERR-IA, to measure relevance and diversity altogether. We argue that this is a better metric than some previously proposed intent aware metrics and show that it has a better correlation with abandonment rate. We then propose an algorithm to rerank web search results based on optimizing an objective function corresponding to this metric and evaluate it on shopping related queries.


international acm sigir conference on research and development in information retrieval | 2012

Learning to suggest: a machine learning framework for ranking query suggestions

Umut Ozertem; Olivier Chapelle; Pinar Donmez; Emre Velipasaoglu

We consider the task of suggesting related queries to users after they issue their initial query to a web search engine. We propose a machine learning approach to learn the probability that a user may find a follow-up query both useful and relevant, given his initial query. Our approach is based on a machine learning model which enables us to generalize to queries that have never occurred in the logs as well. The model is trained on co-occurrences mined from the search logs, with novel utility and relevance models, and the machine learning step is done without any labeled data by human judges. The learning step allows us to generalize from the past observations and generate query suggestions that are beyond the past co-occurred queries. This brings significant gains in coverage while yielding modest gains in relevance. Both offline (based on human judges) and online (based on millions of user interactions) evaluations demonstrate that our approach significantly outperforms strong baselines.


conference on information and knowledge management | 2006

Performance thresholding in practical text classification

Hinrich Schütze; Emre Velipasaoglu; Jan O. Pedersen

In practical classification, there is often a mix of learnable and unlearnable classes and only a classifier above a minimum performance threshold can be deployed. This problem is exacerbated if the training set is created by active learning. The bias of actively learned training sets makes it hard to determine whether a class has been learned. We give evidence that there is no general and efficient method for reducing the bias and correctly identifying classes that have been learned. However, we characterize a number of scenarios where active learning can succeed despite these difficulties.


international world wide web conferences | 2011

Identifying primary content from web pages and its application to web search ranking

Emre Velipasaoglu

Web pages are usually highly structured documents. In some documents, content with different functionality is laid out in blocks, some merely supporting the main discourse. In other documents, there may be several blocks of unrelated main content. Indexing a web page as if it were a linear document can cause problems because of the diverse nature of its content. If the retrieval function treats all blocks of the web page equally without attention to structure, it may lead to irrelevant query matches. In this paper, we describe how content quality of different blocks of a web page can be utilized to improve a retrieval function. Our method is based on segmenting a web page into semantically coherent blocks and learning a predictor of segment content quality. We also describe how to use segment content quality estimates as weights in the BM25F formulation. Experimental results show our method improves relevance of retrieved results by as much as 4.5% compared to BM25F that treats the body of a web page as a single section, and by a larger margin of over 9% for difficult queries.


conference on information and knowledge management | 2011

Suggestion set utility maximization using session logs

Umut Ozertem; Emre Velipasaoglu; Larry Lai

Assistance technology is undoubtedly one of the important elements in the commercial search engines, and routing the user towards the right direction throughout the search sessions is of great importance for providing a good search experience. Most search assistance methods in the literature that involve query generation, query expansion and other techniques consider each suggestion candidate individually, which implies an independence assumption. We challenge this independence assumption and give a method to maximize the utility of a given set of suggestions. For this, we will define a measure of conditional utility for query pairs using query-URL bipartite graphs based on the session logs (clicked and viewed URLs). Afterwards, we remove the redundant queries from the suggestion set using a greedy algorithm to be able to replace them with more useful ones. Both offline (based on user studies and session log analysis) and online (based on millions of user interactions) evaluations show that modeling the conditional utility and maximizing the utility of the set of queries (by eliminating redundant ones) significantly increases the effectiveness of the search assistance both for the presubmit and postsubmit modes.


international conference on user modeling, adaptation, and personalization | 2013

Building Rich User Search Queries Profiles

Elif Aktolga; Alpa Jain; Emre Velipasaoglu

It is well-known that for a variety of search tasks involving queries more relevant results can be presented if they are personalized according to a user’s interests and search behavior. This can be achieved with user-dependent, rich web search queries profiles. These are typically built as part of a specific search personalization task so that it is unclear which characteristics of queries are most effective for modeling the user-query relationship in general. In this paper, we explore various approaches for explicitly modeling this user-query relationship independently of other search components. Our models employ generative models in layers in a prediction task. The results show that the best signals for modeling the user-query relationship come from the given query’s terms and entities together with information from related entities and terms, yielding a relative improvement of up to 24.5% in MRR and Success over the baseline methods.


international world wide web conferences | 2010

Web search engine metrics: (direct metrics to measure user satisfaction)

Ali Dasdan; Kostas Tsioutsiouliklis; Emre Velipasaoglu

Search engines are important resources for finding information on the Web. They are also important for publishers and advertisers to present their content to users. Thus, user satisfaction is key and must be quantified. In this tutorial, we give a practical review of web search metrics from a user satisfaction point of view. We cover metrics for relevance, comprehensiveness, coverage, diversity, discovery freshness, content freshness, and presentation. We will also describe how these metrics can be mapped to proxy metrics for the stages of a generic search engine pipeline. The practitioners can apply these metrics readily and the researchers can get motivation for new problems to work on, especially in formalizing and refining metrics.


international acm sigir conference on research and development in information retrieval | 2007

Improving active learning recall via disjunctive boolean constraints

Emre Velipasaoglu; Hinrich Schütze; Jan O. Pedersen

Active learning efficiently hones in on the decision boundary between relevant and irrelevant documents, but in the process can miss entire clusters of relevant documents, yielding classifiers with low recall. In this paper, we propose a method to increase active learning recall by constraining sampling to a document subset rich in relevant examples.


Archive | 2009

Automatic classification of segmented portions of web pages

Lei Duan; Fan Li; Emre Velipasaoglu; Swapnil Hajela; Deepayan Chakrabarti


Archive | 2008

Automatic visual segmentation of webpages

Deepayan Chakrabarti; Manav Ratan Mital; Swapnil Hajela; Emre Velipasaoglu

Collaboration


Dive into the Emre Velipasaoglu's collaboration.

Researchain Logo
Decentralizing Knowledge