Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Shay Hummel is active.

Publication


Featured researches published by Shay Hummel.


conference on information and knowledge management | 2012

Back to the roots: a probabilistic framework for query-performance prediction

Oren Kurland; Anna Shtok; Shay Hummel; Fiana Raiber; David Carmel; Ofri Rom

The query-performance prediction task is estimating the effectiveness of a search performed in response to a query when no relevance judgments are available. Although there exist many effective prediction methods, these differ substantially in their basic principles, and rely on diverse hypotheses about the characteristics of effective retrieval. We present a novel fundamental probabilistic prediction framework. Using the framework, we derive and explain various previously proposed prediction methods that might seem completely different, but turn out to share the same formal basis. The derivations provide new perspectives on several predictors (e.g., Clarity). The framework is also used to devise new prediction approaches that outperform the state-of-the-art.


international conference on the theory of information retrieval | 2011

A unified framework for post-retrieval query-performance prediction

Oren Kurland; Anna Shtok; David Carmel; Shay Hummel

The query-performance prediction task is estimating the effectiveness of a search performed in response to a query in lack of relevance judgments. Post-retrieval predictors analyze the result list of top-retrieved documents. While many of these previously proposed predictors are supposedly based on different principles, we show that they can actually be derived from a novel unified prediction framework that we propose. The framework is based on using a pseudo effective and/or ineffective ranking as reference comparisons to the ranking at hand, the quality of which we want to predict. Empirical exploration provides support to the underlying principles, and potential merits, of our framework.


international world wide web conferences | 2016

On the Retrieval of Wikipedia Articles Containing Claims on Controversial Topics

Haggai Roitman; Shay Hummel; Ella Rabinovich; Benjamin Sznajder; Noam Slonim; Ehud Aharoni

This work presents a novel claim-oriented document retrieval task. For a given controversial topic, relevant articles containing claims that support or contest the topic are retrieved from a Wikipedia corpus. For that, a two-step retrieval approach is proposed. At the first step, an initial pool of articles that are relevant to the topic are retrieved using state-of-the-art retrieval methods. At the second step, articles in the initial pool are re-ranked according to their potential to contain as many relevant claims as possible using several claim discovery features. Hence, the second step aims at maximizing the overall claim recall of the retrieval system. Using a recently published claims benchmark, the proposed retrieval approach is demonstrated to provide more relevant claims compared to several other retrieval alternatives.


international joint conference on natural language processing | 2015

TR9856: A Multi-word Term Relatedness Benchmark

Ran Levy; Liat Ein-Dor; Shay Hummel; Ruty Rinott; Noam Slonim

Measuring word relatedness is an important ingredient of many NLP applications. Several datasets have been developed in order to evaluate such measures. The main drawback of existing datasets is the focus on single words, although natural language contains a large proportion of multiword terms. We propose the new TR9856 dataset which focuses on multi-word terms and is significantly larger than existing datasets. The new dataset includes many real world terms such as acronyms and named entities, and further handles term ambiguity by providing topical context for all term pairs. We report baseline results for common relatedness methods over the new data, and exploit its magnitude to demonstrate that a combination of these methods outperforms each individual method.


international acm sigir conference on research and development in information retrieval | 2014

Using the cross-entropy method to re-rank search results

Haggai Roitman; Shay Hummel; Oren Kurland

We present a novel unsupervised approach to re-ranking an initially retrieved list. The approach is based on the Cross Entropy method applied to permutations of the list, and relies on performance prediction. Using pseudo predictors we establish a lower bound on the prediction quality that is required so as to have our approach significantly outperform the original retrieval. Our experiments serve as a proof of concept demonstrating the considerable potential of the proposed approach. A case in point, only a tiny fraction of the huge space of permutations needs to be explored to attain significant improvements over the original retrieval.


intelligent user interfaces | 2014

Microcosm: visual discovery, exploration and analysis of social communities

Haggai Roitman; Ariel Raviv; Shay Hummel; Shai Erera; David Konopniki

Social communities play an important role in many domains. While a lot of attention has been given to developing efficient methods for detecting and analyzing social communities, it still remains a great challenge to provide intuitive search interfaces for end-users who wish to discover and explore such communities. Trying to fill the gaps, in this demonstration we present Microcosm: a holistic solution for visual discovery, exploration and analysis of social communities.


international acm sigir conference on research and development in information retrieval | 2014

A fusion approach to cluster labeling

Haggai Roitman; Shay Hummel; Michal Shmueli-Scheuer

We present a novel approach to the cluster labeling task using fusion methods. The core idea of our approach is to weigh labels, suggested by any labeler, according to the estimated labelers decisiveness with respect to each of its suggested labels. We hypothesize that, a cluster labelers labeling choice for a given cluster should remain stable even in the presence of a slightly incomplete cluster data. Using state-of-the-art cluster labeling and data fusion methods, evaluated over a large data collection of clusters, we demonstrate that, overall, the cluster labeling fusion methods that further consider the labelers decisiveness provide the best labeling performance.


international acm sigir conference on research and development in information retrieval | 2012

Clarity re-visited

Shay Hummel; Anna Shtok; Fiana Raiber; Oren Kurland; David Carmel


international conference on computational linguistics | 2014

Claims on demand -- an initial demonstration of a system for automatic detection and polarity identification of context dependent claims in massive corpora

Noam Slonim; Ehud Aharoni; Carlos Alzate; Roy Bar-Haim; Yonatan Bilu; Lena Dankin; Iris Eiron; Daniel Hershcovich; Shay Hummel; Mitesh M. Khapra; Tamar Lavee; Ran Levy; Paul Matchen; Anatoly Polnarov; Vikas Raykar; Ruty Rinott; Amrita Saha; Naama Zwerdling; David Konopnicki; Dan Gutfreund


empirical methods in natural language processing | 2017

Unsupervised corpus-wide claim detection.

Ran Levy; Shai Gretz; Benjamin Sznajder; Shay Hummel; Ranit Aharonov; Noam Slonim

Collaboration


Dive into the Shay Hummel's collaboration.

Top Co-Authors

Avatar

Oren Kurland

Technion – Israel Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Anna Shtok

Technion – Israel Institute of Technology

View shared research outputs
Researchain Logo
Decentralizing Knowledge