Glen Jeh
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Glen Jeh.
knowledge discovery and data mining | 2002
Glen Jeh; Jennifer Widom
The problem of measuring similarity of objects arises in many applications, and many domain-specific measures have been developed, e.g., matching text across documents or computing overlap among item-sets. We propose a complementary approach, applicable in any domain with object-to-object relationships, that measures similarity of the structural context in which objects occur, based on their relationships with other objects. Effectively, we compute a measure that says two objects are similar if they are related to similar objects: This general similarity measure, called SimRank, is based on a simple and intuitive graph-theoretic model. For a given domain, SimRank can be combined with other domain-specific similarity measures. We suggest techniques for efficient computation of SimRank scores, and provide experimental results on two application domains showing the computational feasibility and effectiveness of our approach.
international world wide web conferences | 2003
Glen Jeh; Jennifer Widom
Recent web search techniques augment traditional text matching with a global notion of importance based on the linkage structure of the web, such as in Googles PageRank algorithm. For more refined searches, this global notion of importance can be specialized to create personalized views of importance--for example, importance scores can be biased according to a user-specified set of initially-interesting pages. Computing and storing all possible personalized views in advance is impractical, as is computing personalized views at query time, since the computation of each view requires an iterative computation over the web graph. We present new graph-theoretical results, and a new technique based on these results, that encode personalized views as partial vectors. Partial vectors are shared across multiple personalized views, and their computation and storage costs scale well with the number of views. Our approach enables incremental computation, so that the construction of personalized views from partial vectors is practical at query time. We present efficient dynamic programming algorithms for computing partial vectors, an algorithm for constructing personalized views from partial vectors, and experimental results demonstrating the effectiveness and scalability of our techniques.
international world wide web conferences | 2006
Beverly Yang; Glen Jeh
Major search engines currently use the history of a users actions (e.g., queries, clicks) to personalize search results. In this paper, we present a new personalized service, query-specific web recommendations (QSRs), that retroactively answers queries from a users history as new results arise. The QSR system addresses two important subproblems with applications beyond the system itself: (1) Automatic identification of queries in a users history that represent standing interests and unfulfilled needs. (2) Effective detection of interesting new results to these queries. We develop a variety of heuristics and algorithms to address these problems, and evaluate them through a study of Google history users. Our results strongly motivate the need for automatic detection of standing interests from a users history, and identifies the algorithms that are most useful in doing so. Our results also identify the algorithms, some which are counter-intuitive, that are most useful in identifying interesting new results for past queries, allowing us to achieve very high precision over our data set.
knowledge discovery and data mining | 2004
Glen Jeh; Jennifer Widom
Existing data mining algorithms on graphs look for nodes satisfying specific properties, such as specific notions of structural similarity or specific measures of link-based importance. While such analyses for predetermined properties can be effective in well-understood domains, sometimes identifying an appropriate property for analysis can be a challenge, and focusing on a single property may neglect other important aspects of the data. In this paper, we develop a foundation for mining the properties themselves. We present a theoretical framework defining the space of graph properties, a variety of mining queries enabled by the framework, techniques to handle the enormous size of the query space, and an experimental system called F-Miner that demonstrates the utility and feasibility of property mining.
Archive | 2004
Sepandar D. Kamvar; Taher H. Haveliwala; Glen Jeh
Archive | 2004
Taher H. Haveliwala; Glen Jeh; Sepandar D. Kamvar
Archive | 2012
Taher H. Haveliwala; Glen Jeh; Sepandar D. Kamvar
Archive | 2003
Taher H. Haveliwala; Sepandar D. Kamvar; Glen Jeh
Archive | 2004
Taher H. Haveliwala; Glen Jeh; Sepandar D. Kamvar
Archive | 2003
Sepandar D. Kamvar; Taher H. Haveliwala; Glen Jeh; Gene H. Golub