Howard R. Turtle | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Howard R. Turtle is active.

Explore More

Publication

Featured researches published by Howard R. Turtle.

international acm sigir conference on research and development in information retrieval | 1991

Evaluation of an inference network-based retrieval model

Howard R. Turtle; W. Bruce Croft

Network representations have been used in information retrieval since at least the early 1960’s. Networks have been used to support diverse retrieval functions, including browsing [38], document clustering [7], spreading activation search [4], support for multiple search strategies [11], and representation of user knowledge [27] or document content [40]. Recent work suggests that significant improvements in retrieval performance will require techniques that, in some sense “understand” the content of documents and queries [9, 43] and can be used to infer probable relationships between documents and queries. In this view, information retrieval is an inference or evidential reasoning process in which we estimate the probability that a user’s information need, expressed as one or more queries, is met given a document as “evidence.” Network representations show promise as mechanisms for inferring these kinds of relationships [4, 12].

international acm sigir conference on research and development in information retrieval | 1989

Inference Networks for Document Retrieval

Howard R. Turtle; W. Bruce Croft

The use of inference networks to support document retrieval is introduced. A network-basead retrieval model is described and compared to conventional probabilistic and Boolean models.

Laboratory Automation & Information Management | 1997

System of document representation retrieval by successive iterated probability sampling

Howard R. Turtle; Gerald J. Morton; F. Kinley Larntz

An information retrieval system based on probabilities that documents meet information needs. The frequency of occurrence of a representation in a collection of documents is estimated by identifying the frequency of occurrence of the representation in a sample of documents and calculating the difference between the maximum and minimum probable frequencies of occurrence of the representation in the collection. If the difference does not exceed a limit, a midpoint of the maximum and minimum probable frequencies is the estimated frequency of occurrence of the representation. Document distribution probabilities are optimized and probability thresholds are established for the identification of documents. An initial probability threshold is established and is adjusted as the probabilities are scored for documents in samples. The document result list is iteratively adjusted through the samples.

international acm sigir conference on research and development in information retrieval | 1991

The use of phrases and structured queries in information retrieval

W. Bruce Croft; Howard R. Turtle; David D. Lewis

Both phrases and Boolean queries have a long history in information retrieval, particularly in commercial systems. In previous work, Boolean queries have been used as a source of phrases for a statistical retrieval model, This work, like the majority of research on phrases, resulted in little improvement in retrieval effectiveness, In this paper, we describe an approach where phrases identified in natural language queries are used to build structured queries for a probabilistic retrieval model. Our results show that using phrases in this way can improve performance, and that phrases that are automatically extracted from a natural language query perform nearly as well as manually selected phrases.

Information Processing and Management | 1995

Query evaluation: strategies and optimizations

Howard R. Turtle; James Flood

This paper discusses the two major query evaluation strategies used in large text retrieval systems and analyzes the performance of these strategies. We then discuss several optimization techniques that can be used to reduce evaluation costs and present simulation results to compare the performance of these optimization techniques when evaluating natural language queries with a collection of full text legal materials.

international acm sigir conference on research and development in information retrieval | 2003

Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002

James Allan; Jay Aslam; Nicholas J. Belkin; Chris Buckley; James P. Callan; W. Bruce Croft; Susan T. Dumais; Norbert Fuhr; Donna Harman; David J. Harper; Djoerd Hiemstra; Thomas Hofmann; Eduard H. Hovy; Wessel Kraaij; John D. Lafferty; Victor Lavrenko; David Lewis; Liz Liddy; R. Manmatha; Andrew McCallum; Jay M. Ponte; John M. Prager; Dragomir R. Radev; Philip Resnik; Stephen E. Robertson; Ron G. Rosenfeld; Salim Roukos; Mark Sanderson; Richard M. Schwartz; Amit Singhal

Information retrieval (IR) research has reached a point where it is appropriate to assess progress and to define a research agenda for the next five to ten years. This report summarizes a discussion of IR research challenges that took place at a recent workshop. The attendees of the workshop considered information retrieval research in a range of areas chosen to give broad coverage of topic areas that engage information retrieval researchers. Those areas are retrieval models, cross-lingual retrieval, Web search, user modeling, filtering, topic detection and tracking, classification, summarization, question answering, metasearch, distributed retrieval, multimedia retrieval, information extraction, as well as testbed requirements for future work. The potential use of language modeling techniques in these areas was also discussed. The workshop identified major challenges within each of those areas. The following are recurring themes that ran throughout: • User and context sensitive retrieval • Multi-lingual and multi-media issues • Better target tasks • Improved objective evaluations • Substantially more labeled data • Greater variety of data sources • Improved formal models Contextual retrieval and global information access were identified as particularly important long-term challenges.

The Computer Journal | 1992

A comparison of text retrieval models

Howard R. Turtle; W. Bruce Croft

Many retrieval models have been proposed as the basis of text retrieval systems. The three main classes that have been investigated are the exact-match, vector space and probabilistic models. The retrieval effectiveness of strategies based on these models has been evaluated experimentally, but there has been little in the way of comparison in terms of their formal properties. In this paper we introduce a recent form of the probabilistic model based on inference networks, and show how the vector space and exact-match models can be described in this framework. Differences between these models can be explained as differences in the estimation of probabilities, both in the initial search and during relevance feedback.

acm conference on hypertext | 1989

A retrieval model incorporating hypertext links

W.B. Croft; Howard R. Turtle

FORMAL MODELS OF RETRIEVAL PROVIDE THE BASIS FOR RETRIEVING HYPERTEXT NODES IN RESPONSE TO EXPLICIT QUERIES. THIS RETRIEVAL FACILITY CAN BE INTEGRATED WITH THE TYPICAL BROWSING FACILITY FOR EFFECTIVE ACCESS TO LARGE DATABASES. IN THIS PAPER WE DESCRIBE A RETRIEVAL MODEL DEVELOPED FOR BIBLIOGRAPHIC INFORMATION RETRIEVAL AND SHOW HOW HYPERTEXT LINKS CAN BE INCORPORATED. THE MODEL TREATS RETRIEVAL AS A FORM OF INFERENCE IN WHICH MULTIPLE SOURCES OF EVIDENCE, INCLUDING HYPERTEXT LINKS, ARE COMBINED AND USED TO RANK THE RETRIEVED NODES. IMPLEMENTATION ASPECTS OF THE MODEL ARE ALSO DISCUSSED.

international acm sigir conference on research and development in information retrieval | 1994

Natural language vs. Boolean query evaluation: a comparison of retrieval performance

Howard R. Turtle

The results of experiments comparing the relative performance of natural language and Boolean query formulations are presented. The experiments show that on average a current generation natural language system provides better retrieval performance than expert searchers using a Boolean retrieval system when searching full-text legal materials. Methodological issues are reviewed and the effect of database size on query formulation strategy is discussed.

international acm sigir conference on research and development in information retrieval | 2005

Optimization strategies for complex queries

Trevor Strohman; Howard R. Turtle; W. Bruce Croft

Previous research into the efficiency of text retrieval systems has dealt primarily with methods that consider inverted lists in sequence; these methods are known as term-at-a-time methods. However, the literature for optimizing document-at-a-time systems remains sparse.We present an improvement to the max_score optimization, which is the most efficient known document-at-a-time scoring method. Like max_score, our technique, called term bounded max_score, is guaranteed to return exactly the same scores and documents as an unoptimized evaluation, which is particularly useful for query model research. We simulated our technique to explore the problem space, then implemented it in Indri, our large scale language modeling search engine. Tests with the GOV2 corpus on title queries show our method to be 23% faster than max_score alone, and 61% faster than our document-at-a-time baseline. Our optimized query times are competitive with conventional term-at-a-time systems on this years TREC Terabyte task.

Explore More