Mingfang Wu | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Mingfang Wu is active.

Explore More

Publication

Featured researches published by Mingfang Wu.

international acm sigir conference on research and development in information retrieval | 2009

Including summaries in system evaluation

Andrew Turpin; Falk Scholer; Kalvero Järvelin; Mingfang Wu; J. Shane Culpepper

In batch evaluation of retrieval systems, performance is calculated based on predetermined relevance judgements applied to a list of documents returned by the system for a query. This evaluation paradigm, however, ignores the current standard operation of search systems which require the user to view summaries of documents prior to reading the documents themselves. In this paper we modify the popular IR metrics MAP and P@10 to incorporate the summary reading step of the search process, and study the effects on system rankings using TREC data. Based on a user study, we establish likely disagreements between relevance judgements of summaries and of documents, and use these values to seed simulations of summary relevance in the TREC data. Re-evaluating the runs submitted to the TREC Web Track, we find the average correlation between system rankings and the original TREC rankings is 0.8 (Kendall τ), which is lower than commonly accepted for system orderings to be considered equivalent. The system that has the highest MAP in TREC generally remains amongst the highest MAP systems when summaries are taken into account, but other systems become equivalent to the top ranked system depending on the simulated summary relevance. Given that system orderings alter when summaries are taken into account, the small amount of effort required to judge summaries in addition to documents (19 seconds vs 88 seconds on average in our data) should be undertaken when constructing test collections.

text retrieval conference | 2001

Using clustering and classification approaches in interactive retrieval

Mingfang Wu; Michael Fuller; Ross Wilkinson

Satisfying non-trivial information needs involves collecting information from multiple resources, and synthesizing an answer that organizes that information. Traditional recall/precision-oriented information retrieval focuses on just one phase of that process: how to efficiently and effectively identify documents likely to be relevant to a specific, focused query. The TREC Interactive Track has as its goal the location of documents that pertain to different instances of a query topic, with no reward for duplicated coverage of topic instances. This task is similar to the task of organizing answer components into a complete answer. Clustering and classification are two mechanisms for organizing documents into groups. In this paper, we present an ongoing series of experiments that test the feasibility and effectiveness of using clustering and classification as an aid to instance retrieval and, ultimately, answer construction. Our results show that users prefer such structured presentations of candidate result set to a list-based approach. Assessment of the structured organizations based on the subjective judgement of the experiment subjects suggests that the structured organization can be more effective; however, assessment based on objective judgements shows mixed results. These results indicate that a full determination of the success of the approach depends on assessing the quality of the final answers generated by users, rather than on performance during the intermediate stages of answer construction.

international conference on user modeling, adaptation, and personalization | 2001

Generating Personal Travel Guides - And Who Wants Them?

Cécile Paris; Stephen Wan; Ross Wilkinson; Mingfang Wu

In this paper we describe a system that generates synthesized web pages as a travel guide through integrating a discourse planner with a document retrieval system. We then present our investigation on whether the guide generated by such a system is actually preferred by users over a more general guide.

adaptive hypermedia and adaptive web based systems | 2000

Generating Personal Travel Guides from Discourse Plans

Ross Wilkinson; Shijian Lu; François Paradis; Cécile Paris; Stephen Wan; Mingfang Wu

This paper describes a system that delivers travel guides tailored to individual needs. It does so by integrating a discourse planner with a system for querying the web and generating synthesised web pages using document prescriptions. We show by way of example how a user model can lead to a personal travel guide, and show what this might look like in different media. We briefly describe the investigations we are undertaking to determine the utility of such approaches.

international acm sigir conference on research and development in information retrieval | 2001

Searcher performance in question answering

Mingfang Wu; Michael Fuller; Ross Wilkinson

There are many tasks that require information finding. Some can be largely automated, and others greatly benefit from successful interaction between system and searcher. We are interested in the task of answering questions where some synthesis of information is required-the answer would not generally be given from a single passage of a single document. We investigate whether variation in the way a list of documents is delivered affected searcher performance in the question answering task. We will show that there is a significant difference in performance using a list customized to the task type, compared with a standard web-engine list. This indicates that paying attention to the task and the searcher interaction may provide substantial improvement in task performance.

adaptive hypermedia and adaptive web based systems | 2004

Myriad: An Architecture for Contextualized Information Retrieval and Delivery

Cécile Paris; Mingfang Wu; Keith Vander Linden; Matt Post; Shijian Lu

Users’ information needs are largely driven by the context in which they make their decisions. This context is dynamic. It includes the users’ characteristics, their current domain of application, the tasks they commonly perform and the device they are currently using. This context is also evolving. When one information need is satisfied, another is likely to emerge. An information access system must, therefore, be able to track this dynamic and evolving context, and exploit it to retrieve actionable information from appropriate sources and deliver it in a form suitable for the current situation. This paper presents a generic architecture that supports the construction of information retrieval and delivery systems that make use of context. The architecture, called Myriad, includes an adaptive virtual document planner, and explicit, dynamic representations of the user’s current context.

Journal of the Association for Information Science and Technology | 2012

Using anchor text for homepage and topic distillation search tasks

Mingfang Wu; David Hawking; Andrew Turpin; Falk Scholer

Past work suggests that anchor text is a good source of evidence that can be used to improve web searching. Two approaches for making use of this evidence include fusing search results from an anchor text representation and the original text representation based on a documents relevance score or rank position, and combining term frequency from both representations during the retrieval process. Although these approaches have each been tested and compared against baselines, different evaluations have used different baselines; no consistent work enables rigorous cross-comparison between these methods. The purpose of this work is threefold. First, we survey existing fusion methods of using anchor text in search. Second, we compare these methods with common testbeds and web search tasks, with the aim of identifying the most effective fusion method. Third, we try to correlate search performance with the characteristics of a test collection. Our experimental results show that the best performing method in each category can significantly improve search results over a common baseline. However, there is no single technique that consistently outperforms competing approaches across different collections and search tasks.

ACM Transactions on The Web | 2011

Topic Distillation with Query-Dependent Link Connections and Page Characteristics

Mingfang Wu; Falk Scholer; Andrew Turpin

Searchers on the Web often aim to find key resources about a topic. Finding such results is called topic distillation. Previous research has shown that the use of sources of evidence such as page indegree and URL structure can significantly improve search performance on interconnected collections such as the Web, beyond the use of simple term distribution statistics. This article presents a new approach to improve topic distillation by exploring the use of external sources of evidence: link structure, including query dependent indegree and outdegree; and web page characteristics, such as the density of anchor links. Our experiments with the TREC .GOV collection, an 18GB crawl of the US .gov domain from 2002, show that using such evidence can significantly improve search effectiveness, with combinations of evidence leading to significant performance gains over both full-text and anchor-text baselines. Moreover, we demonstrate that, at a different scope level, both local query-dependent outdegree and query-dependent indegree out-performed their global query-independent counterparts; and at the same scope level, outdegree out-performed indegree. Adding query-dependent indegree or page characteristics to query-dependent outdegree could have a small, but not significant, improvement.

international conference on electronic publishing | 1998

Using Document Relationships for Better Answers

Mingfang Wu; Ross Wilkinson

In most retrieval systems the answer to a query is a ranked list of documents. There is little information about the ranking and no support for exploring the relationships that may exist between the documents. In this paper we consider the use of clustering answers to better support users satisfying their information needs. We show how clustering reflects the nature of some information needs, and how the clustering can be used to find more relevant documents than would be the case using simple lists. This work contributes to our approach of building answers to information needs, rather than simply providing lists.

international acm sigir conference on research and development in information retrieval | 2008

User preference choices for complex question answering

Mingfang Wu; Falk Scholer; Andrew Turpin

Question answering systems increasingly need to deal with complex information needs that require more than simple factoid answers. The evaluation of such systems is usually carried out using precision- or recall-based system performance metrics. Previous work has demonstrated that when users are shown two search result lists side-by-side, they can reliably differentiate between the qualities of the lists. We investigate the consistency between this user-based approach and system-oriented metrics in the question answering environment. Our initial results indicate that the two methodologies show a high level of disagreement.

Explore More