Thijs Westerveld
University of Twente
Publication
Featured research published by Thijs Westerveld.
International ACM SIGIR Conference on Research and Development in Information Retrieval | 2002
Wessel Kraaij; Thijs Westerveld; Djoerd Hiemstra
An important class of searches on the World Wide Web aims to find the entry page (homepage) of an organisation. Entry page search is quite different from ad hoc search; indeed, a plain ad hoc system performs disappointingly. We explored three non-content features of web pages: page length, number of incoming links, and URL form. The URL form in particular proved to be a good predictor. Using URL form priors, we found over 70% of all entry pages at rank 1 and up to 89% in the top 10. Non-content features can easily be embedded in a language model framework as a prior probability.
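As a rough illustration of how a non-content feature can enter a language model as a prior, the sketch below adds the log of a URL-form prior to a smoothed query-likelihood score. The four URL classes, the prior values, the smoothing weight `lam`, and the function names are all assumptions made for this example; they are not taken from the paper.

```python
import math
from collections import Counter
from urllib.parse import urlparse

# Hypothetical prior probabilities per URL class, for illustration only.
URL_PRIOR = {"root": 0.60, "subroot": 0.25, "path": 0.10, "file": 0.05}

def url_form(url):
    """Classify a URL by its shape (an assumed four-way scheme)."""
    path = urlparse(url).path
    if path.endswith("index.html"):
        path = path[: -len("index.html")]
    segments = [s for s in path.split("/") if s]
    if not segments:
        return "root"      # bare domain, optionally with index.html
    if path.endswith("/"):
        return "subroot" if len(segments) == 1 else "path"
    return "file"          # anything ending in a file name

def log_score(query, doc_terms, url, coll_tf, coll_len, lam=0.5):
    """log P(D) + sum over query terms of log(lam*P(q|D) + (1-lam)*P(q|C))."""
    tf = Counter(doc_terms)
    score = math.log(URL_PRIOR[url_form(url)])
    for term in query:
        p_doc = tf[term] / len(doc_terms) if doc_terms else 0.0
        p_coll = coll_tf.get(term, 0) / coll_len
        p = lam * p_doc + (1 - lam) * p_coll
        if p == 0.0:
            return float("-inf")  # term unseen in the whole collection
        score += math.log(p)
    return score
```

With identical page text, a page at a root URL scores higher than one at a deep file URL in this sketch, because only the prior term differs.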
Lecture Notes in Artificial Intelligence | 2001
Carol Peters; Djoerd Hiemstra; Wessel Kraaij; Renée Pohlmann; Thijs Westerveld
International ACM SIGIR Conference on Research and Development in Information Retrieval | 2003
Thijs Westerveld; Arjen P. de Vries
The main conclusion from the metrics-based evaluation of video retrieval systems at TREC's video track is that non-interactive image retrieval from general collections using visual information only is not yet feasible. We show how a detailed analysis of retrieval results, looking beyond mean average precision (MAP) scores on topical relevance, gives significant insight into the main problems with the visual part of the retrieval model under study. Such an analytical approach proves an important addition to standard evaluation measures.
International ACM SIGIR Conference on Research and Development in Information Retrieval | 2002
Thijs Westerveld
We present a framework in which probabilistic models for textual and visual information retrieval can be integrated seamlessly. The framework facilitates searching for imagery using textual descriptions and visual examples simultaneously. The underlying Language Models for text and Gaussian Mixture Models for images have proven successful in various retrieval tasks.
International Workshop of the Initiative for the Evaluation of XML Retrieval | 2006
Theodora Tsikrika; Thijs Westerveld
The multimedia track focuses on using the structure of the document to extract, relate, and combine the relevance of different multimedia fragments. This paper presents a brief overview of the track, its collection, tasks, and goals. We also report the results and the approaches of the participating groups.
IEEE Transactions on Circuits and Systems for Video Technology | 2007
Franciska de Jong; Thijs Westerveld; Arjen de Vries
This paper addresses the focus of this special issue by analyzing the potential contribution of linguistic content and other non-image aspects to the processing of audiovisual data. It summarizes the various ways in which linguistic content analysis contributes to enhancing the semantic annotation of multimedia content and, as a consequence, to improving the effectiveness of conceptual media access tools. A number of techniques are presented, including the time-alignment of textual resources, audio and speech processing, content reduction and reasoning tools, and the exploitation of surface features.
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval | 2005
Vojkan Mihajlovic; Georgina Ramirez; Thijs Westerveld; Djoerd Hiemstra; Henk Ernst Blok; Arjen P. de Vries
Retrieving information from heterogeneous data sources in a flexible manner and within a single (database) framework is still a challenge. In this paper we present several extensions of our prototype database system TIJAH developed for structured retrieval. The extensions are aimed at modeling vague selection of XML elements and image retrieval. All three levels (conceptual, logical, and physical) of the TIJAH system are enhanced to support the extensions. Additionally, we analyze different ways of removing overlap and explain how structural information can be used for relevance feedback.
Cross-Language Evaluation Forum | 2000
Djoerd Hiemstra; Wessel Kraaij; Renée Pohlmann; Thijs Westerveld
This paper describes the official runs of the Twenty-One group for the first CLEF workshop. The Twenty-One group participated in the monolingual, bilingual, and multilingual tasks. The following new techniques are introduced in this paper. In the bilingual task, we experimented with different methods to estimate translation probabilities. In the multilingual task, we experimented with refinements on raw-score merging techniques and with a new relevance feedback algorithm that re-estimates both the model's translation probabilities and the relevance weights. Finally, we performed preliminary experiments to exploit the web to generate translation probabilities and bilingual dictionaries, notably for English-Italian and English-Dutch.
Conference on Information and Knowledge Management | 2005
Georgina Ramirez; Thijs Westerveld; Arjen P. de Vries
The structural features of XML components are an extra source of information that should be used in a content-oriented retrieval task on this type of document. In this paper we explore one of the structural features from the INEX collection [1] that could be used in content-oriented search. We analyse the gain this knowledge could add to the performance of an information retrieval system and present a first approach to extracting this structural information from a relevance feedback process so that it can be used as priors in a language modelling framework.
International ACM SIGIR Conference on Research and Development in Information Retrieval | 2007
Theodora Tsikrika; Thijs Westerveld
Structured document retrieval allows for the retrieval of document fragments, i.e., XML elements, containing relevant information. The main INEX Ad Hoc track focuses on text-based XML element retrieval. Although text is the dominant medium in most XML document collections, other types of media can also be found. Existing research on multimedia information retrieval has shown that it is far from trivial to determine the combined relevance of a document that contains several multimedia objects. The objective of the INEX multimedia track is to exploit the XML structure, which provides a logical level at which multimedia objects are connected, to improve the retrieval performance of an XML-driven multimedia information retrieval system.