Jens Graupmann
Max Planck Society
Publication
Featured research published by Jens Graupmann.
very large data bases | 2004
Jens Graupmann; Michael Biwer; Christian Zimmer; Patrick Zimmer; Matthias Bender; Martin Theobald; Gerhard Weikum
This chapter introduces a concept-based Web search engine for HTML, XML, and deep Web data—Context-Oriented Multi-Format Portal-Aware Search System (COMPASS). It also presents the features and architectures of COMPASS. The internal query language of COMPASS resembles a highly simplified version of mainstream languages such as SQL, XPath, or XQuery. Search conditions refer to concepts and values that correspond to element names and contents in an XML setting, and attribute names and values in a SQL setting. COMPASS uses a centralized data index for efficient search evaluation. All data and also the relationships among documents are represented in a relational database. All data formats are transformed into XML by using heuristics as well as external annotation tools such as GATE.
very large data bases | 2003
Sergej Sizov; Jens Graupmann; Martin Theobald
Focused crawling is a relatively new, promising approach to improving the recall of expert search on the Web. It typically starts from a user- or community-specific tree of topics along with a few training documents for each tree node, and then crawls the Web with a focus on these topics of interest. This process can efficiently build a theme-specific, hierarchical directory whose nodes are populated with relevant high-quality documents for expert Web search. The BINGO! focused crawler implements an approach that aims to overcome the limitations of the initial training data. BINGO! identifies, among the crawled and positively classified documents of a topic, characteristic archetypes and uses them for periodically retraining the classifier. This way, the crawler is dynamically adapted based on the most significant documents seen so far.
extending database technology | 2004
Jens Graupmann
In this paper we show the current state of ongoing research on our prototype search engine for semi-structured data, which incorporates rules mined from extracted structured data. We illuminate some ideas from the research field of data mining and show how to apply them to the retrieval process. Additionally, we show technical aspects and features of our search engine.
very large data bases | 2005
Jens Graupmann; Ralf Schenkel; Gerhard Weikum
conference on innovative data systems research | 2003
Sergej Sizov; Martin Theobald; Stefan Siersdorfer; Gerhard Weikum; Jens Graupmann; Michael Biwer; Patrick Zimmer
geographic information retrieval | 2006
Jens Graupmann; Ralf Schenkel; Ross S. Purves; Christopher B. Jones
IEEE Data(base) Engineering Bulletin | 2002
Jens Graupmann; Gerhard Weikum
BTW 2003 | 2003
Jens Graupmann; Michael Biwer; Patrick Zimmer; Gerhard Weikum; Harald Schöning; Erhard Rahm
Untitled Event | 2005
Jens Graupmann; Ralf Schenkel
Untitled Event | 2005
Jens Graupmann; Ralf Schenkel; Gerhard Weikum; Klemens Böhm; Christian S. Jensen; Laura M. Haas; Martin L. Kersten; Per-Ake Larson; Beng Chin Ooi