Sebastian Blohm
Karlsruhe Institute of Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Sebastian Blohm.
european conference on principles of data mining and knowledge discovery | 2007
Sebastian Blohm; Philipp Cimiano
Textual patterns have been used effectively to extract information from large text collections. However they rely heavily on textual redundancy in the sense that facts have to be mentioned in a similar manner in order to be generalized to a textual pattern. Data sparseness thus becomes a problem when trying to extract information from hardly redundant sources like corporate intranets, encyclopedic works or scientific databases. We present results on applying a weakly supervised pattern induction algorithm to Wikipedia to extract instances of arbitrary relations. In particular, we apply different configurations of a basic algorithm for pattern induction on seven different datasets. We show that the lack of redundancy leads to the need of a large amount of training data but that integrating Web extraction into the process leads to a significant reduction of required training data while maintaining the accuracy of Wikipedia. In particular we show that, though the use of the Web can have similar effects as produced by increasing the number of seeds, it leads overall to better results. Our approach thus allows to combine advantages of two sources: The high reliability of a closed corpus and the high redundancy of the Web.
international conference on knowledge capture | 2007
José Iria; Victoria S. Uren; Alberto Lavelli; Sebastian Blohm; Aba-Sah Dadzie; Thomas Franz; Ioannis Kompatsiaris; João Magalhães; Spiros Nikolopoulos; Christine Preisach; Piercarlo Slavazza
In large organizations the resources needed to solve challenging problems are typically dispersed over systems within and beyond the organization, and also in different media. However, there is still the need, in knowledge environments, for extraction methods able to combine evidence for a fact from across different media. In many cases the whole is more than the sum of its parts: only when considering the different media simultaneously can enough evidence be obtained to derive facts otherwise inaccessible to the knowledge worker via traditional methods that work on each single medium separately. In this paper, we present a cross-media knowledge extraction framework specifically designed to handle large volumes of documents composed of three types of media text, images and raw data and to exploit the evidence across the media. Our goal is to improve the quality and depth of automatically extracted knowledge.
national conference on artificial intelligence | 2007
Sebastian Blohm; Philipp Cimiano; Egon Stemle
Proceedings of the KI 2008 Workshop on Ontology-Based Information Extraction Systems | 2008
Sebastian Blohm; Philipp Cimiano
OTT’06 - Ontologies in Text Technology: Approaches to Extract Semantic KnowledInformation. Publications of the Institute of Cognitive Science, 1-2007 | 2006
Sebastian Blohm; Philipp Cimiano
Applied Semantic Web Technologies | 2011
Sebastian Blohm; Krisztian Buza; Philipp Cimiano; Lars Schmidt-Thieme
WOP'09 Proceedings of the 2009 International Conference on Ontology Patterns - Volume 516 | 2009
Nadejda Nikitina; Sebastian Rudolph; Sebastian Blohm
SemSearch | 2008
Jürgen Umbrich; Sebastian Blohm
Archive | 2008
Sebastian Blohm; Markus Krötzsch; Philipp Cimiano
national conference on artificial intelligence | 2008
Sebastian Blohm; Markus Krötzsch; Philipp Cimiano