Andreas Kupfer
Braunschweig University of Technology
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Andreas Kupfer.
pacific-asia conference on knowledge discovery and data mining | 2006
Brigitte Mathiak; Andreas Kupfer; Richard Münch; Claudia Täubner; Silke Eckstein
In this paper we present a picture search engine for life science literature and show how it can be used to improve literature preselection. This preselection is needed as a way to compensate for the vast amounts of literature that are available. While searching for DNA binding sites for example, we wanted to add the results of specific experiments (DNAse I footprint and EMSA) to our database. The preselection via abstract search was very unspecific (150 000 hits), but by looking for paper with images concerning the experiments, we could improve precision immensely. They are displayed like hits in a search engine, allowing easy and quick quality assessment without having to read through the whole paper. The images are found by their annotation in the paper: the figure caption. To identify that, we analyse the layout of the paper: the position of the image and the surrounding text.
international conference on data mining | 2007
Brigitte Mathiak; Andreas Kupfer; Carolina Rio Bartulos; Tatjana Scope; Johann Weiland; Silke Eckstein
Finding out which genes are expressed in which circumstances is one of the most common tasks in text mining for bioinformatics. But usually the derived data comes from the abstract or other describing texts in the literature. In the age of modern high-throughput microarray analysis, however, there is too much data to be described textual; instead this data often comes in form of tables. In this paper, we are looking specifically at the tables, an approach to our knowledge never described before. The goal is to attach gene names found in tables to their context for a convenient literature review. In order to do so, matching literature has to be downloaded and pre-processed. After that has been done, gene names or protein names can be found through a fast and reliable search, presenting all the associated literature at a glance.
international conference on conceptual modeling | 2006
Andreas Kupfer; Silke Eckstein; Karl Neumann; Brigitte Mathiak
Currently, knowledge from biological research is stored in hundreds of databases, counting only public accessible ones. Finding specific data in these is a challenging task which can be supported by ontologies describing them. The maintenance of a corresponding ontology is time consuming manual work, because research database schemas change rapidly. Our project will reduce the work by automating tasks, like a generation process and applying schema changes to the corresponding ontology. We call the proposed method coevolution, because database schema and ontology are allowed to evolve independently without ever losing their connection to each other. Our method consists of initial ontology generation, manual annotation and change propagation steps.
International Journal of Bioinformatics Research and Applications | 2007
Andreas Kupfer; Silke Eckstein; Britta Stormann; Brigitte Mathiak
Ontologies are one of the key technologies for data integration and meta-databases, by connecting databases at a semantical level. Still, the database has to be connected to the ontology and vice versa. To achieve this, we propose a two step process. First, we automatically generate an ontology directly from the database schema. Next, we annotate this with concepts from a domain specific ontology. We demonstrate this for signal transduction pathways, which describe how cells can react to external stimuli. In this paper we present our mapping of database schemas to ontologies and describe how these ontologies can be enriched by semantical information.
2006 7th International Baltic Conference on Databases and Information Systems | 2006
Andreas Kupfer; Silke Eckstein; Karl Neumann; Brigitte Mathiak
Connecting scientific databases is a challenging task which can be supported by ontologies describing them on a semantical level. Unfortunately, ontologies for databases are rarely used, because schemas of research databases change rapidly while the ontologies grow as well. Maintaining the connection between both is time consuming manual work. Therefore it is worthwhile to reduce the required work by automating tasks. We propose an approach that allows the database schema and the ontology to change and evolve, without ever losing their connection to each other. This is done by mapping database schemas to ontologies, enriching these ontologies with semantical information and transfering schema changes to the ontology
Informatik - Forschung Und Entwicklung | 2005
Brigitte Mathiak; Andreas Kupfer; Karl Neumann
ZusammenfassungMit den XML-basierten Sprachen GML, XSLT und SVG lassen sich Geodaten nicht nur anwendungsorientiert modellieren, sondern auch kartenähnlich visualisieren. In dieser Fallstudie zeigen wir das, indem wir realistische Geodatenbestände der Landesvermessungsämter zunächst mit der Geography Markup Language (GML) nachmodellieren. So mit GML strukturierte Daten werden dann mit der Extensible Stylesheet Language for Transformation (XSLT) auf Elemente der Sprache Scalable Vector Graphics (SVG) abgebildet. Dabei wird der Prozess der kartografischen Visualisierung durch XSLT-Konstrukte modelliert und auch gleichzeitig implementiert. Als Ergebnis erhalten wir Grafiken, die den entsprechenden Karten der Landesämter zumindest nicht unähnlich sind. AbstractUsing the XML-based languages GML, XSLT and SVG, we modeled German geo data and also visualized it in map-like graphics. This case study shows the feasibility of that approach, by modelling the data with the Geography Markup Language (GML). Then, the GML-structured data is mapped with the Extensible Stylesheet Language for Transformation (XSLT) to elements of the language Scalable Vector Graphics (SVG). The process of cartographical visualization is thus modelled through XSLT-constructs and at same time also implemented. The results are graphics, which share close ressemblance to the corresponding maps of the official offices.
Mining Complex Data | 2009
Brigitte Mathiak; Andreas Kupfer; Silke Eckstein
It is said that the world knowledge is in the Internet. Scientific knowledge is in the books, journals and conference proceedings. Yet both repositories are too large to skim through manually. We need clever algorithms to cope with the huge amount of information. To filter, sort and ultimately mine the information available it is vital to use every source of information we have. A common technique is to mine the text from the publications, but they are more complex than the text they include. The position of the words gives us clues about their meaning. Additional images either supplement the text or offer proof to a proposition. Tables cannot be understood before deciphering the rows and columns. To deal with the additional information, classic text mining techniques have to be coupled with spatial data and image data. In this chapter, we will give some background to the various techniques, explain the necessary pre-processing steps involved and present two case studies, one from image mining and one from table identification.
international conference on data mining | 2006
Brigitte Mathiak; Andreas Kupfer; Tatjana Scope; Britta Stormann; Silke Eckstein
The aim of literature retrieval is to find significant papers on a given topic. In previous publications, we examined the use of choosing these papers based on the pictures they include. To refine this approach we seek to employ picture classification to further narrow down the number of interesting pictures presented. This can be useful, for example, when looking for the results of specific experiments. The classification can also be useful as a data cleansing step, to omit all unnecessary pictures not used as a figure. We use a method originally designed to distinguish between photos and computer-generated pictures on the Web. We show that this method can not only be used to distinguish between raw data and derived representation figures, we can also reliably eliminate nonfigure pictures in the document, like text pages and logos. We tested this approach on two different data sets with different topics and different non-figure problems, both with satisfactory results
international conference of the ieee engineering in medicine and biology society | 2006
Claudia Täubner; Brigitte Mathiak; Andreas Kupfer; Fleischer N; Silke Eckstein
computer-based medical systems | 2006
Andreas Kupfer; Silke Eckstein; Karl Neumann; Brigitte Mathiak