Publication


Featured research published by Wolfgang Gatterbauer.


International World Wide Web Conferences | 2007

Towards domain-independent information extraction from web tables

Wolfgang Gatterbauer; Paul Bohunsky; Marcus Herzog; Bernhard Krüpl; Bernhard Pollak

Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. The multitude of different HTML implementations of web tables makes these approaches difficult to scale. In this paper, we approach the problem of domain-independent information extraction from web tables by shifting our attention from the tree-based representation of web pages to a variation of the two-dimensional visual box model used by web browsers to display the information on the screen. The topological and style information thereby obtained allows us to fill the gap created by missing domain-specific knowledge about content and table templates. We believe that, in a future step, this approach can become the basis for a new way of large-scale knowledge acquisition from the current Visual Web.


International World Wide Web Conferences | 2005

Using visual cues for extraction of tabular data from arbitrary HTML documents

Bernhard Krüpl; Marcus Herzog; Wolfgang Gatterbauer

We describe a method to extract tabular data from web pages. Rather than just analyzing the DOM tree, we also exploit visual cues in the rendered version of the document to extract data from tables that are not explicitly marked with an HTML table element. To detect tables, we rely on a variant of the well-known X-Y cut algorithm as used in the OCR community. We implemented the system by directly accessing Mozilla's box model, which contains the positional data for all HTML elements of a given web page.
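
The X-Y cut idea mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation or their variant: it assumes the rendered elements are already available as axis-aligned boxes `(x0, y0, x1, y1)`, and the names `xy_cut`, `_split`, and `min_gap` are hypothetical. The sketch recursively splits a set of boxes wherever an empty horizontal or vertical gap separates them.

```python
from typing import List, Tuple

# Assumed input format: one box per rendered element, as (x0, y0, x1, y1).
Box = Tuple[float, float, float, float]


def _split(boxes: List[Box], axis: int, min_gap: float) -> List[List[Box]]:
    """Group boxes separated by empty gaps along one axis (0 = x, 1 = y)."""
    ordered = sorted(boxes, key=lambda b: b[axis])
    groups = [[ordered[0]]]
    reach = ordered[0][axis + 2]  # farthest end coordinate seen so far
    for box in ordered[1:]:
        if box[axis] - reach >= min_gap:
            groups.append([box])  # empty gap found: start a new group
        else:
            groups[-1].append(box)
        reach = max(reach, box[axis + 2])
    return groups


def xy_cut(boxes: List[Box], min_gap: float = 1.0) -> List[List[Box]]:
    """Recursively cut along y gaps, then x gaps; return the leaf groups."""
    if len(boxes) <= 1:
        return [boxes] if boxes else []
    for axis in (1, 0):  # try horizontal cuts (rows) first, then vertical
        groups = _split(boxes, axis, min_gap)
        if len(groups) > 1:
            leaves: List[List[Box]] = []
            for group in groups:
                leaves.extend(xy_cut(group, min_gap))
            return leaves
    return [boxes]  # no gap on either axis: this group cannot be cut further


# Four boxes laid out as a 2 x 2 grid, such as cells of a table built
# without a table element.
cells = [(0, 0, 10, 5), (20, 0, 30, 5), (0, 10, 10, 15), (20, 10, 30, 15)]
for leaf in xy_cut(cells):
    print(leaf)
```

In this sketch the first level of cuts corresponds to candidate rows and the second to candidate cells; a real system would still need to decide which of the resulting grids are actually tables.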


International World Wide Web Conferences | 2006

Estimating required recall for successful knowledge acquisition from the web

Wolfgang Gatterbauer

Information on the Web is not only abundant but also redundant. This redundancy has an important consequence for the relation between the recall of an information gathering system and its capacity to harvest the core information of a given domain of knowledge. This paper provides a new idea for estimating the Web coverage a knowledge acquisition system needs in order to achieve a desired coverage of the contained core information.
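
The relation between recall and redundancy described in this abstract can be made concrete with a toy calculation. This is not the paper's model: it assumes every core fact is stated the same number of times (`redundancy`) and that each mention is captured independently with probability equal to the system's recall. The function names are hypothetical.

```python
def expected_coverage(recall: float, redundancy: int) -> float:
    """Probability that at least one of `redundancy` mentions is captured."""
    return 1.0 - (1.0 - recall) ** redundancy


def required_recall(target_coverage: float, redundancy: int) -> float:
    """Recall needed to reach `target_coverage` under the same toy assumptions."""
    return 1.0 - (1.0 - target_coverage) ** (1.0 / redundancy)


# With each fact stated twice, a recall of 0.5 already covers 75% of the facts.
print(expected_coverage(0.5, 2))
# Conversely, the recall needed for 75% coverage at redundancy 2.
print(required_recall(0.75, 2))
```

Under these assumptions, higher redundancy lowers the recall needed for a given coverage of core information, which is the qualitative effect the abstract points to.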


Archive | 2013

Counterexamples to Commonly Held Assumptions on Unit Commitment and Market Power Assessment

Wolfgang Gatterbauer; Marija D. Ilic

This chapter revisits two commonly accepted assumptions about the economics of deregulated electricity markets. It first disproves that, in theory and under the condition of perfect information, decentralized and centralized unit commitment (UC) would lead to the same power quantities traded and the same optimal social welfare. It then shows that a generator owner’s optimum bid sequence for an auction market is generally above marginal cost, even where absolutely no abuse of market power is involved.


National Conference on Artificial Intelligence | 2006

Table extraction using spatial reasoning on the CSS2 visual box model

Wolfgang Gatterbauer; Paul Bohunsky


Archive | 2008

Information extraction using spatial reasoning on the CSS2 visual box model

Wolfgang Gatterbauer; Bernhard Krüpl; Paul Bohunsky; Marcus Herzog


Archive | 2002

Interdependencies of Electricity Market Characteristics and Bidding Strategies of Power Producers

Wolfgang Gatterbauer


Advanced power plant processes for solid fuels focussing on combined cycles. Workshop | 2000

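Der Graz Cycle für Industriekraftwerke gefeuert mit Brenngasen aus Kohle- und Schwerölvergasung [The Graz Cycle for industrial power plants fired with fuel gases from coal and heavy-oil gasification]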

Herbert Jericha; A. Lukasser; Wolfgang Gatterbauer


Conference on Current Trends in Theory and Practice of Informatics | 2007

Creating Permanent Test Collections of Web Pages for Information Extraction Research

Bernhard Pollak; Wolfgang Gatterbauer


Archive | 2005

Web Information Extraction Using Eupeptic Data in Web Tables

Wolfgang Gatterbauer; Bernhard Krüpl; Wolfgang Holzinger; Marcus Herzog

Collaboration


Dive into Wolfgang Gatterbauer's collaborations.

Top Co-Authors

Marcus Herzog

Vienna University of Technology

Bernhard Krüpl

Vienna University of Technology

Paul Bohunsky

Vienna University of Technology

Bernhard Pollak

Vienna University of Technology

Herbert Jericha

Graz University of Technology

Wolfgang Holzinger

Vienna University of Technology

Marija D. Ilic

Carnegie Mellon University
