Sven Groppe
University of Lübeck
Publication
Featured research published by Sven Groppe.
TAEBC-2011 | 2011
Sven Groppe
The Semantic Web, which is intended to establish a machine-understandable Web, is currently changing from an emerging trend into a technology used in complex real-world applications. A number of standards and techniques have been developed by the World Wide Web Consortium (W3C), e.g., the Resource Description Framework (RDF), which provides a general method for conceptual descriptions of Web resources, and SPARQL, an RDF query language. Recent examples of large RDF data sets with billions of facts include the UniProt comprehensive catalog of protein sequence, function, and annotation data, the RDF data extracted from Wikipedia, and Princeton University's WordNet. Clearly, query performance has become a key issue for Semantic Web applications. In his book, Groppe details various aspects of high-performance Semantic Web data management and query processing. His presentation fills the gap between Semantic Web books and database books, which either fail to take into account the performance issues of large-scale data management or fail to exploit the special properties of Semantic Web data models and queries. After a general introduction to the relevant Semantic Web standards, he presents specialized indexing and sorting algorithms, adapted approaches for logical and physical query optimization, optimization possibilities when using parallel database technologies on today's multicore processors, and visual and embedded query languages. Groppe primarily targets researchers, students, and developers of large-scale Semantic Web applications. On the complementary book webpage, readers will find additional material, such as an online demonstration of a query engine and exercises with solutions that test their comprehension of the topics presented.
ACM Symposium on Applied Computing | 2008
Sven Groppe; Jinghua Groppe; Volker Linnemann; Dirk Kukulenz; Nils Hoeller; Christoph Reinke
The tree-based XML languages XQuery and XSLT are widely supported, whereas many tools do not yet support SPARQL, the newer graph query language for RDF. We propose to embed SPARQL subqueries into XQuery/XSLT, such that XQuery and XSLT benefit from the graph query constructs of SPARQL, and SPARQL benefits from features of XQuery/XSLT that SPARQL itself does not support. The embedding enables XQuery/XSLT tools to handle XML queries and SPARQL subqueries, and XML and RDF data, at the same time.
Signal-Image Technology and Internet-Based Systems | 2007
Sven Groppe; Jinghua Groppe; Dirk Kukulenz; Volker Linnemann
The basic data format of the Semantic Web is RDF. SPARQL, which has been developed by the W3C, is the upcoming standard RDF query language. Typical engines for processing SPARQL queries on RDF data first read all RDF data, may build indices over the complete data, and only afterwards evaluate SPARQL queries. Such engines cannot operate on streaming RDF data. Streaming query engines operating on streams of data can (a) discard irrelevant input as early as possible, and thus save processing and space costs, (b) build indices only on those parts of the data that are needed for the evaluation of the query, and (c) determine partial results of a query as early as possible, and thus evaluate queries more efficiently. We propose such a streaming SPARQL engine, which is, to the best of our knowledge, the first of its kind.
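The streaming idea in the abstract above can be illustrated with a minimal sketch. It is not the engine described in the paper: it assumes a single triple pattern, triples represented as plain string tuples, and variables written as `?name`. The function names `match` and `stream_matches` are hypothetical. It only shows points (a) and (c): non-matching triples are dropped as soon as they are read, and each result is produced as soon as its triple arrives.

```python
from typing import Dict, Iterable, Iterator, Optional, Tuple

Triple = Tuple[str, str, str]
# Assumption: a pattern term is a constant string or a variable written "?name".
Pattern = Tuple[str, str, str]


def is_var(term: str) -> bool:
    return term.startswith("?")


def match(pattern: Pattern, triple: Triple) -> Optional[Dict[str, str]]:
    """Return the variable bindings if the triple matches the pattern, else None."""
    bindings: Dict[str, str] = {}
    for p_term, t_term in zip(pattern, triple):
        if is_var(p_term):
            # A variable that occurs twice must bind to the same value both times.
            if bindings.setdefault(p_term, t_term) != t_term:
                return None
        elif p_term != t_term:
            return None
    return bindings


def stream_matches(pattern: Pattern, triples: Iterable[Triple]) -> Iterator[Dict[str, str]]:
    """Yield bindings one at a time; triples that do not match are never stored."""
    for triple in triples:
        bindings = match(pattern, triple)
        if bindings is not None:
            yield bindings


if __name__ == "__main__":
    incoming = iter([("a", "knows", "b"), ("a", "age", "30"), ("b", "knows", "c")])
    for result in stream_matches(("?x", "knows", "?y"), incoming):
        print(result)
```

Because `stream_matches` is a generator, it works on any iterator, including one that reads triples from a network socket, and it never holds more than one triple in memory.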
Journal of Sensor and Actuator Networks | 2013
Richard Mietz; Sven Groppe; Kay Uwe Römer; Dennis Pfisterer
The Internet of Things is anticipated to connect billions of embedded devices equipped with sensors to perceive their surroundings. As a result, the state of the real world will be available online and in real time, and can be combined with other data and services on the Internet to realize novel applications such as Smart Cities, Smart Grids, or Smart Healthcare. This requires an open representation of sensor data and scalable search over data from diverse sources, including sensors. In this paper, we show how the Semantic Web technologies RDF (an open semantic data format) and SPARQL (a query language for RDF-encoded data) can be used to address these challenges. In particular, we describe how prediction models can be employed for scalable sensor search, how these prediction models can be encoded as RDF, and how the models can be queried by means of SPARQL.
Data and Knowledge Engineering | 2008
Jinghua Groppe; Sven Groppe
The satisfiability test checks whether the evaluation of a query returns the empty set for every input document, and can be used in query optimization to avoid submitting and computing unsatisfiable queries. Thus, applying the satisfiability test before executing a query can save processing time and query costs. We focus on the satisfiability problem for queries formulated in the XML query language XPath, and propose a schema-based approach to the satisfiability test of XPath queries, which checks whether an XPath query conforms to the constraints in a given schema. If an XPath query does not conform to the constraints given in the schema, the evaluation of the query will return an empty result for any valid XML document. Thus, the XPath query is unsatisfiable. We present a complexity analysis of our approach, which proves that it is efficient for typical cases. We also present an experimental analysis of our prototype, which shows the optimization potential of avoiding the evaluation of unsatisfiable queries.
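The core idea of the abstract above, rejecting a query that cannot match any schema-valid document, can be sketched for a drastically simplified case. The sketch assumes absolute XPath queries that use only child-axis name steps (such as `/library/book/title`) and a schema reduced to a map from each element name to its allowed child element names. The function `is_satisfiable` and the example schema are hypothetical and far simpler than the approach in the paper, which handles a much larger part of XPath and real schema constraints.

```python
from typing import Dict, Set


def is_satisfiable(path: str, root: str, schema: Dict[str, Set[str]]) -> bool:
    """Check an absolute child-axis path against a parent -> allowed-children map.

    Returns False when some step names an element the schema never allows at
    that position, so the query would be empty for every valid document.
    """
    steps = [step for step in path.split("/") if step]
    if not steps or steps[0] != root:
        return False
    current = root
    for step in steps[1:]:
        if step not in schema.get(current, set()):
            return False
        current = step
    return True


if __name__ == "__main__":
    # Hypothetical schema: a library contains books; a book has a title and authors.
    schema = {"library": {"book"}, "book": {"title", "author"}}
    print(is_satisfiable("/library/book/title", "library", schema))
    print(is_satisfiable("/library/title", "library", schema))
```

A query optimizer could run such a check before submitting a query and skip evaluation entirely when it returns `False`.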
Data and Knowledge Engineering | 2006
Sven Groppe; Stefan Böttcher; Georg Birkenheuer; André Höing
Applications using XML for data representation very often use different XML formats and thus require the transformation of XML data. The common approach transforms entire XML documents from one format into another, e.g., by using an XSLT stylesheet. In contrast to this approach, we use an XSLT stylesheet to transform a given XPath query or a given XSLT query so that we retrieve and transform only that part of the XML document that is sufficient to answer the given query. Among other things, our approach avoids problems of replication, and saves processing time and, in distributed scenarios, transportation costs.
ACM Symposium on Applied Computing | 2009
Jinghua Groppe; Sven Groppe; Sebastian Ebers; Volker Linnemann
Since RDF and relational databases, as well as SPARQL and SQL, share many properties, many efforts focus on leveraging research results on optimizing relational query languages to optimize SPARQL queries. However, SPARQL has characteristics of its own that differ from SQL and that are not fully exploited by existing work. Therefore, there is still much room for research on optimizing SPARQL queries. Based on the triple nature of RDF data, we create 7 indices to retrieve RDF data quickly; based on the SPARQL-specific properties and the 7 indices, we develop a new, efficient approach to computing joins by dynamically restricting triple patterns. Our experimental results show the efficiency of our approach.
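The abstract above does not say which 7 indices are built. One plausible reading, taken here as an assumption, is one hash index per non-empty subset of the three triple positions (S, P, O, SP, SO, PO, SPO), so that any triple pattern with at least one constant can be answered by a single lookup. The class `TripleStore` below is a hypothetical in-memory sketch of that reading, not the data structure from the paper.

```python
from collections import defaultdict
from itertools import combinations
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]
POSITIONS = (0, 1, 2)  # subject, predicate, object

# Assumption: one index per non-empty subset of positions, which gives 7 indices.
KEY_SETS = [c for size in (1, 2, 3) for c in combinations(POSITIONS, size)]


class TripleStore:
    def __init__(self) -> None:
        self.triples: List[Triple] = []
        self.indices: Dict[Tuple[int, ...], Dict[Tuple[str, ...], List[Triple]]] = {
            key_set: defaultdict(list) for key_set in KEY_SETS
        }

    def add(self, triple: Triple) -> None:
        """Store the triple and register it in all 7 indices."""
        self.triples.append(triple)
        for key_set, index in self.indices.items():
            index[tuple(triple[i] for i in key_set)].append(triple)

    def lookup(self, s: Optional[str] = None, p: Optional[str] = None,
               o: Optional[str] = None) -> List[Triple]:
        """Return all triples matching the given constants; None means 'any'."""
        pattern = (s, p, o)
        key_set = tuple(i for i in POSITIONS if pattern[i] is not None)
        if not key_set:
            return list(self.triples)  # no constant: full scan
        key = tuple(pattern[i] for i in key_set)
        return list(self.indices[key_set].get(key, []))


if __name__ == "__main__":
    store = TripleStore()
    store.add(("a", "knows", "b"))
    store.add(("b", "knows", "c"))
    store.add(("a", "age", "30"))
    print(store.lookup(s="a"))
    print(store.lookup(p="knows", o="c"))
```

Under this reading, a join can substitute each binding from one triple pattern into the next pattern, which turns a variable into a constant and lets the next lookup use a more selective index.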
International Conference on Move to Meaningful Internet Systems | 2007
Matthias Droop; Markus Flarer; Jinghua Groppe; Sven Groppe; Volker Linnemann; Jakob Pinggera; Florian Santner; Michael Schier; Felix Schöpf; Hannes Staffler; Stefan Zugal
The W3C has developed XPath [3] as a query language for XML data. XPath is embedded in many other languages such as XQuery and XSLT. The name XPath derives from its basic concept, the path expression, with which the user can hierarchically address the nodes of the XML data. The user of XPath may use not only simple relationships like parent-child, but also more complex relationships like the descendant relationship, which is the transitive closure of the parent-child relationship. Furthermore, complex filter expressions are allowed in XPath queries. RDF is a language for representing information about resources in the World Wide Web. SPARQL [2] supports querying RDF by triple patterns and optional patterns, conjunctions and disjunctions, and extensible value testing.
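The statement above that the descendant relationship is the transitive closure of the parent-child relationship can be checked with a small sketch using Python's standard `xml.etree.ElementTree` module. The helper `descendants` is a hypothetical name. It follows child links repeatedly and is compared against the limited XPath support built into ElementTree, where `.//*` selects all descendant elements.

```python
import xml.etree.ElementTree as ET
from typing import List


def descendants(node: ET.Element) -> List[ET.Element]:
    """Collect all descendants by repeatedly following the parent-child relation."""
    result: List[ET.Element] = []
    for child in node:  # iterating an Element yields its direct children
        result.append(child)
        result.extend(descendants(child))
    return result


if __name__ == "__main__":
    root = ET.fromstring("<a><b><c/></b><d/></a>")
    via_closure = [e.tag for e in descendants(root)]
    via_xpath = [e.tag for e in root.findall(".//*")]
    print(via_closure, via_xpath)
```

Both lists contain the same elements in document order, which is what the transitive-closure definition predicts.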
International Conference on Data Engineering | 2006
Sven Groppe; Stefan Böttcher; Jinghua Groppe
XPath is widely used as an XML query language and is embedded in XQuery expressions and in XSLT stylesheets. In this paper, we propose a rule set which logically simplifies XPath queries by using a heuristic method in order to improve the processing time. Furthermore, we show how to substitute the XPath 2.0 intersect and except operators in a given XPath query with computed filter expressions. A performance evaluation comparing the execution times of the original XPath queries, which contain the intersect and except operators, and of the queries that are the result of our simplification approach shows that, depending on the used query evaluator and on the original query, performance improvements of a factor of up to 350 are possible.
Web Information and Data Management | 2003
Sven Groppe; Stefan Böttcher
Whenever XML data must be shared by heterogeneous applications, transformations between different application-specific XML formats are necessary. The state-of-the-art method transforms entire XML documents from one application format into another, e.g., by using an XSLT stylesheet, so that each application can work locally on its preferred format. In our approach, we use an XSLT stylesheet to transform a given XPath query such that we retrieve and transform only that part of the XML document that is sufficient to answer the given query. Among other things, our approach avoids problems of replication, and saves processing time and, in distributed scenarios, transportation costs.