Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Sebastian Kuckuk is active.

Publication


Featured researches published by Sebastian Kuckuk.


european conference on parallel processing | 2014

ExaStencils: Advanced Stencil-Code Engineering

Christian Lengauer; Sven Apel; Matthias Bolten; Armin Größlinger; Frank Hannig; Harald Köstler; Ulrich Rüde; Jürgen Teich; Alexander Grebhahn; Stefan Kronawitter; Sebastian Kuckuk; Hannah Rittich; Christian Schmitt

Project ExaStencils pursues a radically new approach to stencil-code engineering. Present-day stencil codes are implemented in general-purpose programming languages, such as Fortran, C, or Java, or derivates thereof, and harnesses for parallelism, such as OpenMP, OpenCL or MPI. ExaStencils favors a much more domain-specific approach with languages at several layers of abstraction, the most abstract being the mathematical formulation, the most concrete the optimized target code. At every layer, the corresponding language expresses not only computational directives but also domain knowledge of the problem and platform to be leveraged for optimization. This approach will enable a highly automated code generation at all layers and has been demonstrated successfully before in the U.S. projects FFTW and SPIRAL for certain linear transforms.


ieee international conference on high performance computing data and analytics | 2014

ExaSlang: a domain-specific language for highly scalable multigrid solvers

Christian Schmitt; Sebastian Kuckuk; Frank Hannig; Harald Köstler; Jürgen Teich

High-Performance Computing (HPC) systems are becoming increasingly parallel and heterogeneous. As a consequence, HPC applications, such as simulation software, need to be especially designed towards these systems to achieve optimal performance. This, in turn, leads to higher complexity, making software engineers and scientists require a deep knowledge of the hardware and its technologies. As a remedy, domain-specific languages (DSLs) are a convenient technology for domain experts to describe settings and problems they want to solve using terms and models familiar to them. This specification is transformed into a target language, i. e., source code in another programming language or a binary executable, by a specialized compiler. We propose ExaSlang, a language for the specification of numerical solvers based on the multigrid method targeting distributed-memory systems. Furthermore, we present the transformation framework that drives the corresponding source-to-source compiler. It emits C++ code utilizing a hybrid OpenMP and MPI parallelization. Moreover, we substantiate our approach with scaling results of our code scaling up to the complete JUQUEEN cluster, consisting of 28,672 nodes, with a total of 458,752 cores.


international conference on computational science and its applications | 2014

An Evaluation of Domain-Specific Language Technologies for Code Generation

Christian Schmitt; Sebastian Kuckuk; Harald Köstler; Frank Hannig; Jürgen Teich

Software systems are becoming increasingly complex, requiring a deep knowledge to work and program with them. This is especially true for simulation frameworks used by scientists and engineers, but also applies to completely different domains such as mobile or web applications. To ease working with these systems, domain-specific languages (DSLs) are a convenient way to enable domain experts describe settings and problems they want to solve using terms familiar to them. Building upon this specification in the DSL, a compiler transform this to the target software framework, e.,g., runnable program code. To write such a compiler, a solid implementation framework is needed. In this paper, we propose criteria for the evaluation of textual programming language implementation frameworks to which we accordingly evaluate four technologies, namely Spoofax/IMP, Rascal MPL, a custom approach using C++ and a custom approach using Scala.


Parallel Processing Letters | 2014

Experiments on Optimizing the Performance of Stencil Codes with SPL Conqueror

Alexander Grebhahn; Sebastian Kuckuk; Christian Schmitt; Harald Köstler; Norbert Siegmund; Sven Apel; Frank Hannig; Jürgen Teich

A standard technique for numerically solving elliptic partial differential equations on structured grids is to discretize them, and, then, to apply an efficient geometric multi-grid solver. Unfortunately, finding the optimal choice of multi-grid components and parameter settings is challenging and existing auto-tuning techniques fail to explain performance-optimal settings. To improve the state of the art, we explore whether recent work on optimizing configurations of product lines can be applied to the stencil-code domain. In particular, we extend the domain-independent tool SPL Conqueror in an empirical study to predict the performance-optimal configurations of three geometric multi-grid stencil codes: a program using HIPAcc, the evaluation prototype HSMGP, and a program using DUNE. For HIPAcc, we reach an prediction accuracy of 96%, on average, measuring only 21.4% of all configurations; we predict a configuration that is nearly optimal after measuring less than 0.3% of all configurations. For HSMGP, w...


Computation | 2016

Automatic Generation of Massively Parallel Codes from ExaSlang

Sebastian Kuckuk; Harald Köstler

Domain-specific languages (DSLs) have the potential to provide an intuitive interface for specifying problems and solutions for domain experts. Based on this, code generation frameworks can produce compilable source code. However, apart from optimizing execution performance, parallelization is key for pushing the limits in problem size and an essential ingredient for exascale performance. We discuss necessary concepts for the introduction of such capabilities in code generators. In particular, those for partitioning the problem to be solved and accessing the partitioned data are elaborated. Furthermore, possible approaches to expose parallelism to users through a given DSL are discussed. Moreover, we present the implementation of these concepts in the ExaStencils framework. In its scope, a code generation framework for highly optimized and massively parallel geometric multigrid solvers is developed. It uses specifications from its multi-layered external DSL ExaSlang as input. Based on a general version for generating parallel code, we develop and implement widely applicable extensions and optimizations. Finally, a performance study of generated applications is conducted on the JuQueen supercomputer.


Software for Exascale Computing | 2016

Systems of Partial Differential Equations in ExaSlang

Christian Schmitt; Sebastian Kuckuk; Frank Hannig; Jürgen Teich; Harald Köstler; Ulrich Rüde; Christian Lengauer

As HPC systems are becoming increasingly heterogeneous and diverse, writing software that attains maximum performance and scalability while remaining portable as well as easily composable is getting more and more challenging. Additionally, code that has been aggressively optimized for certain execution platforms is usually not easily portable to others without either losing a great share of performance or investing many hours by re-applying optimizations. One possible remedy is to exploit the potential given by technologies such as domain-specific languages (DSLs) that provide appropriate abstractions and allow the application of technologies like automatic code generation and auto-tuning. In the domain of geometric multigrid solvers, project ExaStencils follows this road by aiming at providing highly optimized and scalable numerical solvers, specifically tuned for a given application and target platform. Here, we introduce its DSL ExaSlang with data types for local vectors to support computations that use point-local vectors and matrices. These data types allow an intuitive modeling of many physical problems represented by systems of partial differential equations (PDEs), e.g., the simulation of flows that include vector-valued velocities.


International Journal of Parallel, Emergent and Distributed Systems | 2013

Interactive particle dynamics using OpenCL and Kinect

Sebastian Kuckuk; Tobias Preclik; Harald Köstler

In this paper, we describe an interactive real-time simulation of granular, spherical particles which is able to run on a single workstation. The simulation is based on a discrete element method approach and fully implemented using Open Computing Language, enabling execution on CPUs and GPUs alike. The simulation results are visualised using DirectX 10 and instancing. Furthermore, we enable the user to control the visualisation and the simulation in a very intuitive way by supporting user tracking and speech recognition, both using the Microsoft Kinect sensor. We also compare the performance of different implementation strategies on both CPUs and GPUs, and, as a sample application, we simulate the Brazil nut effect.


Concurrency and Computation: Practice and Experience | 2017

Towards generating efficient flow solvers with the ExaStencils approach

Sebastian Kuckuk; Gundolf Haase; Diego A. Vasco; Harald Köstler

ExaStencils aims at providing intuitive interfaces for the specification of numerical problems and resulting solvers, particularly those from the class of (geometric) multigrid methods. It envisions a multi‐layered domain‐specific language and a sophisticated code generation framework ultimately emitting source code in a chosen target language. We present our recent advances in fully generating solvers applied to 3D fluid mechanics for nonisothermal/non‐Newtonian flows. In detail, a system of time‐dependent, nonlinear partial differential equations is discretized on a cubic, nonuniform, and staggered grid using finite volumes. We examine the contained problem of coupled Navier‐Stokes and temperature equations, which are linearized and solved using the SIMPLE algorithm and geometric multigrid solvers, as well as the incorporation of non‐Newtonian properties. Furthermore, we provide details on necessary extensions to our domain‐specific language and code generation framework, in particular, those concerning the handling of boundary conditions, support for nonequidistant staggered grids, and supplying specialized functions to express operations reoccurring in the scope of finite volume discretizations. Many of these enhancements are generalizable and thus suitable for utilization in similar projects. Lastly, we demonstrate the applicability of our code generation approach by providing convincing performance results for fully generated and automatically parallelized solvers.


international conference on algorithms and architectures for parallel processing | 2016

Redundancy Elimination in the ExaStencils Code Generator

Stefan Kronawitter; Sebastian Kuckuk; Christian Lengauer

Optimizing the performance of compute-bound codes requires, among other techniques, the elimination of redundant computations. The well-known concept of common subexpression elimination can achieve this in parts, and almost every production compiler conducts such an optimization. However, due to the conservative nature of these compilers, an external redundancy elimination can additionally increase the performance. For stencil codes using finite volume discretizations, an extension to eliminate redundancies between loop iterations is also very promising. We integrated both a classic common subexpression elimination and an extended version in the Exastencils code generator and present their impact on a real-world application.


parallel computing | 2013

A Generic Prototype to Benchmark Algorithms and Data Structures for Hierarchical Hybrid Grids.

Sebastian Kuckuk; Björn Gmeiner; Harald Köstler; Ulrich Rüde

Collaboration


Dive into the Sebastian Kuckuk's collaboration.

Top Co-Authors

Avatar

Harald Köstler

University of Erlangen-Nuremberg

View shared research outputs
Top Co-Authors

Avatar

Christian Schmitt

University of Erlangen-Nuremberg

View shared research outputs
Top Co-Authors

Avatar

Frank Hannig

University of Erlangen-Nuremberg

View shared research outputs
Top Co-Authors

Avatar

Jürgen Teich

University of Erlangen-Nuremberg

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ulrich Rüde

University of Erlangen-Nuremberg

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge