Hanno Scharwaechter
RWTH Aachen University
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Hanno Scharwaechter.
cryptographic hardware and embedded systems | 2009
David Kammler; Diandian Zhang; Peter Schwabe; Hanno Scharwaechter; Markus Langenberg; Dominik Auras; Gerd Ascheid; Rudolf Mathar
This paper presents a design-space exploration of an application-specific instruction-set processor (ASIP) for the computation of various cryptographic pairings over Barreto-Naehrig curves (BN curves). Cryptographic pairings are based on elliptic curves over finite fields--in the case of BN curves a field
design, automation, and test in europe | 2004
Manuel Hohenauer; Hanno Scharwaechter; Kingshuk Karuri; Oliver Wahlen; Tim Kogel; Rainer Leupers; Gerd Ascheid; Heinrich Meyr; Gunnar Braun; Hans van Someren
\mathbb{F}_p
international conference on hardware/software codesign and system synthesis | 2007
Hanno Scharwaechter; Jonghee M. Youn; Rainer Leupers; Yunheung Paek; Gerd Ascheid; Heinrich Meyr
of large prime order p . Efficient arithmetic in these fields is crucial for fast computation of pairings. Moreover, computation of cryptographic pairings is much more complex than elliptic-curve cryptography (ECC) in general. Therefore, we facilitate programming of the proposed ASIP by providing a C compiler. In order to speed up
ACM Transactions in Embedded Computing Systems | 2007
Hanno Scharwaechter; David Kammler; Andreas Wieferink; Manuel Hohenauer; Kingshuk Karuri; Jianjiang Ceng; Rainer Leupers; Gerd Ascheid; Heinrich Meyr
\mathbb{F}_p
application-specific systems, architectures, and processors | 2005
Mohammad Mostafizur; Rahman Mozumdar; Kingshuk Karuri; Anupam Chattopadhyay; Stefan Kraemer; Hanno Scharwaechter; Heinrich Meyr; Gerd Ascheid; Rainer Leupers
arithmetic, a RISC core is extended with additional scalable functional units. Because the resulting speedup can be limited by the memory throughput, utilization of multiple data-memory banks is proposed. The presented design needs 15.8 ms for the computation of the Optimal-Ate pairing over a 256-bit BN curve at 338 MHz implemented with a 130 nm standard cell library. The processor core consumes 97 kGates making it suitable for the use in embedded systems.
Design Automation for Embedded Systems | 2011
Hanno Scharwaechter; David Kammler; Rainer Leupers; Gerd Ascheid; Heinrich Meyr
Retargetable C compilers are key tools for efficient architecture exploration for embedded processors. In this paper we describe a novel approach to retargetable compilation based on LISA, an industrial processor modeling language for efficient ASIP design. In order to circumvent the well-known trade-off between flexibility and code quality in retargetable compilation, we propose a user-guided, semiautomatic methodology that in turn builds on a powerful existing C compiler design platform. Our approach allows to include generated C compilers into the ASIP architecture exploration loop at an early stage, thereby allowing for a more efficient design process and avoiding application/architecture mismatches. We present the corresponding methodology and tool suite and provide experimental data for two real-life embedded processors that prove the feasibility of the approach.
software and compilers for embedded systems | 2004
Hanno Scharwaechter; David Kammler; Andreas Wieferink; Manuel Hohenauer; Kingshuk Karuri; Jianjiang Ceng; Rainer Leupers; Gerd Ascheid; Heinrich Meyr
We address the problem of instruction selection for Multi-Output Instructions (MOIs), producing more than one result. Such inherently parallel hardware instructions are very common in the area of Application Specific Instruction Set Processors (ASIPs) and Digital Signal Processors (DSPs) which are frequently used in System-on-Chips as programmable cores. In order to provide high-level programmability, and consequently guarantee widespread acceptance, sophisticated compiler support for these programmable cores is of high importance. Since it is not possible to model MultiOutput Instructions as trees in the compilers Intermediate Representation (IR), traditional approaches for code selection are not sufficient. Extending traditional code-generation approaches for MOI-selection is essentially a graph covering problem, which is known to be NP-complete. We present a new heuristic algorithm incorporated in a retargetable code-generator generator capable of exploiting arbitrary inherently parallel MOIs. We prove the concept by integrating the tool into the LCC compiler which has been targeted towards different Instruction Set Architectures based on the MIPS architecture. Several network applications as well as some DSP benchmarks were compiled and evaluated to obtain results.
Software - Practice and Experience | 2011
Jonghee M. Youn; Jongwon Lee; Yunheung Paek; Jongeun Lee; Hanno Scharwaechter; Rainer Leupers
Application-Specific Instruction-Set Processors (ASIPs) are becoming increasingly popular in the world of customized, application-driven System-on-Chip (SoC) designs. Efficient ASIP design requires an iterative architecture exploration loop---gradual refinement of the processor architecture starting from an initial template. To accomplish this task, design automation tools are used to detect bottlenecks in embedded applications, to implement application-specific processor instructions, and to automatically generate the required software tools (such as instruction-set simulator, C-compiler, assembler, and profiler), as well as to synthesize the hardware. This paper describes an architecture exploration loop for an ASIP coprocessor that implements common encryption functionality used in symmetric block cipher algorithms for internet protocol security (IPSec). The coprocessor is accessed via shared memory and, as a consequence, our approach is easily adaptable to arbitrary main processor architectures. This paper presents the extended version of our case study that has been already published on the SCOPES conference in 2004. In both papers, a MIPS architecture is used as the main processor and Blowfish as encryption algorithm.
design, automation, and test in europe | 2006
Hanno Scharwaechter; Manuel Hohenauer; Rainer Leupers; Gerd Ascheid; Heinrich Meyr
The growth of the Internet in the last decade has made current networking applications immensely complex. Systems running such applications need special architectural support to meet the tight constraints of power and performance. This paper presents a case study of architecture exploration and optimization of an application specific instruction set processor (ASIP) for networking applications. The case study particularly focuses on the effects of instruction set customization for applications from different layers of the protocol stack. Using a state-of-the-art VLIW processor as the starting template, and architecture description language (ADL) based architecture exploration tools, this case study suggests possible instruction set and architectural modifications that can speed-up some networking applications up to 6.8 times. Moreover, this paper also shows that there exist very few similarities between diverse networking applications. Our results suggest that, it is extremely difficult to have a common set of architectural features for efficient network protocol processing and, ASIPs with specialized instruction sets can become viable solutions for such an application domain.
IEE Proceedings - Computers and Digital Techniques | 2005
Rainer Leupers; Manuel Hohenauer; Jianjiang Ceng; Hanno Scharwaechter; Heinrich Meyr; Gerd Ascheid; Gunnar Braun
Compiler-in-the-Loop (CiL) architecture exploration is widely accepted as being the right track for fast development of Application Specific Instruction-set Processors (ASIP). In this context, both, automatic application-specific Instruction Set Extension (ISE) and code generation by a compiler have received huge attention in the past. Together, both techniques enable processor designers to quickly adapt a processor’s Instruction Set Architecture (ISA) to the needs of a certain set of applications and to provide an appropriate high-level programming model. This manuscript presents a tool flow for identification and utilization of Custom Instructions (CIs) during architecture exploration in an automated fashion. By embedding this tool flow in an industry-proven architecture exploration framework, a methodology for simultaneous compiler/architecture co-exploration is derived. The advantage of the presented tool flow lies in its ability to develop a reusable ISA and an appropriate compiler for a set of applications and therefore to support the design of programmable architectures. In addition, ASIP architecture exploration is effectively improved since time consuming application analysis and compiler retargeting is automated. Through compilation and simulation of several benchmarks in accordance to extended ISAs, reliable feedback on speedup, code size and usability of identified CIs is provided. Furthermore, results on area consumption for extended ISAs are presented in order to compare the obtained speedup with the invested hardware effort of new CIs.