Jouni Riihimäki | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Jouni Riihimäki is active.

Explore More

Publication

Featured researches published by Jouni Riihimäki.

ACM Transactions in Embedded Computing Systems | 2006

UML-based multiprocessor SoC design framework

Tero Kangas; Petri Kukkala; Heikki Orsila; Erno Salminen; Marko Hännikäinen; Timo D. Hämäläinen; Jouni Riihimäki; Kimmo Kuusilinna

This paper describes a complete design flow for multiprocessor systems-on-chips (SoCs) covering the design phases from system-level modeling to FPGA prototyping. The design of complex heterogeneous systems is enabled by raising the abstraction level and providing several system-level design automation tools. The system is modeled in a UML design environment following a new UML profile that specifies the practices for orthogonal application and architecture modeling. The design flow tools are governed in a single framework that combines the subtools into a seamless flow and visualizes the design process. Novel features also include an automated architecture exploration based on the system models in UML, as well as the automatic back and forward annotation of information in the design flow. The architecture exploration is based on the global optimization of systems that are composed of subsystems, which are then locally optimized for their particular purposes. As a result, the design flow produces an optimized component allocation, task mapping, and scheduling for the described application. In addition, it implements the entire system for FPGA prototyping board. As a case study, the design flow is utilized in the integration of state-of-the-art technology approaches, including a wireless terminal architecture, a network-on-chip, and multiprocessing utilizing RTOS in a SoC. In this study, a central part of a WLAN terminal is modeled, verified, optimized, and prototyped with the presented framework.

design, automation, and test in europe | 2005

UML 2.0 Profile for Embedded System Design

Petri Kukkala; Jouni Riihimäki; Marko Hännikäinen; Timo D. Hämäläinen; Klaus Kronlöf

The unified modeling language (UML) 2.0 is emerging in the area of embedded system design. This paper presents a new UML 2.0 profile - called TUT-profile - that introduces a set of stereotypes and design rules for an application, platform, and mapping. The profile classifies different application and platform components, and enables their parameterization. The TUT-profile concentrates on the structure of an application and platform, and utilizes standard UML 2.0 for the behavioral modeling. The application is seen as a set of active classes with an internal behavior. Correspondingly, the platform is seen as a component library with a parameterized presentation in UML 2.0 for each library component.

signal processing systems | 2006

HIBI Communication Network for System-on-Chip

Erno Salminen; Tero Kangas; Timo D. Hämäläinen; Jouni Riihimäki; Vesa Lahtinen; Kimmo Kuusilinna

This paper presents a communication network targeted for complex system-on-chip (SoC) and network-on-chip (NoC) designs. The Heterogeneous IP Block Interconnection (HIBI) aims at maximum efficiency and minimum energy per transmitted bit combined with quality-of-service (QoS) in transfers. Other features include support for hierarchical topologies with several clock domains, flexible scalability, and runtime reconfiguration of network parameters. HIBI is intended for integrating coarse-grain components such as intellectual property (IP) blocks that have size of thousands of gates.HIBI has been implemented in VHDL and SystemC and synthesized on several CMOS technologies and on FPGA. A 32-bit wrapper requires 5400 gates and runs with 315 MHz on 0.18 μ m technology which shows that only minimal area overhead is paid for the advanced features. The area and frequency results are well comparable to other NoC proposals.Furthermore, data transfers are shown to approach the maximum theoretical performance for protocol efficiency. HIBI network is accompanied with a design framework with tools for optimizing the system through automated design space exploration.

international conference / workshop on embedded computer systems: architectures, modeling and simulation | 2004

HIBI v.2 Communication Network for System-on-Chip

Erno Salminen; Vesa Lahtinen; Tero Kangas; Jouni Riihimäki; Kimmo Kuusilinna; Timo D. Hämäläinen

This paper presents a communication network targeted for complex system-on-chip (SoC) and network-on-chip (NoC) designs. The Heterogeneous IP Block Interconnection v.2 (HIBI) aims at maximum efficiency and energy saving per transmitted bit combined with guaranteed quality-of-service (QoS) in transfers. Other features include support for arbitrary topologies with several clock domains, flexible scalablility in signalling and run-time reconfiguration of network parameters. HIBI has been implemented in VHDL and SystemC and synthesized in 0.18 CMOS technology with area comparable to other NoC wrappers. HIBI data transfers are shown to approach the maximum theoretical performance for protocol efficiency.

Journal of Systems Architecture | 2007

Benchmarking mesh and hierarchical bus networks in system-on-chip context

Erno Salminen; Tero Kangas; Vesa Lahtinen; Jouni Riihimäki; Kimmo Kuusilinna; Timo D. Hämäläinen

The performance and area of a System-on-Chip depend on the utilized communication method. This paper presents simulation-based comparison of generic, synthesizable single bus, hierarchical bus, and 2-dimensional mesh on-chip networks. Performance of the network depends heavily on the application and therefore six test cases with multiple parameter values are used. Furthermore, two versions of each network topology are compared. The results show that hierarchical bus scales well to large number of agents and offers a good performance and area trade-off although it has smaller aggregate bandwidth and area than mesh. Hierarchical HIBI bus achieves runtimes comparable to 2-dimensional cut-through mesh with about 50% smaller network logic. However, depending on the test case, the runtime can be reduced by 20-50% when wider bus links are utilized.

international symposium on system-on-chip | 2003

Using a communication generator in SoC architecture exploration

Tero Kangas; Jouni Riihimäki; Erno Salminen; Kimmo Kuusilinna; Timo D. Hämäläinen

This paper presents an implementation of a communication generator, named transaction generator. It is utilized in communication-centric SoC architecture exploration where the objective is to find the optimal hardware allocation, task partitioning, and scheduling with a given application model and architectural requirements. An application is abstracted with a process network model of computation and architecture is described with characteristic metrics. In addition to accelerating architecture exploration, transaction generator can be used in development, verification and comparison of on-chip communication networks.

norchip | 2005

Requirements for network-on-chip benchmarking

Erno Salminen; Tero Kangas; Jouni Riihimäki; Timo D. Hämäläinen

This work presents the motivation, basic concepts, and requirements for benchmarking a network-on-chip (NoC). Currently there is practically no benchmark sets for NoC or the presented tools do not meet the requirements. The presented benchmarking method utilizes traffic generator with a dataflow models of the applications. Combined with transaction-level NoC, the abstract application model allows approximately 200/spl times/ speedup and on average 10% error in estimated runtime w.r.t. cycle-accurate HW/SW cosimulation without exposing the exact internal functionality of the application.

international symposium on circuits and systems | 2002

Parameter optimization tool for enhancing on-chip network performance

Jouni Riihimäki; Erno Salminen; Kimmo Kuusilinna; Timo D. Hämäläinen

In this paper, we present a tool to be used in the optimization of interconnection parameters in order to achieve optimal performance and implementation with minimal costs. The optimization tool uses an iterative algorithm to optimize the interconnection parameters, such as data width, priorities, and the time an agent can reserve the interconnection, to fulfill the given constraints. In the used test case, the required area decreased 50% while 85% of the original bandwidth was obtained. This was due to an improved arbitration process.

international conference / workshop on embedded computer systems: architectures, modeling and simulation | 2004

A communication-centric design flow for HIBI-based SoCs

Tero Kangas; Jouni Riihimäki; Erno Salminen; Vesa Lahtinen; Heikki Orsila; Kimmo Kuusilinna; Timo D. Hämäläinen

This paper describes a design flow for Systems-on-Chip(SoCs) utilizing a previously presented HIBI communication network. The system designer is assisted with an automated two-level architecture exploration that optimizes the component allocation, task mapping and scheduling with static application analyses, and dynamic simulations. The utilization of a system-level model of computation enables fast analysis of the design and facilitates automated architecture exploration. Communication design is in a key role in the design flow since it is a critical part of contemporary SoCs. The platform of the design flow is based on the HIBI communication network that is easily scalable and parameterizable for a variety of communication requirements. As a result, the design flow selects the computational component from library, HIBI network instance and application mapping that optimizes the result of cost function. The designer assists the flow by defining the cost function and optimization control parameters as well as giving the architecture and mapping constraints.

digital systems design | 2003

Distributing SoC simulations over a network of computers

Jouni Riihimäki; Väinö Helminen; Kimmo Kuusilinna; Timo D. Hämäläinen

This paper presents parallelization of System-on-Chip (SoC) simulation over a network of computers. A custom C-language-based SoC exploration and simulation tool, Discrete Time Network Simulator (DTNS), is used to examine the problem. Parallelization is implemented with either CORBA or TCP/IP sockets. The distributed DTNS architecture facilitates the analysis of computation requirements for reasonable distribution. The minimum execution time per distributed process for the parallelization to be profitable is 1.2ms with CORBA and 0.4ms with TCP/IP implementation. These results are based on our networked PCs running the Linux operating system. The same network is used to evaluate this distribution method in case of video encoder SoC simulation.

Explore More