Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ernst Gunnar Gran is active.

Publication


Featured researches published by Ernst Gunnar Gran.


international parallel and distributed processing symposium | 2010

First experiences with congestion control in InfiniBand hardware

Ernst Gunnar Gran; Magne Eimot; Sven-Arne Reinemo; Tor Skeie; Olav Lysne; Lars Paul Huse; Gilad Shainer

In lossless interconnection networks congestion control (CC) can be an effective mechanism to achieve high performance and good utilization of network resources. Without CC, congestion in one node may grow into a congestion tree that can degrade the performance severely. This degradation can affect not only contributors to the congestion, but also throttles innocent traffic flows in the network. The InfiniBand standard describes CC functionality for detecting and resolving congestion. The InfiniBand CC concept is rich in the way that it specifies a set of parameters that can be tuned in order to achieve effective CC. There is, however, limited experience with the InfiniBand CC mechanism. To the best of our knowledge, only a few simulation studies exist. Recently, InfiniBand CC has been implemented in hardware, and in this paper we present the first experiences with such equipment. We show that the implemented InfiniBand CC mechanism effectively resolves congestion and improves fairness by solving the parking lot problem, if the CC parameters are appropriately set. By conducting extensive testing on a selection of the CC parameters, we have explored the parameter space and found a subset of parameter values that leads to efficient CC for our test scenarios. Furthermore, we show that the InfiniBand CC increases the performance of the well known HPC Challenge benchmark in a congested network.


Computer Networks | 2014

NorNet Core - A multi-homed research testbed

Ernst Gunnar Gran; Thomas Dreibholz; Amund Kvalbein

Over the last decade, the Internet has grown at a tremendous speed in both size and complexity. Nowadays, a large number of important services - for instance e-commerce, healthcare and many others - depend on the availability of the underlying network. Clearly, service interruptions due to network problems may have a severe impact. On the long way towards the Future Internet, the complexity will grow even further. Therefore, new ideas and concepts must be evaluated thoroughly, and particularly in realistic, real-world Internet scenarios, before they can be deployed for production networks. For this purpose, various testbeds - for instance PlanetLab, GpENI or G-Lab - have been established and are intensively used for research. However, all of these testbeds lack the support for so-called multi-homing.Multi-homing denotes the connection of a site to multiple Internet service providers, in order to achieve redundancy. Clearly, with the need for network availability, there is a steadily growing demand for multi-homing. The idea of the NorNet Core project is to establish a Future Internet research testbed with multi-homed sites, in order to allow researchers to perform experiments with multi-homed systems. Particular use cases for this testbed include realistic experiments in the areas of multi-path routing, load balancing, multi-path transport protocols, overlay networks and network resilience. In this paper, we introduce the NorNet Core testbed as well as its architecture.


ieee/acm international symposium cluster, cloud and grid computing | 2011

On the Relation between Congestion Control, Switch Arbitration and Fairness

Ernst Gunnar Gran; Eitan Zahavi; Sven-Arne Reinemo; Tor Skeie; Gilad Shainer; Olav Lysne

In loss less interconnection networks such as InfiniBand, congestion control (CC) can be an effective mechanism to achieve high performance and good utilization of network resources. The InfiniBand standard describes CC functionality for detecting and resolving congestion, but the design decisions on how to implement this functionallity is left to the hardware designer. One must be cautious when making these design decisions not to introduce fairness problems, as our study shows. In this paper we study the relationship between congestion control, switch arbitration, and fairness. Specifically, we look at fairness among different traffic flows arriving at a hot spot switch on different input ports, as CC is turned on. In addition we study the fairness among traffic flows at a switch where some flows are exclusive users of their input ports while other flows are sharing an input port (the parking lot problem). Our results show that the implementation of congestion control in a switch is vulnerable to unfairness if care is not taken. In detail, we found that a threshold hysteresis of more than one MTU is needed to resolve arbitration unfairness. Furthermore, to fully solve the parking lot problem, proper configuration of the CC parameters are required.


advanced information networking and applications | 2013

Design and Implementation of the NORNET CORE Research Testbed for Multi-homed Systems

Thomas Dreibholz; Ernst Gunnar Gran

The Internet has made it possible to communicate and to use services over large geographical distances. While it has originally been built for less critical services like e-mail and file transfer, it is nowadays also increasingly often used for availability-critical services like e.g. e-commerce or healthcare. Clearly, the reach ability of such services must be ensured by so-called multi-homing of endpoints. That is, endpoints are simultaneously connected to multiple Internet Service Providers (ISP) to provide redundancy. If one ISP has problems, it is intended that the connection to another one still works. However, such assumptions have never been verified in real, large-scale setups. The intention of the Nor Net project is to build up a realistic Internet test bed for multi-homing. In this paper, we describe the design of Nor Net with focus on the implementation of its fixed-line part: \noun{Nor Net Core}. This paper is intended to give researchers an overview of its mode of operation, its capabilities as well as its interesting feature realisations. The knowledge about these items is very useful to plan own experiments in the Nor Net test bed.


IEEE Transactions on Parallel and Distributed Systems | 2015

Efficient and Cost-Effective Hybrid Congestion Control for HPC Interconnection Networks

Jesús Escudero-Sahuquillo; Ernst Gunnar Gran; Pedro Javier García; Jose Flich; Tor Skeie; Olav Lysne; Francisco J. Quiles; José Duato

Interconnection networks are key components in high-performance computing (HPC) systems, their performance having a strong influence on the overall system one. However, at high load, congestion and its negative effects (e.g., Head-of-line blocking) threaten the performance of the network, and so the one of the entire system. Congestion control (CC) is crucial to ensure an efficient utilization of the interconnection network during congestion situations. As one major trend is to reduce the effective wiring in interconnection networks to reduce cost and power consumption, the network will operate very close to its capacity. Thus, congestion control becomes essential. Existing CC techniques can be divided into two general approaches. One is to throttle traffic injection at the sources that contribute to congestion, and the other is to isolate the congested traffic in specially designated resources. However, both approaches have different, but non-overlapping weaknesses: injection throttling techniques have a slow reaction against congestion, while isolating traffic in special resources may lead the system to run out of those resources. In this paper we propose EcoCC, a new Efficient and Cost-Effective CC technique, that combines injection throttling and congested-flow isolation to minimize their respective drawbacks and maximize overall system performance. This new strategy is suitable for current commercial switch architectures, where it could be implemented without requiring significant complexity. Experimental results, using simulations under synthetic and real trace-based traffic patterns, show that this technique improves by up to 55 percent over some of the most successful congestion control techniques.


international parallel and distributed processing symposium | 2012

Exploring the Scope of the InfiniBand Congestion Control Mechanism

Ernst Gunnar Gran; Sven-Arne Reinemo; Olav Lysne; Tor Skeie; Eitan Zahavi; Gilad Shainer

In a loss less interconnection network, network congestion needs to be detected and resolved to ensure high performance and good utilization of network resources at high network load. If no countermeasure is taken, congestion at a node in the network will stimulate the growth of a congestion tree that not only affects contributors to congestion, but also other traffic flows in the network. Left untouched, the congestion tree will block traffic flows, lead to underutilization of network resources and result in a severe drop in network performance. The InfiniBand standard specifies a congestion control (CC) mechanism to detect and resolve congestion before a congestion tree is able to grow and, by that, hamper the network performance. The InfiniBand CC mechanism includes a rich set of parameters that can be tuned in order to achieve effective CC. Even though it has been shown that the CC mechanism, properly tuned, is able to improve both throughput and fairness in an interconnection network, it has been questioned whether the mechanism is fast enough to keep up with dynamic network traffic, and if a given set of parameter values for a topology is robust when it comes to different traffic patterns, or if the parameters need to be tuned depending on the applications in use. In this paper we address both these questions. Using the three-stage fat-tree topology from the Sun Data center InfiniBand Switch 648 as a basis, and a simulator tuned against CC capable InfiniBand hardware, we conduct a systematic study of the efficiency of the InfiniBand CC mechanism as the network traffic becomes increasingly more dynamic. Our studies show that the InfiniBand CC, even when using a single set of parameter values, performs very well as the traffic patterns becomes increasingly more dynamic, outperforming a network without CC in all cases. Our results show throughput increases varying from a few percent, to a seventeen-fold increase.


high performance embedded architectures and compilers | 2012

sFtree: A fully connected and deadlock-free switch-to-switch routing algorithm for fat-trees

Bartosz Bogdanski; Sven-Arne Reinemo; Frank Olaf Sem-Jacobsen; Ernst Gunnar Gran

Existing fat-tree routing algorithms fully exploit the path diversity of a fat-tree topology in the context of compute node traffic, but they lack support for deadlock-free and fully connected switch-to-switch communication. Such support is crucial for efficient system management, for example, in InfiniBand (IB) systems. With the general increase in system management capabilities found in modern InfiniBand switches, the lack of deadlock-free switch-to-switch communication is a problem for fat-tree-based IB installations because management traffic might cause routing deadlocks that bring the whole system down. This lack of deadlock-free communication affects all system management and diagnostic tools using LID routing. In this paper, we propose the sFtree routing algorithm that guarantees deadlock-free and fully connected switch-to-switch communication in fat-trees while maintaining the properties of the current fat-tree algorithm. We prove that the algorithm is deadlock free and we implement it in OpenSM for evaluation. We evaluate the performance of the sFtree algorithm experimentally on a small cluster and we do a large-scale evaluation through simulations. The results confirm that the sFtree routing algorithm is deadlock-free and show that the impact of switch-to-switch management traffic on the end-node traffic is negligible.


parallel, distributed and network-based processing | 2015

A Weighted Fat-Tree Routing Algorithm for Efficient Load-Balancing in Infini Band Enterprise Clusters

Feroz Zahid; Ernst Gunnar Gran; Bartosz Bogdanski; Bjørn Dag Johnsen; Tor Skeie

Infini Band (IB) has become a popular network interconnect for high performance computing (HPC) systems. Many of the large IB-based HPC systems use some variant of the fat-tree topology to take advantage of the useful properties fat-trees offer. The fat-tree routing algorithm is one of the most efficient deterministic routing algorithms for fat-tree topologies. The algorithm ensures that the number of routes assigned to each link are balanced across the fabric. However, one problem with its load-balancing technique is that it assumes uniform traffic distribution in the network. When routes towards nodes that mainly consume large amount of data are assigned to share links in the fabric while alternative links are underutilized, sub-optimal network throughput is obtained. Also, as the fat tree algorithm routes nodes according to the indexing order, the performance may differ for two systems cabled in the exact same way. In this paper, we propose wFatTree, a novel fat-tree routing algorithm, which considers node traffic characteristics to balance load across the network links more evenly, and with predictable network performance. Our experiments and simulations show an improvement of up to 60% in total network throughput on large fat-tree installations when using wFatTree routing. Furthermore, wFatTree can also be used to prioritize traffic flowing towards the critical nodes in the network.


international conference on parallel processing | 2011

Combining Congested-Flow Isolation and Injection Throttling in HPC Interconnection Networks

Jesús Escudero-Sahuquillo; Ernst Gunnar Gran; Pedro Javier García; Jose Flich; Tor Skeie; Olav Lysne; Francisco J. Quiles; José Duato

Existing congestion control mechanisms in interconnects can be divided into two general approaches. One is to throttle traffic injection at the sources that contribute to congestion, and the other is to isolate the congested traffic in specially designated resources. These two approaches have different, but non-overlapping weaknesses. In this paper we present in detail a method that combines injection throttling and congested-flow isolation. Through simulation studies we first demonstrate the respective flaws of the injection throttling and of flow isolation. Thereafter we show that our combined method extracts the best of both approaches in the sense that it gives fast reaction to congestion, it is scalable and it has good fairness properties with respect to the congested flows.


ieee/acm international symposium cluster, cloud and grid computing | 2015

Partition-Aware Routing to Improve Network Isolation in Infiniband Based Multi-tenant Clusters

Feroz Zahid; Ernst Gunnar Gran; Bartosz Bogdanski; Bjørn Dag Johnsen; Tor Skeie

InfiniBand (IB) is a widely used network interconnect for modern high-performance computing systems. In large IB fabrics, isolation of nodes is provided through partitioning. The routing algorithm, however, is unaware of these partitions in the network, Traffic flows belonging to different partitions might share links inside the network fabric. This sharing of intermediate links creates interference, which is particularly critical to avoid in multi-tenant environments like a cloud. In such systems, each tenant should experience predictable network performance, unaffected by the workload of other tenants. In addition, using current routing schemes, routes crossing partition boundaries are considered when distributing routes onto links in the network, despite the fact that these routes will never be used. The result is degraded load-balancing. In this paper, we present a novel partition-aware fat-tree routing algorithm, pFTree. The pFTree algorithm utilizes several mechanisms to provide network-wide isolation of partitions belonging to different tenant groups. Given the available network resources, pFTree starts by isolating partitions at the physical link level, and then moves on to utilize virtual lanes, if needed. Our experiments and simulations show that pFTree is able to significantly reduce the affect of inter-partition interference without any additional functional overhead. Furthermore, pFTree also provides improved load-balancing over the de facto standard IB fat-tree routing algorithm.

Collaboration


Dive into the Ernst Gunnar Gran's collaboration.

Top Co-Authors

Avatar

Tor Skeie

Simula Research Laboratory

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Feroz Zahid

Simula Research Laboratory

View shared research outputs
Top Co-Authors

Avatar

Sven-Arne Reinemo

Simula Research Laboratory

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Olav Lysne

Simula Research Laboratory

View shared research outputs
Top Co-Authors

Avatar

Bartosz Bogdanski

Simula Research Laboratory

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Thomas Dreibholz

University of Duisburg-Essen

View shared research outputs
Top Co-Authors

Avatar

Amund Kvalbein

Simula Research Laboratory

View shared research outputs
Researchain Logo
Decentralizing Knowledge