Baptiste Lepers
École Polytechnique Fédérale de Lausanne
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Baptiste Lepers.
architectural support for programming languages and operating systems | 2013
Mohammad Dashti; Alexandra Fedorova; Justin R. Funston; Fabien Gaud; Renaud Lachaize; Baptiste Lepers; Vivien Quéma; Mark Roth
NUMA systems are characterized by Non-Uniform Memory Access times, where accessing data in a remote node takes longer than a local access. NUMA hardware has been built since the late 80s, and the operating systems designed for it were optimized for access locality. They co-located memory pages with the threads that accessed them, so as to avoid the cost of remote accesses. Contrary to older systems, modern NUMA hardware has much smaller remote wire delays, and so remote access costs per se are not the main concern for performance, as we discovered in this work. Instead, congestion on memory controllers and interconnects, caused by memory traffic from data-intensive applications, hurts performance a lot more. Because of that, memory placement algorithms must be redesigned to target traffic congestion. This requires an arsenal of techniques that go beyond optimizing locality. In this paper we describe Carrefour, an algorithm that addresses this goal. We implemented Carrefour in Linux and obtained performance improvements of up to 3.6 relative to the default kernel, as well as significant improvements compared to NUMA-aware patchsets available for Linux. Carrefour never hurts performance by more than 4% when memory placement cannot be improved. We present the design of Carrefour, the challenges of implementing it on modern hardware, and draw insights about hardware support that would help optimize system software on future NUMA systems.
ACM Queue | 2015
Fabien Gaud; Baptiste Lepers; Justin R. Funston; Mohammad Dashti; Alexandra Fedorova; Vivien Quéma; Renaud Lachaize; Mark Roth
Modern server-class systems are typically built as several multicore chips put together in a single system. Each chip has a local DRAM (dynamic random-access memory) module; together they are referred to as a node. Nodes are connected via a high-speed interconnect, and the system is fully coherent. This means that, transparently to the programmer, a core can issue requests to its node’s local memory as well as to the memories of other nodes. The key distinction is that remote requests will take longer, because they are subject to longer wire delays and may have to jump several hops as they traverse the interconnect. The latency of memory-access times is hence non-uniform, because it depends on where the request originates and where it is destined to go. Such systems are referred to as NUMA (non-uniform memory access).
Proceedings of the 16th Workshop on Hot Topics in Operating Systems | 2017
Baptiste Lepers; Willy Zwaenepoel; Jean-Pierre Lozi; Nicolas Palix; Redha Gouicem; Julien Sopena; Julia L. Lawall; Gilles Muller
Operating systems have been shown to waste machine resources by leaving cores idle while work is ready to be scheduled. This results in suboptimal performance for user applications, and wasted power. Recent progress in formal verification methods have led to operating systems being proven safe, but operating systems have yet to be proven free of performance bottlenecks. In this paper we instigate the first effort in proving performance properties of operating systems by designing a multicore scheduler that is proven to be work-conserving.
Proceedings of the 18th ACM/IFIP/USENIX Middleware Conference on Posters and Demos | 2017
Mathieu Bacou; Grégoire Todeschi; Alain Tchana; Daniel Hagimont; Baptiste Lepers; Willy Zwaenepoel
A lot of research work is put into reducing the power consumption of datacenters. Some works seek to improve the power efficiency of one server, but as of now the ideal state of power proportionality is only approached with a higher workload. Meanwhile, modern datacenters make use of the virtualization: the actual workload is materialized by virtual machines (VM). Thus, the problem shifts to consolidating VMs in order to increase the workload of a server. Roughly described, VM consolidation systems try to place as many VMs as possible, on as few servers as possible. Then, a server either gets a higher workload, bringing it closer to a state of energy proportionality; or hosts no VM and can be brought to sleep. However, industrial solutions are not very efficient, and in practice the number of VMs per server is increased by only 10%, with an effective workload rarely over 50% [3].
usenix annual technical conference | 2012
Renaud Lachaize; Baptiste Lepers; Vivien Quéma
usenix annual technical conference | 2015
Baptiste Lepers; Vivien Quéma; Alexandra Fedorova
usenix annual technical conference | 2014
Fabien Gaud; Baptiste Lepers; Jérémie Decouchant; Justin R. Funston; Alexandra Fedorova; Vivien Quéma
european conference on computer systems | 2016
Jean-Pierre Lozi; Baptiste Lepers; Justin R. Funston; Fabien Gaud; Vivien Quéma; Alexandra Fedorova
usenix annual technical conference | 2017
Jasmina Malicevic; Baptiste Lepers; Willy Zwaenepoel
Archive | 2011
Fabien Gaud; Renaud Lachaize; Baptiste Lepers; Gilles Muller; Vivien Quéma
Collaboration
Dive into the Baptiste Lepers's collaboration.
French Institute for Research in Computer Science and Automation
View shared research outputs