Particle physicists studying nature at highest energy scales at the Large Hadron Collider rely on simulations and data processing for their experiments.
These workloads run on the “computing grid”, a massive globally distributed computing infrastructure.
Deploying software efficiently and reliable to this grid is an important and challenging task.
CVMFS is an optimised shared file system developed specifically for this purpose: It is implemented as a POSIX read-only file system in user space (a FUSE module).
Files and directories are hosted on standard web servers and mounted in the universal namespace
In many cases, it replaces package managers and shared software areas on cluster file systems as means to distribute the software used to process experiment data.
As a FUSE file system, CVMFS critically depends on the performance and capabilities of libfuse. Since CVMFS was first created, several new features were added to libfuse - in particular FUSE_CAP_SPLICE_MOVE could potentially improve performance of CVMFS. In this project, the student should investigate the use of splice+move instead of copy and other new features of libfuse as well as set up and run a complete benchmark of relevant workloads.