Rucio - S3-compatible access interface for analysis facilities

Description

Rucio is an open-source software framework that provides functionality to scientific collaborations to organize, manage, monitor, and access their distributed data and dataflows across heterogeneous infrastructures. Rucio was originally developed to meet the requirements of the high-energy physics experiment ATLAS, and is continuously enhanced to support diverse scientific communities. Since 2016 Rucio orchestrated multiple Exabytes of data access and data transfers globally.

With this project we seek to enhance Rucio to support a new mechanism for analysis facilities, which are oriented towards object stores in order to provide a portable destination for HEP analyzers to store data products produced in their research in a portable, shareable and standardized way.

Task ideas and expected results

As a user I want to

As a user I want to have

By the end of GSoC’22 we expect that the student has developed the necessary changes in Rucio including unit tests and successfully demonstrated the above mentioned use cases.

As a stretch goal, performance optimisation for high throughput Exascale data management is of course very appreciated.

Evaluation Task

Interested students please contact Mario, Martin, and Lukas directly for the evaluation.

Requirements

Mentors

  1. Rucio GitHub
  2. Rucio Documentation
  3. Rucio system overview journal article (Springer)
  4. Rucio operational experience article (IEEE Computer Society)

Additional Information

Corresponding Project

Participating Organizations