CERN Google Summer of Code 2023

Introduction

Google Summer of Code is a program that allows students to contribute to the development of open-source projects, mentored by participating organizations.

Particle physics is an exciting field where large collaborations of scientists collect and analyze petabytes of data from high-energy physics experiments, such as those at the Large Hadron Collider, hosted at the CERN laboratory in Geneva, Switzerland. Some of the questions that we collectively ask are:

To answer these questions, particle physicists build software to simulate and analyze what happens in particle physics detectors.

The CERN software for experiments (CERN EP-SFT) group has participated in the GSoC since 2011. Since 2017 the program has expanded to involve the high-energy physics community under the umbrella of the HEP Software Foundation.

Information from last year’s GSoC can be found here.

GSoC 2023

In 2023 HSF is participating in the program as a GSoC umbrella organization, under the name CERN-HSF. The HSF project ideas are listed below. Please have a look at the general program rules and the timeline provided by Google.

The selection process in our organization will be split into two phases. All candidates will have to pass pre-selection evaluation tests prepared by mentors, demonstrating the skills needed for the respective projects. The successful candidates will have a detailed exchange with mentors on a given project idea, and their proposals are then evaluated for the final student selection. The detailed timeline of these phases is shown below.

For Students

If you are interested in the HSF projects, please start by having a look at our guidelines for students.

New projects for HSF GSoC in 2023 will be published by February 7. Meanwhile you can take a look at last year’s HSF GSoC projects since many of those may also propose subjects this year. The student application period will begin on March 20, but discussions about student project ideas with mentors will take place starting on February 22, when the list of accepted Organizations is announced. You are encouraged to monitor this website in order to get acquainted with our projects and be prepared for contacting their mentors. Due to the high number of applicants for CERN-HSF projects, the application period procedure become more formal this year, so please try to follow our guidance.

Please avoid having the first contact with the mentors of HSF projects before February 22 or after March 20! You are encouraged to take the time to read our project proposals until February 22, focusing on only one or two projects that attract your interest. Once you have identified those, you should e-mail the respective mentors, attaching your CV and describing the motivation for your choice. The mentors will first propose an evaluation test, waiting for your solution. Note that the test you are given is private, your solution should be personal and the response time is part of the evaluation. Mentors will let you know the test results, and if you passed they will start discussing and helping you develop a project idea. The student application period is March 27 - April 4, when you will have to write and review your proposal with help from your mentors.

For further information, feel free to contact the HSF GSoC admins or join our open chat channel below. Note that the channel is intended for exchanging with the admins, the other candidates and some of the CERN-HSF students from previous year, and we do not maintain a mailing list for candidates. Please avoid posting extended information about yourself in the chat channel and reserve this for the direct exchange with the mentors.

hsf-gsoc-admin@googlegroups.com

Gitter

Student pages

The CERN-HSF 2022 student blogs are now available. Check them out!

Student blogs are mandatory for receiving a positive evaluation in CERN-HSF starting with 2022 GSoC edition! If you are a CERN-HSF student, please read this blog.

For Projects and Mentors

Instructions for participating projects and mentors can be found here. The main differences compared to last year are that proposals have to be tuned for either 175-hour or 350h project length, and the target objectives must be well-defined deliverables.

We have a mentor-only Google group where we handle all announcements related to GSoC for our HSF Umbrella Organization. Please write us a mail to hsf-gsoc-admin@googlegroups.com if you want to add your preferred email to this list. Note that all participating mentors in 2022 HSF GSoC will have to be added to this group. Please let us know if for some reason your want your mail removed from this list.

For new HEP-related groups wishing to join HSF GSoC umbrella rather than being independent organizations: there is no formal procedure, you will just need to add a description and logo for your organization. Note that your project needs to have some connection with high-energy physics to join us. We need one contact person for each new group, so please write us an e-mail.

Timeline

Jan 19 Announcement of GSoC 2023 program in HSF
Jan 19 Call for 2023 HSF GSoC projects and mentors
Jan 19 - Feb 7 Mentors add project proposals available on the HSF GSoC website
Feb 7 Deadline for CERN-HSF proposals
Feb 22 Accepted Organizations announced by Google
Feb 22 - March 14

Phase 1 selection

Candidates have a first contact with project mentors.
March 14 Mentors announce to students the result of phase 1 selection
Mar 14 - March 27 Candidates discuss with mentors their project idea, timeline and objectives.
March 27 - Apr 4 Mentors help candidates improving their proposals, proposals are submitted
Apr 4 Student application deadline
Apr 4 - Apr 22

Phase 2 selection

Mentors evaluate and rank student proposals.
April 22 Rankings due for mentors, sent to HSF Org Admins
April 22 - April 27 Admins propose a range of slots to be asked, amended by the HSF committee.
April 27 Student slot requests due for Org Admins
May 4 Accepted student projects announced

Projects in 2023

Belle2

Belle II is a particle physics experiment that studies electron positron collision events produced by the the SuperKEKB accelerator at the KEK laboratory in Tsukuba, Japan. It is designed to collect 50 times more data than its predecessor to address open questions in fundamental physics with high precision measurements.

List of proposals
CERNBox

CERNBox is a cloud synchronisation service for end-users: it allows syncing and sharing files on all major mobile and desktop platforms (Linux, Windows, MacOSX, Android, iOS) aiming to provide offline availability to any data stored in the CERN EOS infrastructure.

List of proposals
CernVM-FS

The CernVM-File System (CVMFS) is a global, read-only POSIX file system that provides the universal namespace /cvmfs. It is based on content-addressable storage, Merkle trees, and HTTP data transport. CernVM-FS provides a mission critical infrastructure to small and large HEP collaborations.

List of proposals
Clad

Clad is an automatic differentiation (AD) tool for C++

List of proposals
Cppyy

Cppyy is an automatic, run-time, Python-C++ bindings generator, for calling C++ from Python and Python from C++.

List of proposals
DD4hep

A generic detector description framework. When referring to the detector description this includes, in addition to the geometry and the materials used in the device, also parameters describing e.g. the detection techniques, constants required for alignment and calibration, description of the readout structures and conditions data.

List of proposals
FASER

The FASER experiment is an LHC experiment located in a tunnel parallel to the LHC ring and is considerably smaller and low-budget compared to the titan LHC experiments such as ATLAS and CMS. The experiment seeks to detect new long-lived particles that have travelled half a kilometer from a LHC proton-proton collision site, escaping the major ATLAS experiment undetected. The only other proton collision “background” particles reaching FASER’s faraway location are muons and neutrinos. Evidence of a new particle, such as the hypothetical dark photon, may lead to answering our most fundamental questions of the Universe such as the origin of dark matter.

List of proposals
Ganga

Ganga is a computational task-management tool, which allows for the specification, submission, bookkeeping and post-processing of computational tasks on a wide set of distributed resources. Ganga has been developed to solve a problem increasingly common in scientific projects, which is that researchers must regularly switch between different processing systems, each with its own command set, to complete their computational tasks. Ganga provides a homogeneous environment for processing data on heterogeneous resources.

List of proposals
Geant4

Geant4 is a toolkit for the simulation of the passage of particles through matter.

List of proposals
HSF

The HEP Software Foundation encourages cooperation and common development of software in High Energy Physics. The HSF hosts a number of software projects where developers are working to solve current problems in particle physics.

List of proposals
MCnet

MCnet is a community linking the developers of particle-theory simulations with experimentalists and others who use and make tools for analysing the simulated events. Projects may be available from both aspects of the community.

List of proposals
MadAnalysis5

MadAnalysis 5 is a framework for phenomenological investigations at particle colliders. Based on a C++ kernel, this program allows to efficiently perform sophisticated physics analyses of event files such as those generated a large class of Monte Carlo event generators in a straightforward and user-friendly fashion.

List of proposals
Patatrack

The Patatrack project started in 2016 by a group of people with various area of expertise, such as software optimization, heterogeneous computing, track reconstruction and High Level Trigger (HLT) at the CMS experiment at CERN. The goal was to demonstrate that part of the HLT reconstruction could be efficiently offloaded on machines equipped with GPUs for parallel execution. Nowadays, Patatrack developments have been integrated into the CMS software for event reconstruction and the project focuses on the exploration of innovative software and hardware technologies to bring smart software closer to the detector read-out at CERN experiments.

List of proposals
Phoenix

Phoenix is an experiment-agnostic library for the visualisation of detector geometry and event data in a browser.

List of proposals
ROOT

A modular scientific software framework. It provides all the functionalities needed to deal with big data processing, statistical analysis, visualisation and storage. It is mainly written in C++ but integrated with other languages such as Python and R.

List of proposals
Rucio

Rucio is an open-source software framework that provides functionality to scientific collaborations to organize, manage, monitor, and access their distributed data and dataflows across heterogeneous infrastructures. Rucio was originally developed to meet the requirements of the high-energy physics experiment ATLAS, and is continuously enhanced to support diverse scientific communities.

List of proposals
SMARTHEP

SMARTHEP is an European Training Network involving LHC experiments and industry. We develop efficient software and machine learning algorithms on heterogeneous computing architectures to advance real-time analysis.

List of proposals
SWAN

SWAN is a CERN service that allows users to perform interactive data analysis in the cloud, built upon the widely-used Jupyter notebooks and CERN technologies for storage and software access.

List of proposals
iris_hep

IRIS-HEP is a software institute funded by the National Science Foundation. It aims to develop the state-of-the-art software cyberinfrastructure required for the challenges of data intensive scientific research at the High Luminosity Large Hadron Collider (HL-LHC) at CERN, and other planned HEP experiments of the 2020’s. These facilities are discovery machines which aim to understand the fundamental building blocks of nature and their interactions.

List of proposals

Participating Organizations in 2023

CEA-IRFU

The IRFU, Institute for Research on the fundamental laws of the universe, of the Fundamental Research Division of the CEA, brings together three scientific disciplines, astrophysics, nuclear physics and particle physics, as well as all the associated technological expertise.

List of proposals
CERN

At CERN, the European Organization for Nuclear Research, physicists and engineers are probing the fundamental structure of the universe. They use the world’s largest and most complex scientific instruments to study the basic constituents of matter – the fundamental particles.

List of proposals
CompRes

The Compiler Research Group is a group of programming language enthusiasts at Princeton University and CERN. It’s primary goal is to research foundational software tools that help scientists program for speed, interoperability, interactivity, flexibility, and reproducibility.

List of proposals
ImperialCollege

Imperial College London is a world top ten university with an international reputation for excellence in teaching and research. Consistently rated amongst the world’s best universities, Imperial is committed to developing the next generation of researchers, scientists and academics through collaboration across disciplines. Located in the heart of London, Imperial is a multidisciplinary space for education, research, translation and commercialisation, harnessing science and innovation to tackle global challenges.

List of proposals
IPPP

The Institute for Particle Physics Phenomenology at Durham University is UK’s national centre for particle phenomenology, researching the properties and behaviour of the most fundamental building blocks of nature. Since it has founded, IPPP has grown to become one of the largest particle phenomenology groups in the world. Our research sits at the interface between theoretical particle physics and experiments ranging from particle colliders to gravitational wave detectors.

List of proposals
iris-hep

IRIS-HEP is a software institute funded by the National Science Foundation. It aims to develop the state-of-the-art software cyberinfrastructure required for the challenges of data intensive scientific research at the High Luminosity Large Hadron Collider (HL-LHC) at CERN, and other planned HEP experiments of the 2020’s. These facilities are discovery machines which aim to understand the fundamental building blocks of nature and their interactions.

List of proposals
KIT

KIT KIT is “The Research University in the Helmholtz Association.” As one of the biggest science institutions in Europe, the only German University of Excellence with national large-scale research facilities combines a long university tradition with program-oriented top-level research.

List of proposals
LMU

LMU The Ludwig Maximilians University in Munich is one of the leading universities in Europe with a history going back over 500 years. It stands for a demanding academic education and outstanding research.

List of proposals
LPTHE

The Laboratory of Theoretical and High Energy Physics (LPTHE) is a Joint Research Unit (UMR 7589) of Sorbonne University and of the CNRS. It is affiliated with the CNRS Institute of Physics (INP). It comprises 19 CNRS researchers and 10 university professors.

List of proposals
MonashUniversity

Monash University Monash University is one of Australia’s leading universities and ranks among the world’s top 100. We help change lives through research and education.

List of proposals
princeton

Princeton University is a private Ivy League research university in Princeton, New Jersey.

List of proposals
UofGlasgow

The University of Glasgow is a leading UK research university based in Scotland’s largest city. We work on particle physics experiments from the LHC to neutrinos, particle theory from interpretations of Higgs and top-quark measurements to the strong force, and high-performance distributed computing.

List of proposals
LundU

Lund University is one of the largest universities in Sweden. In the Division of Particle Physics, we work on the construction, operations, maintenance and upgrades of our experimental installations. Our researchers are members of the ATLAS and ALICE Collaborations at CERN, but we also have a leading role in the LDMX experiment at SLAC. The necessity to process very large amounts of research data using modern computational methods lead to the emergence of e-Science, which refers to research and creation of dedicated scientific computer and software solutions, and we are among the leading contributors to e-Science solutions for Particle Physics..

List of proposals
UofManchester

The University of Manchester is a leading UK research university. We have a large particle physics group with contributions to LHC experiments, dark matter, flavour, neutrino and muon experiments. We also carry out research into new detector technologies and new data acquisition strategies for future experiments. We are also involved in distributed computing for LHC experiments, hosting one of the largest and most successful Tier-2 distributed computing centres in the UK.

List of proposals
umanitoba

The University of Manitoba is a Canadian public research university in the province of Manitoba, located on original lands of Anishinaabeg, Cree, Oji-Cree, Dakota, and Dene peoples, and on the homeland of the Métis Nation.

List of proposals
UMass

UMass Amherst is one of the major public research universities in America.

List of proposals
UNIGE

The Université de Genève (Switzerland) and its faculty of science is heavily invested in fundamental research and technology. Its department of particle physics (DPNC) studies the fundamental structure and laws of nature from the largest dimensions in the Universe to the smallest building blocks of matter. It has a strong and long-standing commitment to the CERN LHC experimental program.

List of proposals

Summary

Full list of Proposal Ideas

Full list of Mentors


Contacts

HSF GSoC mentor discussion e-group

HSF GSoC administrators: Benedikt Hegner