CERN Google Summer of Code 2020

Introduction

Google Summer of Code is a program that allows students to contribute to development of open-source projects, mentored by participating organizations.

Particle physics is an exciting field where large collaborations of scientists collect and analyze petabytes of data from high-energy physics experiments, such as the Large Hadron Collider, hosted at the CERN laboratory in Geneva, Switzerland. Some of the questions that we collectively ask are:

To answer these questions, particle physicists build software to simulate and analyze what happens in particle physics detectors.

The CERN software for experiments (CERN EP-SFT) group has participated in the GSoC since 2011. Since 2017 the program has expanded to involve the high-energy physics community under the umbrella of the HEP Software Foundation.

Information from last year’s GSoC can be found here.

GSoC 2020

In 2020 HSF is participating in the program as a GSoC umbrella organization. Project proposals are listed below.

N.B. This page archives the proposals from the 2020 Google Summer of Code. For information on the latest GSoC please see here.

For Students

If you are interested in the GSoC program, please start by having a look at our guidelines for students.

New projects for HSF GSoC in 2020 were published in February. The application period opened in mid-March, but discussions about application ideas with mentors took place the month before. You can monitor our website in order to get acquainted with the projects you are interested in and to be prepared for submitting an application. Meanwhile you may want to take a look at last year’s HSF GSoC projects since many of those may propose subjects also this year.

For further information, feel free to contact the HSF GSoC admins or join our new chat room

hsf-gsoc-admin@googlegroups.com

Gitter

For Projects and Mentors

Instructions for participating projects and mentors can be found here.

We have a Google group where we handle all future announcements related to GSoC for our HSF Umbrella Organization. Please write us a mail to hsf-gsoc-admin@googlegroups.com in case you are interested to add your preferred mail to this list. Note that all 2020 HSF GSoC mentors will have to join this group.

For new HEP-related groups wishing to join HSF GSoC umbrella rather than being independent organizations: there is no formal procedure, you will just need to add a description and logo for your organization. We need however one contact person for each new group, so please write us an e-mail.

Projects in 2020

ALICE

ALICE is one of the four major experiments at the Large Hadron Collider (LHC) at CERN. ALICE is optimized to study collisions of nuclei at the ultra-relativistic energies provided by the LHC.

List of proposals
ATLAS

ATLAS is one of the four major experiments at the Large Hadron Collider (LHC) at CERN. It is a general-purpose particle physics experiment run by an international collaboration and is designed to exploit the full discovery potential and the huge range of physics opportunities that the LHC provides.

List of proposals
AstroLab

AstroLab Software is a project from IJCLab aiming at providing advanced software tools to overcome modern science challenges faced by research groups, and allow research communities to more fully exploit the big data ecosystem tools.

List of proposals
CMS

CMS is a high-energy physics experiment at the Large Hadron Collider (LHC) at CERN. It is a general-purpose detector that is designed to observe any new physics phenomena that the LHC might reveal. CMS acts as a giant, high-speed camera, taking 3D “photographs” of particle collisions from all directions up to 40 million times each second. The CMS collects few tens of Peta-Bytes of data each year and processes them through Worldwide LHC Computing Grid infrastructure around the globe.

List of proposals
CMSSW

CMS is a particle detector that is designed to see a wide range of particles and phenomena produced in high-energy collisions in the LHC. Like a cylindrical onion, different layers of detectors measure the different particles, and use this key data to build up a picture of events at the heart of the collision. Many of CMS Software components (CMSSW) are hosted on Github. You can find here a few FAQ and tutorials on how develop in CMSSW.

List of proposals
Ceph

Ceph is a storage platform. It implements object storage on a single distributed computer cluster, and provides interfaces for object-, block- and file-level storage. It can be used to build cloud infrastructure and web-scale object storage. Ceph aims primarily for completely distributed operation without a single point of failure, that can be run on commodity hardware and is scalable to the exabyte level.

List of proposals
CernVM-FS

The CernVM-File System (CernVM-FS) is a global, read-only POSIX file system that provides the universal namespace /cvmfs. It is based on content-addressable storage, Merkle trees, and HTTP data transport. CernVM-FS provides a mission critical infrastructure to small and large HEP collaborations.

List of proposals
DEEPLENSE

DeepLense is a deep learning pipeline for particle dark matter searches with strong gravitational lensing.

List of proposals
E2E

The End-to-End Deep Learning (E2E) project focuses on the development of particle and event reconstruction and identification tasks with end-to-end deep learning approaches.

List of proposals
FALCON

Falcon is an ultra-fast non-parametric detector simulation package that automatically abstracts detector response, usually done by hand in fast-simulators used by particle physics experiments.

List of proposals
Ganga

Ganga is a computational task-management tool, which allows for the specification, submission, bookkeeping and post-processing of computational tasks on a wide set of distributed resources. Ganga has been developed to solve a problem increasingly common in scientific projects, which is that researchers must regularly switch between different processing systems, each with its own command set, to complete their computational tasks. Ganga provides a homogeneous environment for processing data on heterogeneous resources.

List of proposals
IRIS-HEP

IRIS-HEP aims to develop the state-of-the-art software cyberinfrastructure required for the challenges of data intensive scientific research at the High Luminosity Large Hadron Collider (HL-LHC) at CERN, and other planned HEP experiments of the 2020’s.

List of proposals
MCnet

MCnet is a collaboration between the developers of the major collider event modelling tools used at the Large Hadron Collider (LHC) and beyond. These tools are so-called “MC event generators”, theoretical physics packages that evaluate and sample from quantum mechanical distributions to create simulated particle collisions. MC generators are an absolutely essential component in both modern particle physics experiments and theory, and provide a crucial bridge between those two communities. MCnet projects include both event generators themselves, and tools that connect them to both experiment and theory.

List of proposals
MadGraph5

MadGraph5 is a framework that aims to simulate high Energy collisions for various accelerators and in particular for LHC physics. It has a huge focus on physics beyond the “Standard Model” allowing experiments to search for various scenarios in their experiments. MadGraph5 is highly used by both theorists and experimentalists making the code one of the most cited papers every year. MadGraph5 is a meta-code written in python that generates specialised code in low-level language (Fortran and C) to produce very fast code.

List of proposals
Phoenix

Phoenix is an experiment-agnostic library for the visualisation of detector geometry and event data in a browser.

List of proposals
QMLHEP

With this project we seek to implement Quantum Machine Learning methods for LHC High Energy Physics analysis based on Google OpenFermion framework.

List of proposals
ROOT

A modular scientific software framework. It provides all the functionalities needed to deal with big data processing, statistical analysis, visualisation and storage. It is mainly written in C++ but integrated with other languages such as Python and R.

List of proposals
Rucio

Rucio is an open-source software framework that provides functionality to scientific collaborations to organize, manage, monitor, and access their distributed data and dataflows across heterogeneous infrastructures. Rucio was originally developed to meet the requirements of the high-energy physics experiment ATLAS, and is continuously enhanced to support diverse scientific communities.

List of proposals
SWAN

SWAN is a CERN service that allows users to perform interactive data analysis in the cloud, built upon the widely-used Jupyter notebooks and CERN technologies for storage and software access.

List of proposals
TMVA

Toolkit for Multivariate Analysis (TMVA) is a multi-purpose machine learning toolkit integrated into the ROOT scientific software framework, used in many particle physics data analyses and applications.

List of proposals
VecGeom

A vectorized high-performance library usable for fast and accurate geometry computations in very complex setups.

List of proposals
XROOTD

XRootD software framework is a fully generic suite for fast, low latency and scalable data access, which can serve natively any kind of data, organized as a hierarchical filesystem-like namespace, based on the concept of directory. As a general rule, particular emphasis has been put in the quality of the core software parts.

List of proposals

Participating Organizations in 2020

Alabama

The University of Alabama is a public research university in Tuscaloosa, Alabama. Established in 1820, the University of Alabama is the oldest and largest of the public universities in Alabama as well as the flagship of the University of Alabama System

List of proposals
Brown

Brown University is a private Ivy League research university in Providence, Rhode Island, founded in 1764.

List of proposals
CC-IN2P3

IN2P3 Computing Centre (CC-IN2P3) is a CNRS support and research unit associated to IN2P3, the coordinating body for nuclear, particle, and astroparticle physics research. As a national research infrastructure, the CC-IN2P3 designs and manages a large range of computer-related services, especially a mass storage system and mass data resources.

List of proposals
CERN

At CERN, the European Organization for Nuclear Research, physicists and engineers are probing the fundamental structure of the universe. They use the world’s largest and most complex scientific instruments to study the basic constituents of matter – the fundamental particles.

List of proposals
Cornell

Cornell University is a private and statutory Ivy League research university in Ithaca, New York. Founded in 1865, the university was intended to teach and make contributions in all fields of knowledge—from the classics to the sciences, and from the theoretical to the applied. These ideals, unconventional for the time, are captured in Cornell’s founding principle, a popular 1868 Ezra Cornell quotation: “I would found an institution where any person can find instruction in any study.”

List of proposals
CROSS

The Center for Research in Open Source Software (www.cross.ucsc.edu), part of the Baskin School of Engineering at UC Santa Cruz, was founded based on the success of former UCSC student Sage Weil and the Ceph storage system. At CROSS, we create open source leaders by blending open source software strategies with graduate-level research. CROSS turns OSS prototypes into successful open source software projects.

List of proposals
Florida

University of Florida is a public institution that was founded in 1853.

List of proposals
FNAL

Fermi National Accelerator Laboratory (FNAL) is a Department of Energy national laboratory dedicated to particle physics research. Fermilab supports work by scientists, from across the country and the globe, who seek to further our understanding of matter, energy, space and time.

List of proposals
FSU

Florida State University is a public institution that was founded in 1851.

List of proposals
IJCLab

IJCLab is a French research laboratory belonging to CNRS/IN2P3 and located at Université Paris Saclay. The main topics of the research done at IJCLab are high energy physics, cosmology and accelerators.

List of proposals
IllinoisUC

University of Illinois at Urbana-Champaign is a public institution that was founded in 1867.

List of proposals
ImperialCollege

Imperial College London is a world top ten university with an international reputation for excellence in teaching and research. Consistently rated amongst the world’s best universities, Imperial is committed to developing the next generation of researchers, scientists and academics through collaboration across disciplines. Located in the heart of London, Imperial is a multidisciplinary space for education, research, translation and commercialisation, harnessing science and innovation to tackle global challenges.

List of proposals
INFN

The Istituto Nazionale di Fisica Nucleare (INFN; “National Institute for Nuclear Physics”) is the coordinating institution for nuclear, particle and astroparticle physics in Italy.

List of proposals
JHUAPL

The Johns Hopkins University Applied Physics Laboratory (APL) has provided critical contributions to critical challenges with systems engineering and integration, technology research and development, and analysis.

List of proposals
KIT

The Karlsruhe Institute of Technology is a public research university and one of the largest research and education institutions in Germany.

List of proposals
LPC-Clermont

Founded in 1958, Laboratoire de Physique de Clermont is a government-funded mixed research unit (CNRS/IN2P3 and University Clermont Auvergne), in Auvergne, France.

List of proposals
MIR

The Morgridge Institute for Research is a private, nonprofit biomedical research institute in Madison, Wis., affiliated with the University of Wisconsin–Madison. The institute works to improve human health by conducting, enabling and translating interdisciplinary biomedical research. Research in regenerative biology, virology, medical devices and core computational technology is currently underway.

List of proposals
MonashUniversity

Monash University Monash University is one of Australia’s leading universities and ranks among the world’s top 100. We help change lives through research and education.

List of proposals
OProject

OProject Open source organization, specialized in development of advanced scientific software with ROOT, focused mathematical/statistical tools, machine learning and high performance computing.

List of proposals
Pittsburgh

The University of Pittsburgh is one of the oldest institutions of higher education in the United States.

List of proposals
princeton

Princeton University is a private Ivy League research university in Princeton, New Jersey.

List of proposals
RAL

STFC Rutherford Appleton Laboratory in Oxfordshire, United Kingdom is home to national facilities including the Diamond synchrotron and ISIS neutron spallation source. The Particle Physics Department contributes to experiments including ATLAS, CMS, LHCb and T2K.

List of proposals
SLAC

SLAC National Accelerator Laboratory, originally named Stanford Linear Accelerator Center is a United States Department of Energy National Laboratory operated by Stanford University under the programmatic direction of the U.S. Department of Energy Office of Science and located in Menlo Park, California. It is the site of the Stanford Linear Accelerator, a 3.2 kilometer (2 mile) linear accelerator constructed in 1966 and shut down in the 2000s, which could accelerate electrons to energies of 50 GeV. Today SLAC research centers on a broad program in atomic and solid-state physics, chemistry, biology, and medicine using X-rays from synchrotron radiation and a free-electron laser as well as experimental and theoretical research in elementary particle physics, astroparticle physics, and cosmology.

List of proposals
UCL

Founded in 1826 in the heart of London, University College London is London’s leading multidisciplinary university, with more than 13,000 staff and 42,000 students from 150 different countries. It is one of the UK’s leading groups in the field of particle physics.

List of proposals
UDur

Durham University, founded in 1832, enjoys a global reputation for its strengths in both teaching and research as evidenced by its World Top 100 position in the QS World University Rankings 2020 (78th). The University is home to over 18,000 students from over 120 countries and employs over 4000 staff. It hosts some of the UK’s leading groups in the fields of Particle Physics Phenomenology IPPP and Computational Cosmology ICC.

List of proposals
UGlasgow

The University of Glasgow is a leading UK research university based in Scotland’s largest city. We work on particle physics experiments from the LHC to neutrinos, particle theory from interpretations of Higgs and top-quark measurements to the strong force, and high-performance distributed computing.

List of proposals
UMass

UMass Amherst is one of the major public research universities in America.

List of proposals
UNL

The University of Nebraska–Lincoln, often referred to as Nebraska, UNL or NU, is a public research university in the city of Lincoln, in the state of Nebraska in the Midwestern United States. It is the state’s oldest university, and the largest in the University of Nebraska system.

List of proposals
uwuppertal

The University of Wuppertal is a German scientific institution, located in Wuppertal, in the state of North Rhine-Westphalia. The university’s official name in German is Bergische Universität Wuppertal, or BUW. It was founded in 1972. In 2014/15, approx. 20,000 students were enrolled in a wide range of subjects with many interdisciplinary linkages between a total of 7 faculties.

List of proposals
Wisconsin

The University of Wisconsin-Madison is a public research university in Madison, Wisconsin. Since its founding in 1848, this campus has been a catalyst for the extraordinary.

List of proposals

Summary

Full list of Proposal Ideas

Full list of Mentors

Although GSoC 2020 has finished, please do not hesitate to contact the current admins if you are interested in the latest GSoC program using

hsf-gsoc-admin@googlegroups.com


Contacts

HSF GSoC mentor discussion e-group

HSF GSoC administrators: Andrei Gheata, Antoine Pérus and Xavier Valls