ML Project
The project aims to offer a centralized service for managing the full machine learning lifecycle:
- Data Extraction and Preparation
- Interactive analysis and iteration
- Distributed Training, Hyperparameter Optimization
- Model Storage, Versioning and Serving
It also offers access to a large pool of accelerator resources such as GPUs, TPUs, IPUs and FPGAs, both on premises and external.
Meetings
We meet every two weeks for sprints and discussion.
- Every second Monday at 11am: events here
- Kanban Board
Communication
Milestones
Major
Ongoing Work
Presentations
Internal
- CMS Group Meeting, Jun 24 2022: gnn-jec-kubeflow.pdf
- CMS Knowledge Group Meeting, Feb 28 2022: Kubeflow at CERN
- ATLAS CAT Physics Weekly, Feb 09 2022: ML Lifecycle in CERN IT
- IT Technical Forum, Nov 19 2021: Centralized Management of Your Machine Learning Lifecycle: Preparation, Training and Model Serving
- Joint AMG and WFMS Meeting on Analysis Facilities, Sep 24 2021: Kubeflow Overview
- Machine Learning Coffees, Oct 16 2020: Making ML easier with Kubeflow
- DUNE, Oct 02 2020: Kubeflow Overview and Demo
External
- Google Summer of Code 2022, Jul 29 2022: Geant4-FastSim - Building an ML pipeline for Fast Shower Simulation
- Google Summer of Code 2022, Jul 27 2022: Geant4-FastSim - Memory Footprint Optimization for ML Fast Shower Simulation
- KubeCon Europe 2022, May 19 2022: Jet Energy Corrections with GNN Regression using Kubeflow @ CERN
- Kubernetes AI Day North America, Oct 12 2021: A Better and More Efficient ML Experience for CERN Users
- KubeCon Europe 2021, May 4-7 2021: Building and Managing a Centralized ML Platform with Kubeflow at CERN
- Fast Machine Learning for Science Workshop, Dec 01 2020: Making ML Easier with Kubeflow
- 25th International Conference on Computing in High-Energy and Nuclear Physics, May 17-21 2020: Training and Serving ML workloads with Kubeflow at CERN