Production GPU Cluster with K8s for AI and DL Workloads - Madhukar Korupolu, NVIDIA
Production Multi-node Jobs with Gang Scheduling, K8s, GPUs... Madhukar Korupolu & Sanjay Chatterjee
Building Distributed TensorFlow Using Both GPU and CPU on Kubernetes [I] - Zeyu Zheng
Building a GPU cluster for AI
GPU Sharing for Machine Learning Workload on Kubernetes - Henry Zhang & Yang Yu, VMware
Scaling AI Inference Workloads with GPUs and Kubernetes - Renaud Gaubert & Ryan Olson, NVIDIA
Kubernetes For GPU Powered Machine Learning Workloads In... - Camille Rodriguez & John-Paul Robinson
Fair Scheduling for Deep Learning Workloads in Kubernetes - Yodar Shafrir, Run:AI
Minimizing GPU Cost for Your Deep Learning on Kubernetes - Kai Zhang & Yang Che, Alibaba
AI workloads on Kubernetes - How to maximize GPU utilization and cut costs
"AI Cluster Trends" - Robert Ober
Monitoring GPUs at Scale for AI/ML and HPC Clusters - Bharti L Agrawal, NVIDIA
GPUs: Explained
Kubernetes VMware User Group: Using GPUs with K8s on vSphere - Steven Wong & Myles Gray, VMware
Building GPU-Accelerated Workflows with TensorFlow and Kubernetes [I] - Daniel Whitenack
Co-Location of CPU and GPU Workloads with High Resource Efficiency - Penghao Cen & Jian He
Lightning Talk: Managing Drivers in a Kubernetes Cluster - Renaud Gaubert, NVIDIA
Networking Optimizations for Multi-Node Deep Learning on Kubernetes - Rajat Chopra & Erez Cohen
Scale and Accelerate the Distributed Model Training in Kubernetes Cluster
DEMO: How to auto scale GPU nodes in Kubernetes cluster based on usage