Minimizing GPU Cost for Your Deep Learning on Kubernetes - Kai Zhang & Yang Che, Alibaba
Stop Wasting GPU Flops on Cold Starts: High Performance Inference with Model Streamer - AI Eng Paris
DEMO: How to auto scale GPU nodes in Kubernetes cluster based on usage
GPU Sharing for Machine Learning Workload on Kubernetes - Henry Zhang & Yang Yu, VMware
USENIX ATC '22 - Serving Heterogeneous Machine Learning Models on Multi-GPU Servers...
Using Kubernetes to Offer Scalable Deep Learning on Alibaba Cloud - Kai Zhang & Yang Che, Alibaba
Scaling AI Inference Workloads with GPUs and Kubernetes - Renaud Gaubert & Ryan Olson, NVIDIA
Reduce GPU Costs for AI
Kubernetes Cost Optimization That You Do NOT Know
GKE Cost Optimization Golden Signals: Workload Rightsizing
Kubernetes For GPU Powered Machine Learning Workloads In... - Camille Rodriguez & John-Paul Robinson
AI workloads on Kubernetes - How to maximize GPU utilization and cut costs
Do All Your AI Workloads Actually Require Expensive GPUs?
Production GPU Cluster with K8s for AI and DL Workloads - Madhukar Korupolu, NVIDIA
DISTRBUTED DEEP LEARNING part1: Detailed Tutorial to Setup GPU Enabled Kuberntes Cluster on Ubuntu
Lightning Talk: Managing Drivers in a Kubernetes Cluster - Renaud Gaubert, NVIDIA
The Path to GPU as a Service in Kubernetes - Renaud Gaubert, NVIDIA (Intermediate Skill Level)
The Path to GPU as a Service in Kubernetes
A Deep Dive on Supporting Multi-Instance GPUs in Containers and Kubernetes - Kevin Klues, NVIDIA
Deploy and Scale AI Workloads with NVIDIA Run:ai on Azure Kubernetes Service (AKS)