How to deploy Whisper Web on Minikube?

This session dives into deploying the open-source Whisper Web—a browser-based ML speech recognition tool—on Minikube. Attendees will learn to containerize the React/Node.js frontend and PyTorch-backed Whisper model using Docker, optimize images via multi-stage builds (reducing size by 97%), and configure Kubernetes deployments/services for scalability. The demo showcases Minikube cluster setup, GPU-accelerated inference, and handling challenges like proxy configurations and offline image mirroring. Practical takeaways include YAML best practices, horizontal scaling, and leveraging Kubernetes for local development. Ideal for developers exploring GenAI deployment, the session bridges cloud-native principles with real-world AI application workflows, empowering teams to adopt portable, cost-efficient solutions without compromising performance.

Wentao Liu

Manager of omfoss.com

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

How to deploy Whisper Web on Minikube?

Wentao Liu

Links

Actions