Session
Production-ready AI Platform on Kubernetes
In recent years, advances in ML/AI have made tremendous progress yet designing large-scale data science and machine learning applications still remain challenging. The variety of machine learning frameworks, hardware accelerators, cloud vendors as well as the complexity of data science workflows brings new challenges to MLOps. For example, it’s non-trivial to build an inference system that’s suitable for models of different sizes, especially for LLMs or large models in general.
This talk presents various best practices and challenges on building large, efficient, scalable, and reliable AI/ML platforms using cloud-native technologies such as Kubernetes, Kubeflow, and KServe. We will deep dive into a reference platform dedicated for modern cloud-native AI infrastructure.

Yuan Tang
Senior Principal Software Engineer at Red Hat; Project Lead at Argo, Kubeflow, and KServe
West Lafayette, Indiana, United States
Links
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top