Session

Exploit generative AI as a Service with Cloud Run and GPUs!

Generative AI models often require significant computational power, even during inference.
Let’s explore how to use the power of GPUs in a fully serverless environment using Google Cloud Run.
We’ll look at how to deploy and scale LLM efficiently and cost-effectively using a combination of serverless and GPU technologies to accelerate both the development and deployment phase of models at scale.

Nicola Guglielmi

GDE Cloud • Google Cloud Architect • Google Cloud Authorized Trainer • Team Manager • GDG Community Lead 🚀

Campobasso, Italy

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top