AI in the browser: run your model locally with WebLLM and Angular

What if your AI model could run directly in the browser, with no server calls, no API tokens, and no network latency? WebLLM makes this possible: it runs quantized language models in the browser on top of WebGPU, without a single token leaving the user's machine.
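Because everything depends on WebGPU, a page would normally check for it before downloading any model weights. The following is a minimal sketch of such a check, not code from the talk. The helper names `hasWebGPUApi` and `canUseWebGPU` and the `NavigatorLike` and `GpuLike` types are hypothetical; they mirror only the standard `navigator.gpu.requestAdapter()` entry point so the sketch compiles without extra typings.

```typescript
// Minimal structural types (hypothetical names) mirroring the part of the
// WebGPU API used here, so no @webgpu/types dependency is required.
interface GpuLike {
  requestAdapter(): Promise<object | null>;
}

interface NavigatorLike {
  gpu?: GpuLike;
}

// True when the browser exposes the WebGPU entry point at all.
function hasWebGPUApi(nav: NavigatorLike): boolean {
  return nav.gpu !== undefined;
}

// True only when a GPU adapter can actually be obtained.
// requestAdapter() resolves to null when no suitable adapter exists.
async function canUseWebGPU(nav: NavigatorLike): Promise<boolean> {
  if (nav.gpu === undefined) {
    return false;
  }
  try {
    const adapter = await nav.gpu.requestAdapter();
    return adapter !== null && adapter !== undefined;
  } catch {
    // Treat any failure while probing as "not available".
    return false;
  }
}
```

In a browser you would pass the global `navigator` (cast to `NavigatorLike` if your DOM typings do not declare `gpu`) and fall back to a non-AI experience when the result is `false`.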

In this talk I'll share how I integrated it into Angular, starting from an experiment and ending up with a working game whose NPCs are powered by a local LLM.

Together we'll explore how WebLLM works, how it fits into an Angular application, and the strategies I used to work around its limits and get the most out of it.
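To give a rough idea of what an LLM-driven NPC involves, here is a small sketch. It is an illustration under stated assumptions, not the implementation shown in the talk. It assumes the engine exposes an OpenAI-style `chat.completions.create({ messages })` call, which is, to my understanding, the shape WebLLM's engine offers. `ChatEngineLike`, `buildNpcMessages`, and `askNpc` are hypothetical names, and trimming the history to the most recent messages is just one plausible way to cope with the small context window of an in-browser model.

```typescript
type Role = "system" | "user" | "assistant";

interface ChatMessage {
  role: Role;
  content: string;
}

// Hypothetical minimal shape of an OpenAI-style chat engine (an assumption).
interface ChatEngineLike {
  chat: {
    completions: {
      create(request: { messages: ChatMessage[] }): Promise<{
        choices: { message: { content: string | null } }[];
      }>;
    };
  };
}

// Builds the prompt: persona first, then only the most recent history
// messages, then the player's new line.
function buildNpcMessages(
  persona: string,
  history: ChatMessage[],
  playerLine: string,
  maxHistory = 6,
): ChatMessage[] {
  // slice(-0) would return the whole array, so guard the zero case.
  const recent = maxHistory > 0 ? history.slice(-maxHistory) : [];
  return [
    { role: "system", content: persona },
    ...recent,
    { role: "user", content: playerLine },
  ];
}

// Sends one player line to the engine and records the exchange in history.
async function askNpc(
  engine: ChatEngineLike,
  persona: string,
  history: ChatMessage[],
  playerLine: string,
): Promise<string> {
  const messages = buildNpcMessages(persona, history, playerLine);
  const reply = await engine.chat.completions.create({ messages });
  const text = reply.choices[0]?.message.content ?? "";
  history.push(
    { role: "user", content: playerLine },
    { role: "assistant", content: text },
  );
  return text;
}
```

In an Angular app, a helper like `askNpc` would typically live in an injectable service that owns the engine instance, so components only deal with player input and NPC replies.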

Davide Passafaro

Google Developer Expert in Angular ❮ ❯ | Senior Software Engineer 💻📱 | GDG Roma Città Organizer 📣

Rome, Italy