Session
AI in the browser: run your model locally with WebLLM and Angular
What if your AI model could run directly in the browser, with no server calls, no tokens, no network latency? WebLLM makes this possible: it runs quantized language models directly in the browser, leveraging WebGPU, without a single token leaving the user's machine.
In this talk I'll share how I integrated it in Angular, starting from an experiment and ending up with a working game with NPC powered by a local LLM.
We'll explore together how WebLLM works, how I integrated it in Angular, and the strategies I used to work around its limits and get the most out of it.
Davide Passafaro
Google Developer Expert in Angular ❮ ❯ | Senior Software Engineer 💻📱 | GDG Roma Città Organizer 📣
Rome, Italy
Links
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top