Running Sovereign AI on the Edge: From Fine-Tuning to Device Execution with Open Source LLMs

As AI becomes increasingly embedded in our daily lives and critical systems, the need for data sovereignty and compute efficiency is more urgent than ever. Cloud-based AI models, while powerful, often raise concerns about privacy, control, and long-term sustainability — especially in regions with strict data regulations or limited internet access.

In this talk, I’ll demonstrate how developers can take back control by fine-tuning and deploying small yet capable language models (like Qwen3) completely offline, using fully open-source tools. We’ll cover the full pipeline — from fine-tuning on CPU-based hardware to exporting optimized GGUF models and running them on edge devices using llama.cpp. I'll also showcase a real-world use case where these models are integrated into an Android app to perform tool-calling tasks (e.g., setting alarms, sending WhatsApp messages) without ever needing to touch the cloud.

C Sarath Babu

Security Researcher | Quantum Mechanics & Philosophy Aficionado

Bengaluru, India

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Running Sovereign AI on the Edge: From Fine-Tuning to Device Execution with Open Source LLMs

C Sarath Babu

Links

Actions