Session
Developing with Foundry Local: An On-Device AI Solution
Looking to cut the cost of AI usage in your application? This session shows how the Foundry Local SDK shifts that cost onto the end user's device by running models locally.
Foundry Local provides an easy-to-use SDK (C#, JavaScript, Rust, and Python), a curated catalog of optimized models, and automatic hardware acceleration, all in a lightweight package.
User data never leaves the device, responses start immediately with zero network latency, and your app works offline. There are no per-token costs and no backend infrastructure to maintain.
The catalog covers chat completions (for example, GPT OSS, Qwen, DeepSeek, Mistral, and Phi) and audio transcription (for example, Whisper). Every model goes through extensive quantization and compression to deliver the best balance of quality and performance.
Muhammad Suzaril Shah bin Zakaria
Senior IT Systems and Customer Engineer at Swift
Kuala Lumpur, Malaysia