Session

Developing with Foundry Local: An On-Device AI Solution

Looking to cut the cost of AI usage in your application? This session shows how to shift that cost onto the end user's device by running models locally with the Foundry Local SDK.

Foundry Local provides an easy-to-use SDK (C#, JavaScript, Rust, and Python), a curated catalog of optimized models, and automatic hardware acceleration, all in a lightweight package.

User data never leaves the device, responses start immediately with zero network latency, and your app works offline. There are no per-token costs and no backend infrastructure to maintain.

The catalog covers chat completions (for example, GPT OSS, Qwen, DeepSeek, Mistral, and Phi) and audio transcription (for example, Whisper). Every model goes through extensive quantization and compression to deliver the best balance of quality and performance.

Muhammad Suzaril Shah bin Zakaria

Senior IT Systems and Customer Engineer at Swift

Kuala Lumpur, Malaysia
