Scaling Monolingual NLP Models on Kubernetes: Leveraging Trans-tokenization

Building and deploying monolingual NLP systems for low-resource languages presents challenges, especially in handling diverse scripts and optimizing for production-scale environments. This session explores trans-tokenization, a novel method for transforming tokens across languages, enhancing large language models for monolingual capabilities. Using parallel corpora like English-Hindi, we’ll demonstrate how tools such as Unsloth and Mistral enable fine-tuning to handle non-Latin scripts effectively. A major focus will be on leveraging Kubernetes to scale monolingual NLP systems. Attendees will learn how Kubernetes facilitates resource allocation, supports distributed training, and simplifies model deployment at scale. Topics include managing workloads for parallel corpora, optimizing GPU utilization, and ensuring high availability of NLP services in production environments.

Suvrakamal Das

Software Engineer @Mattoboard

San Francisco, California, United States

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Scaling Monolingual NLP Models on Kubernetes: Leveraging Trans-tokenization

Suvrakamal Das

Links

Actions