Best practices in building and scaling ML APIs

This talk will provide an overview of best practices in designing and scaling both synchronous and asynchronous machine learning (ML) APIs. This includes optimizing the APIs to deliver the best performance from a latency, accuracy, and cost perspective. The talk will focus on key checklist items before the handoff between from ML teams to engineering, scaling in production including infrastructure management, monitoring/troubleshooting of issues, and ML OPS. Lastly, key learnings from updating ML APIs in production while minimizing user impact will be shared

Shankar Krishnan

AWS, Product Manager - AI/ML

Boston, Massachusetts, United States

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Best practices in building and scaling ML APIs

Shankar Krishnan

Links

Actions