Session
Best practices in building and scaling ML APIs
This talk will provide an overview of best practices in designing and scaling both synchronous and asynchronous machine learning (ML) APIs. This includes optimizing the APIs to deliver the best performance from a latency, accuracy, and cost perspective. The talk will focus on key checklist items before the handoff between from ML teams to engineering, scaling in production including infrastructure management, monitoring/troubleshooting of issues, and ML OPS. Lastly, key learnings from updating ML APIs in production while minimizing user impact will be shared
Shankar Krishnan
AWS, Product Manager - AI/ML
Boston, Massachusetts, United States
Links
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top