Session

Elevating Generative AI Projects to Production: Tools and Best Practices to deliver customer value

Transitioning a generative AI (genAI) project from a simple Proof of Concept (PoC) or service demo to a robust, production-ready solution is a complex process fraught with numerous challenges. This talk aims to guide attendees through the intricacies of scaling genAI projects, highlighting common pitfalls and strategic solutions.

In this talk, we introduce a comprehensive architectural blueprint for genAI projects, covering essential aspects such as prompt and guardrail versioning, conversation management, vector database selection, and model evaluation, utilizing Amazon Bedrock’s Evaluation and Guardrails.

A highlight of this talk is a case study demonstrating the implementation of an End-to-End Retrieval-Augmented Generation (RAG) system integrated with Bedrock Knowledge with prompt versioning, knowledge-optimized chunking, conversation history, and knowledge base augmentation through agents as well as vector database and models selection.

Luca Bianchi

Chief Technology Officer @ Neosperience Spa, AWS Serverless Hero, AWS re:Invent 2022 speaker, ServerlessDays Italy and Serverless Meetup Italy co-organizer

Milan, Italy

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top