Going Multi-Modal: Integrating Voice, Text and Images in Azure AI RAG Applications

Multimedia processing is a very exciting and forward-looking area in Generative AI. As an engineer, I was keen to understand how to build more sophisticated, feature-rich and user-friendly applications by integrating different data types - images, audio and text. We will take a look into business use-cases that require multimedia processing in RAG applications and how they could be supported by Azure AI toolset.

Zahhar Kirillov

EPAM Switzerland, Delivery Manager

Schaffhausen, Switzerland

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Going Multi-Modal: Integrating Voice, Text and Images in Azure AI RAG Applications

Zahhar Kirillov

Links

Actions