Session

Structured Data from Images: Deploy a Gemini-Powered OCR App on Cloud Run

Want to turn images into valuable data? This workshop guides you through building a multimodal OCR processor using Gemini. You'll learn to:

Extract text and structured data from images with Gemini.
Develop an interactive frontend with Streamlit/Mesop.
Define structured output with Pydantic.
Deploy your app on Cloud Run and store data in BigQuery. Gain practical skills in building and deploying AI-powered data applications.

This workshop empowers you to build a practical, AI-driven OCR application using Gemini. Learn to extract text and structured data from images, create an interactive frontend, define structured outputs, and deploy your application on Cloud Run while storing data in BigQuery. Gain hands-on experience in developing and deploying multimodal AI solutions for real-world data extraction.

Sijohn Mathew

Public Speaker, Co-Founder & Head of Product Innovation @ AclarityTech, Cloud Architect specialized in App Development and Modernisation with a touch of AI

Stockholm, Sweden

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top