Session
Structured Data from Images: Deploy a Gemini-Powered OCR App on Cloud Run
Want to turn images into valuable data? This workshop guides you through building a multimodal OCR processor using Gemini. You'll learn to:
Extract text and structured data from images with Gemini.
Develop an interactive frontend with Streamlit/Mesop.
Define structured output with Pydantic.
Deploy your app on Cloud Run and store data in BigQuery. Gain practical skills in building and deploying AI-powered data applications.
This workshop empowers you to build a practical, AI-driven OCR application using Gemini. Learn to extract text and structured data from images, create an interactive frontend, define structured outputs, and deploy your application on Cloud Run while storing data in BigQuery. Gain hands-on experience in developing and deploying multimodal AI solutions for real-world data extraction.

Sijohn Mathew
Public Speaker, Co-Founder & Head of Product Innovation @ AclarityTech, Cloud Architect specialized in App Development and Modernisation with a touch of AI
Stockholm, Sweden
Links
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top