Session

The Intent-Driven Data Architect: Using LLMs to Generate Type-Safe Synthetic Test Beds

In the era of "Intent-Driven" data platforms, we no longer just build pipelines; we build systems that respond to natural language queries. But how do we test these complex, cognitive systems without compromising sensitive production data? Enter the LLM Data Architect. This session explores a modern Pythonic workflow for generating high-fidelity, schema-validated JSON datasets on demand.

I will demonstrate how to use Pydantic to define a rigorous "data contract" and leverage Instructor or Outlines to force LLMs into producing perfectly structured, type-safe synthetic records. Attendees will walk away with a blueprint for a self-validating data generator that handles complex business logic and "long-tail" edge cases, ensuring your cloud-native apps are robust before the first user ever signs in.

Varun Joshi

Senior Data Engineer at AWS

Seattle, Washington, United States

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top